Gene Elen_1119 details

Gene Information

Locus tag: Elen_1119
Symbol:
ID: 8415409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1348934
End bp: 1350082
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 65%
IMG OID: 645024081
Product: periplasmic binding protein
Protein accession: YP_003181478
Protein GI: 257790872
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID:
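
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1348934, 1350082
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1149 382
```

Under those assumptions the computed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.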


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000476149
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000345093
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAGGGA CCATCGCAAG GAACATGTCC CGGGCGAAAC GCCTGCTGGT CGTAGGCGCG 
CTATGTCTGG GACTGGCGTT CGCGCTGAAC GGATGCTCGC AGGCTCCGTC GCAGAACGCA
CCCGAAAGCG CCGACGGCGC CGCGCAAACC GATCAGGCGG CGCAGAACGA GACGCGCACG
TTCACCGACT CGGCCGGACG CACGGTGGAA GTGCCCGCGA AGATCGACCG CATCGCGCCG
GCGGGCCACA CCGCCACGCA GGTCCTGCTT ACGATGGCAC CGGACAAGCT GGTGACGATC
TCGACGGAGC TCACGGCCGA CCAGGCGAAG TACCTGGGAG GCGACTACGC CAACCTGCCC
GTGACAGGAG CCGCGTTCGG AGCGAAAGGC GACCTCAACA AGGAGGCCGT CGCCGCATCC
GGTGCGCAGA TCCTCATCGA CACCGGCGAG ATCAAAGACG GCATGAAAGA GGACCTCGAC
ACGATGCAGC AGCAGCTGGG CATCCCCGTA GTGGTCATCG AGACGAAGAT GGAGGACTAC
GGCGCAGCCT ACGAGAAGCT CGGCGAGCTG TTGGGCATGG AGGATCGCGG CAAGGAGCTG
TCCGACTACT GCAAGGCCGC CTACGACGAG ACTGTATCCG TCATGAGCAA GATTCCCGAG
GGCGACCGCG CGAAGGTGGC GTACCTGCTC GGCGACAAGG GGACGAACAC CATCGCGAAG
AACTCCTACC AGGGCCAGGT GATCGACCTC GTGGCCGACA ACGTCGCCGA CCTGGGCAAG
GTGTCCGGCA GCGGCGCCGG CGTCGAGATC GGCATGGAAC AGCTGGCTAT CTGGGATCCT
GCGGTCATCC TGTTCGGCCC GGACAGCATC TACGACACGG TGGGCTCCGA TGCGGCCTTC
GCCGACCTGT CCGCCGTCAA GAACGGCTCC TACTACAAGG TGCCCGGCAC GCCGTGGAAC
TGGCTGAACA GCCCGCCGAC GGTGAACCAG GTGCTGGGCA TGCAGTGGCT GCCGCGCCTG
CTGTACCCCG AGCAGTACGA CAACGACCTG CACAAGACGG TCGCAGGCTA CTTCAAGACG
TTCTACGGCT ACGACCTGAG CGAGTCCGAG TTCAACGAGA TCGCCGCGAA CGCGCAGCCC
AAGGCATAG
 
Protein sequence
MGGTIARNMS RAKRLLVVGA LCLGLAFALN GCSQAPSQNA PESADGAAQT DQAAQNETRT 
FTDSAGRTVE VPAKIDRIAP AGHTATQVLL TMAPDKLVTI STELTADQAK YLGGDYANLP
VTGAAFGAKG DLNKEAVAAS GAQILIDTGE IKDGMKEDLD TMQQQLGIPV VVIETKMEDY
GAAYEKLGEL LGMEDRGKEL SDYCKAAYDE TVSVMSKIPE GDRAKVAYLL GDKGTNTIAK
NSYQGQVIDL VADNVADLGK VSGSGAGVEI GMEQLAIWDP AVILFGPDSI YDTVGSDAAF
ADLSAVKNGS YYKVPGTPWN WLNSPPTVNQ VLGMQWLPRL LYPEQYDNDL HKTVAGYFKT
FYGYDLSESE FNEIAANAQP KA
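
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a rough sketch, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids and stop codons and differs only in its permitted start codons. Only the first and last blocks of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Standard codon assignments (assumed to match translation table 11
# for sense and stop codons), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_block = "ATGGGAGGGA CCATCGCAAG GAACATGTCC"  # start of the gene sequence
last_block = "AAGGCATAG"                          # end of the gene sequence
print(translate(first_block))  # MGGTIARNMS
print(translate(last_block))   # KA
```

The two translated fragments match the start and end of the protein sequence listed above, with the final TAG codon acting as the stop.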