Gene Elen_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2026 
Symbol 
ID8416337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2372777 
End bp2373988 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content61% 
IMG OID645025003 
ProductEthanolamine utilization protein EutH 
Protein accessionYP_003182379 
Protein GI257791773 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3192] Ethanolamine utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.86347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00000329746 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGATGA TCGGAACGGC CGTGGTGTAC ATCATCATGG TCTGCGCATT GGCGGGCGCC 
GTCGCATCGG CCATCAAGCC GGAGAGCGAG CTCGGCCGGC AATTCGTGGC CGGCATCGAC
TCCATCGGCC CCATCTTCCT TCCCGTAGCG GGCATCATGG CATCGGCTCC TTATCTGACG
GCGTTCGTGA GCACGGTCTT CGGGCCGGCG TACGGGGCGC TCGGCGCCGA CCCAGCCATG
GCAGCGACGA CGTTCATCGC CATCGACATG GGCGGATACC AGCTGGCGGA CGCGCTTGCG
CAGACGCGTG AGAGCTGGAT CATGGCGATG ATGACCGGGT ATATGGCGGG CGCAACCATC
GTTTTCACGA TACCGGTGGC GCTGAAGATG CTCGAGAAAC GCGATCGGAA GTACTTGGCG
CTCGGAGTGA TGAGCGGCCT TCTCGCGATT CCTATCGGCG TGCTCGTTGC GAGCATCATC
ATCGCGCTTT CGCACCCGGT GATCCGGGAG GTCGTATCGA CGAACGCCGA AGCGACCTAT
CAGCTTGCGT TGAGCTTCGC CCAGATCGGC GTCAACCTCG TGCCGCTGAT CATCATATGC
GTTGCGCTGG CATTGGGGCT CAAGTTCAAG CCCGACGCCA TGATCAAGGG GTTCATCGTG
TTCGGTCGCG TGATGGAAGC GACGCTCAAA ATCGTGTTCG TGCTGGCGGT TATCGAATAC
TTCACGGGCA TCTTCACCAC GGTCTTCGGC TCCTTCGGGT TCGATCCTAT CATCGCCGAC
GAGGAGGATA TCTTCCGAGC ACTCGAGGTG TCGGGTGCTA TCGGCATGAT GTTGTGCGGC
GCGTTTCCCA TGGTGTACCT CATCAAGCGC TATCTTGCGA AGCCACTCGC CAAAATCGGC
GGCGCGGTGG GGCTGAGCTC GGACGCAACC GCTGGCTTGC TGGCCGCCTC GGCGAACGTG
CTTGCGGCGC TGTCGATGGT GAAAGACCTC AAGGCGCGCG ACAAGGTGCT GGTCATGTCG
TTTGCCGTGT GCTGCGCATT TCTGTTCGGC GATCATCTGT CGTTCACGGC GAACTTCCAA
CCGACGCTGA TCGTGCCGGT GCTTGTAGGG AAGCTGTCAG CGGGTGTGTG TGCCGTCGTG
TTCGCAAGCT TGCTTGCCGT GAAGAAGGCT GAGGAACTCG AGCGGATCGA TAGGGCAGAA
GCCGAAAACT AG
 
Protein sequence
MEMIGTAVVY IIMVCALAGA VASAIKPESE LGRQFVAGID SIGPIFLPVA GIMASAPYLT 
AFVSTVFGPA YGALGADPAM AATTFIAIDM GGYQLADALA QTRESWIMAM MTGYMAGATI
VFTIPVALKM LEKRDRKYLA LGVMSGLLAI PIGVLVASII IALSHPVIRE VVSTNAEATY
QLALSFAQIG VNLVPLIIIC VALALGLKFK PDAMIKGFIV FGRVMEATLK IVFVLAVIEY
FTGIFTTVFG SFGFDPIIAD EEDIFRALEV SGAIGMMLCG AFPMVYLIKR YLAKPLAKIG
GAVGLSSDAT AGLLAASANV LAALSMVKDL KARDKVLVMS FAVCCAFLFG DHLSFTANFQ
PTLIVPVLVG KLSAGVCAVV FASLLAVKKA EELERIDRAE AEN