Gene EcSMS35_2598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2598 
SymboleutH 
ID6144711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2651229 
End bp2652455 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content55% 
IMG OID641617469 
Productethanolamine utilization protein EutH 
Protein accessionYP_001744634 
Protein GI170683597 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3192] Ethanolamine utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT AGCTGCCGTA 
GACAGGATCC TGTCGCAGTT CGGCGGTTCT GCTCGTTTCC TCGGTAAGTT CGGTAAAAGT
ATCGAAGGAT CCGGCAGTCA GTTCGAAGAA GGCTTTATGG CAATGGGCGC TTTGGGCCTG
GCGATGGTTG GTATGACCGC GCTGGCACCT GTTCTGGCCC ACGTACTCGG ACCGGTGATT
ATCCCGGTTT ACGAAATGCT CGGCGCAAAC CCGTCAATGT TCGCCGGAAC ACTGCTGGCG
TGCGATATGG GCGGCTTCTT CCTCGCCAAA GAGTTGGCGG GCGGCGACGT AGCAGCGTGG
CTATACTCTG GATTAATTCT CGGGTCGATG ATGGGGCCAA CGATTGTGTT TTCCATTCCG
GTGGCGCTCG GCATTATCGA ACCTTCTGAC CGTCGTTATC TGGCGCTCGG CGTGCTGGCG
GGCATTGTGA CCATTCCGAT TGGCTGTATT GCCGGTGGTC TGGTGGCTAT GTACTCCGGT
GTGCAGATCA ACGGTCAGCC AGTGGAATTC ACCTTTGCGC TGATCCTGAT GAACATGATC
CCGGTACTTA TCGTTGCGGT GCTGGTGGCG CTGGGGCTGA AATTCATCCC GGAAAAAATG
ATCAACGGCT TCCAGATCTT CGCCAAATTC CTCGTTGCAT TGATCACCCT CGGTCTTGCC
GCTGCGGTAG TGAAATTCCT CCTTGGCTGG GAACTGATCC CGGGCCTTGA TCCTATCTTT
ATGGCCCCTG GCGATAAACC CGGTGAAGTG ATGCGCGCCA TTGAAGTTAT CGGCTCGATC
TCCTGCGTTC TGTTAGGGGC GTATCCGATG GTGCTGCTGC TGACTCGCTG GTTTGAAAAA
CCGCTGATGA GTGTCGGTAA GGTGCTGAAT ATGAACAATA TAGCGGCAGC CGGCATGGTG
GCAACGCTTG CCAACAACAT CCCGATGTTT GGCATGATGA AGCAGATGGA TACCCGCGGC
AAAGTCATCA ACTGTGCCTT CGCCGTTTCC GCTGCTTTCG CCCTGGGCGA CCATTTAGGC
TTCGCGGCTG CCAACATGAA CGCCATGATC TTCCCGATGA TTGTCGGCAA GCTGATCGGC
GGCGTCACGG CGATTGGCGT GGCGATGATG CTGGTACCTA AAGAAGACGC GAGCGCGGCT
AAAACCGAAG CGGAGGCGCA ATCGTGA
 
Protein sequence
MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGSQFEE GFMAMGALGL 
AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW
LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLVAMYSG
VQINGQPVEF TFALILMNMI PVLIVAVLVA LGLKFIPEKM INGFQIFAKF LVALITLGLA
AAVVKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK
PLMSVGKVLN MNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG
FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKEDASAA KTEAEAQS