Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2599 |
Symbol | eutG |
ID | 6142888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2652625 |
End bp | 2653812 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617470 |
Product | ethanolamine utilization protein EutG |
Protein accession | YP_001744635 |
Protein GI | 170681253 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATG AATTGCAGAC CGCGCTCTTT CAGGCGTTCG ATACCCTGAA TCTGCAACGG GTAAAAACAT TTAGCGTTCC ACCGGTGACG CTTTGCGGTC CGGGCGCGGT GAGCAGTTGT GGGCAGCAAG CGCAAACGCG TGGGCTGAAA CATCTGTTCG TGATGGCAGA CAGCTTTTTG CATCAGGCAG GGATGACCGC CGGGCTGACA CGTAGCCTGG CCGTTAAAGG TATCGCCATG ACGCTCTGGC CATGTCCGGT GGGCGAACCA TGCATCACCG ACGTGTGTGC AGCCGTGGCG CAGTTGCGTG AATCAGGCTG TGACGGGGTG ATTGCGTTTG GCGGCGGCTC GGTGCTGGAT GCAGCAAAAG CCGTGGCGTT GCTGGTGACG AACCCGGATA GCACGCTGGC AGAGATGTCA GAAACCAGCG TTCTGCAACC GCGCCTGCCG TTGATTGCCA TTCCGACTAC CGCCGGAACC GGCTCTGAAA CCACCAACGT AACGGTGATT ATCGATGCGG TGAGCGGGCG TAAGCAGGTG TTAGCCCATG CCTCGCTGAT GCCGGATGTG GCGATTCTCG ACGCCGCATT GACCGAAGGT GTGCCGTCGC ATGTCACGGC GATGACTGGC ATTGATGCGT TAACCCATGC CATTGAAGCG TACAGCGCGC TGAACGCCAC ACCATTTACC GACAGCCTGG CGATTGGCGC CATTGCGATG ATTGGTCAAT CGCTGCCGAA AGCGGTGGGC TACGGTCACG ACCTTGCCGC GCGCGAGAGC ATGTTGCTGG CTTCATGTAT GGCGGGAATG GCGTTTTCCA GTGCGGGGCT TGGGTTGTGC CACGCGATGG CGCATCAGCC AGGGGCGGCG CTGCATATTC CGCACGGTCT GGCGAACGCT ATGTTGCTGC CAACGGTAAT GGAATTTAAC CGGATGGTTT GTCGTGATCG GTTTAGTCAG ATTGGCCGGG CGCTGAGAAC TAAAAAATCC GACGATCGTG ACGCTATTAA CGCGGTAAGT GAGCTGATTG CGGAAGTCGG GATTGGTAAA CGACTGGGCG ATATTGGTGC GACATCTGCG CATTACGGCG CATGGGCGCA GGCCGCGCTG GAAGATATTT GTCTGCGCAG CAACCCGCGT ACCGCCAGCC TGGAGCAGAT TATTGGTCTG TATGCTGCGG CGCAATAA
|
Protein sequence | MQNELQTALF QAFDTLNLQR VKTFSVPPVT LCGPGAVSSC GQQAQTRGLK HLFVMADSFL HQAGMTAGLT RSLAVKGIAM TLWPCPVGEP CITDVCAAVA QLRESGCDGV IAFGGGSVLD AAKAVALLVT NPDSTLAEMS ETSVLQPRLP LIAIPTTAGT GSETTNVTVI IDAVSGRKQV LAHASLMPDV AILDAALTEG VPSHVTAMTG IDALTHAIEA YSALNATPFT DSLAIGAIAM IGQSLPKAVG YGHDLAARES MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGLANA MLLPTVMEFN RMVCRDRFSQ IGRALRTKKS DDRDAINAVS ELIAEVGIGK RLGDIGATSA HYGAWAQAAL EDICLRSNPR TASLEQIIGL YAAAQ
|
| |