Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2819 |
Symbol | eutG |
ID | 6268593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2614502 |
End bp | 2615689 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641726771 |
Product | ethanolamine utilization protein EutG |
Protein accession | YP_001881244 |
Protein GI | 187732023 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATG AATTGCAGAC CGCGCTCTTT CAGGCGTTCG ATACCCTGAA TCTGCAACGG GTAAAAACAT TTAGCGTTCC ACCGGTGACG CTTTGCGGTC CGGGCTCGGT GAGCAGTTGC GGACAGCAAG CGCAAACGCG TGGGCTGAAA CATCTGTTCG TGATGGCAGA CAGCTTTTTG CATCAGGCAG GGATGACCGC CGGGCTGACG CGCAGCCTGG CTGTTAAAGG CATCGCCATG ACGCTCTGGC CATGTCCGGT GGGCGAACCG TGCATCACCG ACGTGTGTGC AGCCGTGGCG CAGTTGCGTG AGTCAGGCTG TGATGGGGTG ATCGCATTTG GCGGCGGCTC GGTGCTGGAT GCGGCGAAAG CCGTGGCGTT GCTGGTGACG AACCCGGATA GCACGCTGGC AGAGATGTCA GAAACCAGCG TTCTGCAACC GCGCTTGCCG CTGATTGCCA TTCCAACGAC CGCCGGAACC GGCTCTGAAA CCACCAATGT AACGGTGATT ATCGACGCGG TGAGCGGGCG CAAGCAGGTG TTAGCCCATG CCTCGCTGAT GCCGGATGTG GCGATCCTCG ACGCCGCATT GACCGAAGAT GTGCCGTCGC ATGTCACGGC GATGACCGGC ATTGATGCGT TAACCCATGC CATTGAAGCA TACAGCGCCC TGAACGCTAT ACCGCTTACC GACAGCCTGG CGATTGGCGC TATTGCGATG ATTGGCAAAT CGCTGCCGAA AGCGGTGGGC TACGGTCACG ACCTTGCCGC GCGCGAGAGC ATGTTGCTGG CTTCATGTAT GGCGGGAATG GCGTTTTCCA GTGCGGGTCT TGGGTTGTGC CACGCGATGG CGCATCAGCC GGGCGCGGCG CTGCATATTC CACACGGTCT GGCGAACGCC ATGTTGCTGC CAACGGTGAT GGAATTTAAC CGGATGGTTT GTCGTGAACG CTTTAGTCAG ATTGGTCGGG CACTGCGAAC TAAAAAATCC GACGATCGTG ACGCTATTAA CGCGGTAAGT GAGCTGATTG CGGAAGTTGG GATTGGTAAA CGACTGGGCG ATGTTGGTGC GACATCTGCG CATTACGGCG CATGGGCGCA GGCCGCGCTG GAAGATATTT GTCTGCGCAG TAACCCGCGT ACCGCCAGCC TGGAGCAGAT TGTCGGCCTG TACGCAGCGG CGCAATAA
|
Protein sequence | MQNELQTALF QAFDTLNLQR VKTFSVPPVT LCGPGSVSSC GQQAQTRGLK HLFVMADSFL HQAGMTAGLT RSLAVKGIAM TLWPCPVGEP CITDVCAAVA QLRESGCDGV IAFGGGSVLD AAKAVALLVT NPDSTLAEMS ETSVLQPRLP LIAIPTTAGT GSETTNVTVI IDAVSGRKQV LAHASLMPDV AILDAALTED VPSHVTAMTG IDALTHAIEA YSALNAIPLT DSLAIGAIAM IGKSLPKAVG YGHDLAARES MLLASCMAGM AFSSAGLGLC HAMAHQPGAA LHIPHGLANA MLLPTVMEFN RMVCRERFSQ IGRALRTKKS DDRDAINAVS ELIAEVGIGK RLGDVGATSA HYGAWAQAAL EDICLRSNPR TASLEQIVGL YAAAQ
|
| |