Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2818 |
Symbol | eutH |
ID | 6273295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2613059 |
End bp | 2614285 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726770 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_001881243 |
Protein GI | 187730214 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT AGCTGCCGTA GACAGGATCC TGTCGCAGTT CGGCGGTTCT GCTAGTTTCC TCGGTAAGTT CGGTAAAAGT ATCGAAGGAT CAGGCGGTCA GTTCGAAGAA GGCTTTATGG CAATGGGCGC TCTGGGTCTG GCGATGGTCG GTATGACCGC GCTGGCACCG GTACTGGCTC ACGTACTCGG GCCGGTAATT ATCCCGGTTT ACGAAATGCT CGGCGCAAAC CCGTCAATGT TCGCCGGAAC GCTGCTGGCG TGCGATATGG GCGGCTTCTT CCTCGCCAAA GAGCTGGCGG GCGGCGACGT AGCAGCGTGG CTATACTCTG GGTTAATTCT TGGGTCGATG ATGGGGCCAA CGATTGTGTT TTCCATTCCG GTGGCGCTCG GCATTATCGA ACCTTCTGAC CGTCGTTATC TGGCGCTCGG CGTGCTGGCG GGCATTGTGA CCATTCCGAT TGGCTGTATT GTTGGTGGTC TGGTTGCTAT GTACTCCGGT GTGCAGATCA ACGGTCAGCC GGTGGAATTC ACTTTCGCCC TGATCCTGAT GAACATGATC CCGGTGATCA TTGTTGCGAT TCTGGTGGCG CTGGGGCTGA AATTCATCCC GGAAAAAATG ATCAACGGCT TCCAGATCTT CGCCAAATTC CTCGTTGCAT TGATCACCCT CGGTCTTGCC GCTGCGGTAG TGAAATTCCT GCTTGGCTGG GAACTGATCC CCGGTCTGGA TCCTATCTTT ATGGCCCCTG GCGATAAACC CGGTGAGGTG ATGCGCGCCA TTGAAGTTAT CGGTTCTATC TCCTGCGTTC TGTTAGGGGC GTATCCGATG GTGCTGCTGC TGACTCGCTG GTTTGAAAAA ACGCTGATGA GCGTCGGTAA AGTGCTGAAT ATGAACAACA TCGCGGCAGC CGGCATGGTG GCAACGCTTG CCAACAACAT CCCGATGTTC GGCATGATGA AGCAGATGGA TACCCGCGGC AAAGTCATCA ACTGCGCCTT CGCCGTTTCT GCTGCTTTCG CCCTGGGCGA CCACTTAGGG TTCGCCGCTG CCAACATGAA TGCCATGATC TTCCCGATGA TTGTCGGCAA GTTGATCGGC GGCGTAACGG CGATTGGCGT GGCGATGATG CTGGTGCCAA AAGATGACGC GACCGCAGCT AAAACCGAAG CGGAGGCACA ATCGTGA
|
Protein sequence | MGINEIIMYI MMFFMLIAAV DRILSQFGGS ASFLGKFGKS IEGSGGQFEE GFMAMGALGL AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI VGGLVAMYSG VQINGQPVEF TFALILMNMI PVIIVAILVA LGLKFIPEKM INGFQIFAKF LVALITLGLA AAVVKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK TLMSVGKVLN MNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKDDATAA KTEAEAQS
|
| |