Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2720 |
Symbol | eutH |
ID | 6490998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 2627259 |
End bp | 2628485 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642742898 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_002046525 |
Protein GI | 194449723 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.741445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.29915 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT TGCCGCCGTG GACAGAATCC TGTCGCAGTT CGGCGGGTCG GCGCGCTTCC TCGGTAAATT CGGTAAGAGT ATCGAGGGGT CTGGCGGCCA GTTTGAAGAG GGCTTTATGG CGATGGGCGC GCTGGGGCTG GCGATGGTCG GTATGACCGC GCTGGCGCCG GTGCTGGCGC ATGTACTGGG GCCGGTTATT ATCCCGGTAT ACGAAATGCT GGGCGCGAAC CCATCCATGT TCGCGGGTAC GTTGCTGGCC TGTGATATGG GCGGATTCTT CCTCGCCAAA GAGCTGGCCG GCGGTGATGT CGCGGCGTGG CTATACTCAG GGTTAATACT TGGGTCGATG ATGGGGCCGA CCATTGTGTT CTCCATTCCG GTGGCGCTCG GCATTATCGA ACCGTCTGAC CGCCGCTACC TGGCGCTTGG CGTACTGGCG GGTATCGTCA CCATTCCCAT TGGCTGCATT GCGGGGGGAT TGATCGCCAT GTACTCAGGC GTGCAGATCA ATGGTCAGCC AGTGGAGTTT ACCTTCGCGC TGATCCTGAT GAACATGATC CCGGTATTGA TTGTCGCGGT GCTGGTAGCG CTGGGGCTGA AGTTCATCCC GGAAAAAATG ATCAACGGTT TCCAGATTTT CGCCAAATTT CTGGTGGCGC TGATCACCAT CGGTCTGGCG GCTGCGGTGA TTAAATTCCT GTTGGGCTGG GAGTTGATTC CGGGGCTCGA CCCGATCTTT ATGGCGCCAG GCGACAAACC TGGCGAAGTG ATGCGCGCCA TTGAAGTGAT CGGCTCTATC TCCTGCGTGC TGCTCGGCGC GTATCCGATG GTGCTGTTAC TGACCCGCTG GTTTGAAAAA CCGTTGATGA ATGTCGGTAA GCTGCTGAAC GTAAATAATA TTGCGGCGGC AGGCATGGTG GCGACCCTGG CGAACAATAT CCCGATGTTC GGCATGATGA AGCAGATGGA TACCCGCGGC AAAGTGATTA ACTGCGCATT TGCCGTCTCT GCGGCGTTCG CGCTGGGCGA CCATTTAGGC TTCGCCGCCG CCAACATGAA CGCCATGATC TTCCCGATGA TTGTCGGCAA GCTGATCGGC GGCGTGACAG CGATTGGCGT GGCGATGATG CTGGTGCCGA AAGATGACGC CGCCCAGGTG AAAACCGAAG CGGAGGCGCA ATCGTGA
|
Protein sequence | MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGGQFEE GFMAMGALGL AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLIAMYSG VQINGQPVEF TFALILMNMI PVLIVAVLVA LGLKFIPEKM INGFQIFAKF LVALITIGLA AAVIKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK PLMNVGKLLN VNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKDDAAQV KTEAEAQS
|
| |