Gene SeHA_C2720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2720 
SymboleutH 
ID6490998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2627259 
End bp2628485 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content57% 
IMG OID642742898 
Productethanolamine utilization protein EutH 
Protein accessionYP_002046525 
Protein GI194449723 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3192] Ethanolamine utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.741445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.29915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT TGCCGCCGTG 
GACAGAATCC TGTCGCAGTT CGGCGGGTCG GCGCGCTTCC TCGGTAAATT CGGTAAGAGT
ATCGAGGGGT CTGGCGGCCA GTTTGAAGAG GGCTTTATGG CGATGGGCGC GCTGGGGCTG
GCGATGGTCG GTATGACCGC GCTGGCGCCG GTGCTGGCGC ATGTACTGGG GCCGGTTATT
ATCCCGGTAT ACGAAATGCT GGGCGCGAAC CCATCCATGT TCGCGGGTAC GTTGCTGGCC
TGTGATATGG GCGGATTCTT CCTCGCCAAA GAGCTGGCCG GCGGTGATGT CGCGGCGTGG
CTATACTCAG GGTTAATACT TGGGTCGATG ATGGGGCCGA CCATTGTGTT CTCCATTCCG
GTGGCGCTCG GCATTATCGA ACCGTCTGAC CGCCGCTACC TGGCGCTTGG CGTACTGGCG
GGTATCGTCA CCATTCCCAT TGGCTGCATT GCGGGGGGAT TGATCGCCAT GTACTCAGGC
GTGCAGATCA ATGGTCAGCC AGTGGAGTTT ACCTTCGCGC TGATCCTGAT GAACATGATC
CCGGTATTGA TTGTCGCGGT GCTGGTAGCG CTGGGGCTGA AGTTCATCCC GGAAAAAATG
ATCAACGGTT TCCAGATTTT CGCCAAATTT CTGGTGGCGC TGATCACCAT CGGTCTGGCG
GCTGCGGTGA TTAAATTCCT GTTGGGCTGG GAGTTGATTC CGGGGCTCGA CCCGATCTTT
ATGGCGCCAG GCGACAAACC TGGCGAAGTG ATGCGCGCCA TTGAAGTGAT CGGCTCTATC
TCCTGCGTGC TGCTCGGCGC GTATCCGATG GTGCTGTTAC TGACCCGCTG GTTTGAAAAA
CCGTTGATGA ATGTCGGTAA GCTGCTGAAC GTAAATAATA TTGCGGCGGC AGGCATGGTG
GCGACCCTGG CGAACAATAT CCCGATGTTC GGCATGATGA AGCAGATGGA TACCCGCGGC
AAAGTGATTA ACTGCGCATT TGCCGTCTCT GCGGCGTTCG CGCTGGGCGA CCATTTAGGC
TTCGCCGCCG CCAACATGAA CGCCATGATC TTCCCGATGA TTGTCGGCAA GCTGATCGGC
GGCGTGACAG CGATTGGCGT GGCGATGATG CTGGTGCCGA AAGATGACGC CGCCCAGGTG
AAAACCGAAG CGGAGGCGCA ATCGTGA
 
Protein sequence
MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGGQFEE GFMAMGALGL 
AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW
LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLIAMYSG
VQINGQPVEF TFALILMNMI PVLIVAVLVA LGLKFIPEKM INGFQIFAKF LVALITIGLA
AAVIKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK
PLMNVGKLLN VNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG
FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKDDAAQV KTEAEAQS