Gene ECH74115_2034 details

Gene Information

Locus tag: ECH74115_2034
Symbol: tehA
ID: 6970361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1932672
End bp: 1933664
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 53%
IMG OID: 643385948
Product: potassium-tellurite ethidium and proflavin transporter
Protein accession: YP_002270437
Protein GI: 209396012
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1275] Tellurite resistance protein and related permeases
TIGRFAM ID: [TIGR00816] C4-dicarboxylate transporter/malic acid transport protein
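
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1932672, 1933664)
print(length, protein_length(length))  # prints: 993 330
```

Under these assumptions the result matches the listed gene length (993 bp) and protein length (330 aa).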


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00204759
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.87722e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGAGCG ATAAAGTGCT CAATTTGCCG GCAGGCTACT TTGGTATTGT GTTGGGGACG 
ATAGGGATGG GATTTGCCTG GCGCTATGCC AGCCAGGTTT GGCAGGTCAG CCACTGGTTA
GGGGATGGGC TGGTGATTCT GGCGATGATC ATCTGGGGAT TATTGACTAG CGCATTTATT
ACCCGACTCA TACGCTTTCC GCATAGCGTG CTGGCGGAAG TTCGCCATCC AGTGCTGAGC
AGTTTTGTGA GTTTGTTTCC TGCAACGACG ATGCTGGTGG CGATTGGTTT TGTTCCGTGG
TTTCGCCCAC TGGCGGTGTG CCTGTTCAGT TTTGGTGTCG TGGTTCAGTT GGCTTATGCC
GCCTGGCAAA CTGCGGGATT ATGGCGCGGA TCTCACCCTG AAGAAGCTAC TACGCCTGGA
CTGTATCTGC CGACAGTTGC CAACAACTTT ATCAGCGCAA TGGCCTGTGG TGCGTTGGGC
TACACCGACG CCGGTCTGGT GTTTTTAGGC GCAGGCGTTT TCTCATGGCT AAGCCTTGAA
CCGGTGATCT TGCAGCGTCT GCGTAGTTCG GGAGAATTAC CCACGGCACT GCGGACATCA
CTCGGCATTC AGCTCGCTCC TGCGCTGGTG GCCTGTAGTG CCTGGCTGAG CGTCAACGGC
GGCGAGGGTG ACACGCTGGC GAAAATGCTT TTCGGTTATG GACTGCTGCA ACTGCTGTTT
ATGCTACGTC TGATGCCATG GTATCTCTCC CAGCCATTTA ATGCTTCATT CTGGAGTTTC
TCGTTCGGCG TATCTGCACT GGCAACCACC GGTTTGCATC TGGGGAGTGG CAGCGATAAT
GGATTTTTCC ATACGCTGGC GGTGCCGCTG TTTATCTTTA CCAATTTTAT TATTGCAATA
CTGCTCATCC GTACTTTGGC GCTTCTGATG CAGGGAAAAT TGTTAGTCAG AACCGAGCGC
GCCGTTTTAA TGAAAGCAGA GGACAAAGAA TGA
 
Protein sequence
MQSDKVLNLP AGYFGIVLGT IGMGFAWRYA SQVWQVSHWL GDGLVILAMI IWGLLTSAFI 
TRLIRFPHSV LAEVRHPVLS SFVSLFPATT MLVAIGFVPW FRPLAVCLFS FGVVVQLAYA
AWQTAGLWRG SHPEEATTPG LYLPTVANNF ISAMACGALG YTDAGLVFLG AGVFSWLSLE
PVILQRLRSS GELPTALRTS LGIQLAPALV ACSAWLSVNG GEGDTLAKML FGYGLLQLLF
MLRLMPWYLS QPFNASFWSF SFGVSALATT GLHLGSGSDN GFFHTLAVPL FIFTNFIIAI
LLIRTLALLM QGKLLVRTER AVLMKAEDKE
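
The relationship between the gene sequence and the protein sequence can be checked with a minimal sketch like the one below. It strips the 10-base grouping spaces, computes the GC fraction, and translates codons until the first stop. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are illustrative assumptions, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence shown in this record.
first_line = "ATGCAGAGCG ATAAAGTGCT CAATTTGCCG GCAGGCTACT TTGGTATTGT GTTGGGGACG"
print(translate(first_line))    # MQSDKVLNLPAGYFGIVLGT
print(gc_fraction(first_line))  # 0.5
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow the listed 330 aa length and 53% GC content to be checked in the same way.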