Gene EcSMS35_1746 details

Gene Information

Locus tag: EcSMS35_1746
Symbol: tehA
ID: 6145173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1749361
End bp: 1750353
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 53%
IMG OID: 641616622
Product: potassium-tellurite ethidium and proflavin transporter
Protein accession: YP_001743800
Protein GI: 170682405
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1275] Tellurite resistance protein and related permeases
TIGRFAM ID: [TIGR00816] C4-dicarboxylate transporter/malic acid transport protein
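
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record; 1-based inclusive numbering is an assumption.
START_BP = 1749361
END_BP = 1750353


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 993 330
```

Both values agree with the Gene Length and Protein Length fields of the record.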


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000864633
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 4.63583e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGAGCG ATAAAGTGCT CAATTTGCCG GCAGGCTACT TTGGTATTGT GTTGGGGACG 
ATAGGGATGG GATTTGCCTG GCGCTATGCC AGCCAGGTTT GGCAGGTCAG CCACTGGTTG
GGGGATGGGC TGGTGATTCT GGCGATGATC ATCTGGGGAT TATTGACTGG CGCATTTATT
ACCCGACTCA TACGCTTTCC GCATAGCGTG CTGGCAGAAG TGCGCCATCC AGTGCTGAGC
AGTTTTGTGA GTTTGTTTCC GGCAACGACG ATGCTGGTGG CGATTGGTTT TGTTCCGTGG
TTTCGCCCAC TGGCGGTGTG CCTGTTCAGT TTTGGTGTCG TGGTTCAGTT GGCTTATGCC
GCCTGGCAAA CTGCGGGATT ATGGCGCGGA TCTCACCCTG AAGAAGCTAC CACGCCTGGA
CTGTATCTGC CGACAGTTGC CAACAACTTT ATCAGCGCAA TGGCCTGTGG TGCGTTGGGC
TACACCGACG CCGGTCTGGT GTTTTTAGGC GCAGGCGTTT TCTCATGGCT AAGCCTGGAA
CCGGTGATCT TGCAGCGTCT GCGCAGTTCG GGAGAATTAC CCACGGCACT GAGGACATCA
CTCGGCATTC AGCTCGCTCC TGCGCTGGTG GCCTGTAGTG CCTGGCTGAG CGTCAACGGC
GGCGAGGGTG ACACGCTGGC GAAAATGCTT TTCGGTTATG GACTGCTGCA ACTGCTGTTT
ATGCTACGTC TGATGCCATG GTATCTCTCC CAGCCATTTA ATGCTTCATT CTGGAGTTTC
TCGTTCGGCG TATCTGCACT GGCAACCACC GGTTTGCATC TGGGGAGTGG CAGCGATAAT
GGATTTTTCC ATACGCTGGC GGTGCCGCTG TTTATCTTTA CCAATTTTAT TATTGCAATA
CTGCTCATCC GTACTTTTGC GCTTCTGATG CAGGGAAAAT TGTTAGTCAG AACCGAGCGC
GCCGTTTTAA TGAAAGCAGA GGACAAAGAA TGA
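
The gene sequence is listed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the listing could be cleaned and how a GC percentage could be computed. The helper names are hypothetical, and the record does not say how its own GC content value was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGCAGAGCG ATAAAGTGCT CAATTTGCCG GCAGGCTACT TTGGTATTGT GTTGGGGACG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Passing the full 993 bp listing through the same two functions gives a figure that can be compared with the GC content field.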
 
Protein sequence
MQSDKVLNLP AGYFGIVLGT IGMGFAWRYA SQVWQVSHWL GDGLVILAMI IWGLLTGAFI 
TRLIRFPHSV LAEVRHPVLS SFVSLFPATT MLVAIGFVPW FRPLAVCLFS FGVVVQLAYA
AWQTAGLWRG SHPEEATTPG LYLPTVANNF ISAMACGALG YTDAGLVFLG AGVFSWLSLE
PVILQRLRSS GELPTALRTS LGIQLAPALV ACSAWLSVNG GEGDTLAKML FGYGLLQLLF
MLRLMPWYLS QPFNASFWSF SFGVSALATT GLHLGSGSDN GFFHTLAVPL FIFTNFIIAI
LLIRTFALLM QGKLLVRTER AVLMKAEDKE
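
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene starts with ATG. The function name and layout are illustrative assumptions, not the method used to produce this record.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by translation tables 1 and 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence listing above.
print(translate("ATGCAGAGCG ATAAAGTGCT CAATTTGCCG"))     # MQSDKVLNLP
print(translate("GCCGTTTTAA TGAAAGCAGA GGACAAAGAA TGA"))  # AVLMKAEDKE
```

The two outputs match the first and last ten residues of the protein sequence. The final line can be translated on its own because every preceding line holds 60 bases, so it begins in frame.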