Gene EcSMS35_1372 details


Gene Information

Locus tag: EcSMS35_1372
Symbol:
ID: 6144448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1358508
End bp: 1360064
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 54%
IMG OID: 641616250
Product: TerC family/CBS/transporter associated domain-containing protein
Protein accession: YP_001743430
Protein GI: 170679631
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0861] Membrane protein TerC, possibly involved in tellurium resistance
TIGRFAM ID:
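
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1358508, 1360064)
print(length)                     # 1557
print(protein_length_aa(length))  # 518
```

Under these assumptions both results agree with the reported Gene Length and Protein Length.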


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0696189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0303689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTCT TAATGGACCC CTCGATTTGG GCGGGGCTAC TCACGCTTGT TGTTCTCGAA 
ATTGTGCTGG GTATCGATAA CCTGGTCTTC ATCGCCATTC TTGCTGACAA ACTGCCGCCA
AAACAACGCG ATAAAGCGCG TTTGCTGGGA TTATCACTGG CGCTGATTAT GCGTCTGGGG
CTGCTGTCGC TGATTTCATG GATGGTCACG CTGACCAAAC CGCTATTTAC CGTCATGGAT
TTCTCCTTCT CCGGACGCGA CCTGATTATG TTGTTCGGGG GGATATTCTT GCTGTTCAAA
GCAACAACCG AACTGCATGA ACGGCTGGAA AACCGCGATC ATGATTCCGG CCACGGTAAA
GGCTACGCCA GTTTCTGGGT GGTCGTCACA CAGATCGTCA TCCTTGACGC CGTCTTCTCG
TTGGATGCGG TAATTACGGC AGTAGGGATG GTTAACCATC TGCCAGTGAT GATGGCGGCG
GTAGTGATTG CGATGGCGGT TATGCTGCTG GCATCGAAAC CGCTGACGCG ATTTGTTAAC
CAGCATCCAA CGGTGGTGGT GCTCTGTCTG AGCTTCCTGT TAATGATAGG TCTGAGTCTG
GTGGCAGAAG GTTTCGGTTT CCACATTCCG AAAGGTTACC TGTATGCCGC GATTGGCTTC
TCGATCATCA TCGAAGTGTT TAACCAAATT GCGCGTCGCA ACTTTATTCG TCACCAGTCG
ACTTTGCCGC TGCGAGCGCG TACTGCCGAT GCCATCCTGC GTTTGATGGG CGGGAAACGT
CAGGCCAATG TCCAGCACGA TGCCGATAAC CCGATGCCGA TGCCGATCCC GGAAGGTGCA
TTTGCCGAAG AAGAACGTTA CATGATTAAC GGCGTACTGA CGCTGGCGTC GCGTTCTCTG
CGCGGGATCA TGACGCCGCG CGGTGAAATA AGCTGGGTTG ACGCTAATCT CGGGGTCGAT
GAAATCCGCG AGCAACTGCT CTCTTCACCG CACAGTCTGT TCCCGGTATG TCGCGGTGAA
CTGGATGAAA TCATCGGTAT CGTACGTGCT AAAGAACTGC TGGTGGCGCT GGAAGAGGGC
GTTGATGTGG CGGCGATTGC TTCGGCGTCT CCGGCGATTA TCGTCCCGGA AACCCTCGAT
CCGATCAACC TGCTGGGCGT ACTGCGTCGT GCTCGCGGGA GCTTTGTTAT CGTGACCAAC
GAGTTTGGTG TGGTACAAGG TCTGGTCACG CCGCTGGATG TGCTGGAAGC CATTGCGGGT
GAATTCCCGG ACGCTGACGA AACGCCGGAA ATCATTACCG ACGGTGACGG CTGGCTGGTA
AAAGGCGGTA CAGATTTGCA TGCCTTGCAG CAGGCGCTTG ATGTTGAGCA CCTTGCCGAT
GACGATGATA TCGCGACGGT CGCGGGCCTC GTGATCTCGG CAAATGGTCA CATTCCCCGT
GTGGGCGATG TGATTGATGT AGGGCCACTG CATATCACCA TCATTGAAGC CAATGATTAT
CGTGTTGATC TGGTTCGCAT TGTTAAAGAG CAACCGGCGC ACGATGAAGA TGAGTAA
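
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared against the reported GC content; the helper names are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence; paste the full block to
# compute the value for the whole gene.
first_row = "ATGGAATTCT TAATGGACCC CTCGATTTGG GCGGGGCTAC TCACGCTTGT TGTTCTCGAA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```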
 
Protein sequence
MEFLMDPSIW AGLLTLVVLE IVLGIDNLVF IAILADKLPP KQRDKARLLG LSLALIMRLG 
LLSLISWMVT LTKPLFTVMD FSFSGRDLIM LFGGIFLLFK ATTELHERLE NRDHDSGHGK
GYASFWVVVT QIVILDAVFS LDAVITAVGM VNHLPVMMAA VVIAMAVMLL ASKPLTRFVN
QHPTVVVLCL SFLLMIGLSL VAEGFGFHIP KGYLYAAIGF SIIIEVFNQI ARRNFIRHQS
TLPLRARTAD AILRLMGGKR QANVQHDADN PMPMPIPEGA FAEEERYMIN GVLTLASRSL
RGIMTPRGEI SWVDANLGVD EIREQLLSSP HSLFPVCRGE LDEIIGIVRA KELLVALEEG
VDVAAIASAS PAIIVPETLD PINLLGVLRR ARGSFVIVTN EFGVVQGLVT PLDVLEAIAG
EFPDADETPE IITDGDGWLV KGGTDLHALQ QALDVEHLAD DDDIATVAGL VISANGHIPR
VGDVIDVGPL HITIIEANDY RVDLVRIVKE QPAHDEDE
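
The protein sequence can be related back to the gene sequence by codon translation. The following is a minimal sketch, assuming the sense-codon assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, which this sketch does not model). The `translate` function is hypothetical and is shown on the first 30 bases and the last 15 bases of the gene.

```python
from itertools import product

# Codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGAATTCTTAATGGACCCCTCGATTTGG"))  # MEFLMDPSIW
print(translate("GATGAAGATGAGTAA"))                 # DEDE
```

Both outputs match the corresponding ends of the protein sequence listed above.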