Gene EcSMS35_0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0071 
SymbolthiP 
ID6145002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp79757 
End bp81367 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content58% 
IMG OID641614972 
Productthiamine transporter membrane protein 
Protein accessionYP_001742188 
Protein GI170682672 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID[TIGR01253] thiamine ABC transporter, permease protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.818645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.387113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCT 
ACGCTTGTGG TGGCGGTTGC GCTGGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GGTGACTGGG TGGCAGTCTG GCAGGACAGC TATCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTGCTCTCT GTCGTTCCGG CGATATTCCT CGCCCGCGCC
CTCTATCGCA GGCGCTTTCC AGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG
ATCCTCCCGG TGCTGGTCGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG
CTGGCCTCGC TCTGGCAATC ACTTGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTGCCGATGG CGAGCCGTTT ATTACTTCAG
GCACTGGAAA ACATTCCCGG CGAACAACGT CAACTTGCCG CACAACTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTTGT CGAATGGCCG TGGTTACGTC GACAAATCCC GCCAGTTGCT
GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCGCT GGGGGGCGGT
CCGCAGGCGA CCACTATCGA GCTGGCTATT TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTTGC GCTGATCCAG ATGGTGTGTT GCCTTGGGCT GGTGCTGCTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACTGTGT TAATTGTGCT GGCGCTGCTG
CTATTGCTGC CACCGTTACT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGCGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTGCGG
GCGCGGCAGA AAATGCTGGC CGGCCAGGCG CTGGAGATGA GCGGCATGTT GATCCTCGCC
ATGCCGGGGA TTGTGCTGGC AACGGGCTTC TTTTTACTGC TCAACAACAC TATCGGCCTG
CCACAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTGC TGGAAAACCC GATGCGCGAT ATCACCGCTC GCTACAACAT GTTATGTCAG
TCGCTGGGCA TTGAAGGCTG GTCACGCTTA AAAGTGGTGG AGCTGCGCGC CCTGAAACGT
CCGCTGGCGC AGGCACTGGC CTTTGCTTGT GTGCTGTCGA TTGGTGATTT TGGCGTGGTG
GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTA TGCTGCTACT CTGTTTTCTG
CTGTTTACTG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
 
Protein sequence
MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GDWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLWQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYNMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALIMLLLCFL LFTVIEKLPG RNVKTD