Gene EcolC_3590 details


Gene Information

Locus tag: EcolC_3590
Symbol: thiP
ID: 6067549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3925060
End bp: 3926670
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID: 641603008
Product: thiamine transporter membrane protein
Protein accession: YP_001726531
Protein GI: 170021577
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
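
The coordinate and length fields in this record are linked by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. Both assumptions are consistent with the listed values, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3925060, 3926670)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1611 536
```

Both results agree with the Gene Length and Protein Length fields of the record.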


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.838581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000463415
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC 
ACGCTGGTGG TAGCGGTTGC GCTCGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GGTAACTGGG TGGCAATCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTGCTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCA
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG
ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG TCAGGGCTGG
CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTACCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATTCCCGG CGAGCAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTCAT CGAATGGCCG TGGCTACGGC GACAAATCCC GCCGGTTGCC
GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTGTCGCT GGGCGGCGGT
CCGCAGGCAA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTCGGGTT GGTGCTGCTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATTGTGCT GGCGCTGCTG
CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGC TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTTCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATTCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC CATTGGCCTG
CCGCAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG
TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTCG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC TTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTAGTG
GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG TGCGGTCACC GCGTTAATTC TGCTACTACT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
 
Protein sequence
MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GNWVAIWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG
WHFFRFIEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGLNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
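
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It builds the codon table by hand. For elongation codons, translation table 11 assigns the same amino acids as the standard code, and the two differ only in which codons are permitted as starts. Because this gene begins with ATG, the sketch does not handle alternative start codons. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGGCAACGC GCCGTCAGCC GTTAATTCCC"))  # prints: MATRRQPLIP
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) checks the whole record the same way. `gc_content` on the full sequence can be compared with the GC content field.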