Gene EcHS_A0071 details


Gene Information

Locus tag: EcHS_A0071
Symbol: thiP
ID: 5592150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 74666
End bp: 76276
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID: 640919259
Product: thiamine transporter membrane protein
Protein accession: YP_001456854
Protein GI: 157159536
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
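
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record states neither convention explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 74666
end_bp = 76276
gene_length_bp = 1611
protein_length_aa = 536

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those two assumptions all three checks hold for the values listed here.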


Plasmid Coverage information

Num covering plasmid clones: 80
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC 
ACGCTGGTGG TAGCGGTTGC GCTCGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GGTAACTGGG TGGCAGTCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTACTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCC
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG
ATCCTCCCGG TGCTAGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG TCAGGGCTGG
CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTACCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATTCCCGG CGAGCAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTCAT CGAATGGCCG TGGCTACGGC GACAAATCCC GCCGGTTGCC
GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTGTCGCT GGGCGGCGGT
CCGCAGGCAA CCACTATCGA GCTGGCAATC TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTCGGGTT GGTGCTGCTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATTGTGCT GGCGCTGCTG
CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGC TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTTCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATTCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC CATTGGCCTG
CCGCAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG
TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTCG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC TTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTAGTG
GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG TGCGGTCACC GCGTTAATTC TGCTACTACT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
 
Protein sequence
MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GNWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG
WHFFRFIEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGLNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
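
The sequences in this record are printed in space-separated groups of ten characters, so the whitespace has to be removed before a length or GC content can be computed. The following is a minimal sketch of that cleanup, not the method used to produce the values in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical and introduced only for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC"
cleaned = clean_sequence(first_line)

print(len(cleaned))               # six groups of ten bases
print(cleaned.startswith("ATG"))  # the gene begins with a start codon
```

Passing the full gene sequence block through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content and gene length fields listed in the record.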