Gene EcE24377A_0069 details

Gene Information

Locus tag: EcE24377A_0069
Symbol: thiP
ID: 5590217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 74601
End bp: 76211
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID: 640923800
Product: thiamine transporter membrane protein
Protein accession: YP_001461237
Protein GI: 157157490
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
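
The length fields above are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(74601, 76211)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both values match the Gene Length and Protein Length fields of this record.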


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC 
ACGCTGGTGG TAGCGGTTGC GCTCGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GGTAACTGGG TGGCAGTCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTACTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCC
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG
ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCA TCAGGGCTGG
CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGCATTTTGC TGGCGCACGT ATTTTTTAAT CTGCCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATTCCCGG CGAACAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTCGT CGAATGGCCG TGGCTACGGC GACAAATCCC GCCGGTTGCC
GCGCTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCGCT GGGCGGCGGT
CCGCAGGCAA CTACTATCGA GCTGGCTATT TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTGGC GCTGCTCCAG ATGGTGTGTT GCCTTGGGCT GGTGCTGTTG
AGTCAGCGAC TAAGTAAGGC CATTGCGCCC GGTACCACGC TGCTGCAAGG CTGGCGCGAT
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATAGTGCT GGCGCTGCTG
CTGTTGCTGC CACCGTTGCT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTGCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATCCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC CATTGGCCTG
CCGCAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG
TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTCG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC TTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTAGTG
GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTC TGCTACTACT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
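
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch, the GC content field could be recomputed as shown below. The helper names are hypothetical, and the database's rounding rule is not stated here, so the result for the full sequence should be compared with the listed 58% rather than assumed to match exactly.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row)))    # GC percentage of this row only
```

Passing the full gene sequence to `gc_percent` gives the value for the whole CDS.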
 
Protein sequence
MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GNWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGHQGW
LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPGEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALLQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL PQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
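
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below hard-codes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may serve as start codons). The `translate` function is a hypothetical helper, not the tool the database used.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC"
print(translate(first_row))  # MATRRQPLIPGWLIPGVSAA
```

The output matches the first 20 residues of the protein sequence above. The same function can be applied to the full gene sequence and the result compared with the listed 536 residues.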