Gene SbBS512_E0059 details


Gene Information

Locus tag: SbBS512_E0059
Symbol: thiP
ID: 6271736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 63975
End bp: 65585
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 58%
IMG OID: 641724318
Product: thiamine transporter membrane protein
Protein accession: YP_001878878
Protein GI: 187732002
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1178] ABC-type Fe3+ transport system, permease component
TIGRFAM ID: [TIGR01253] thiamine ABC transporter, permease protein
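
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(63975, 65585)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.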


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACGC GCCGTCAGCC GTTAATTCCC GGCTGGTTAA TTCCAGGTGT AAGCGCCGCC 
ACGCTGGTGG TAGCGGTTGC GCTCGCGGCG TTTCTCGCCC TGTGGTGGAA CGCGCCGCAG
GGTAACTGGG TGGCAGTCTG GCAGGACAGC TACCTGTGGC ATGTGGTGCG CTTCTCCTTC
TGGCAGGCGT TTCTCTCGGC GCTACTCTCT GTCGTACCCG CGATATTCCT CGCCCGCGCC
CTCTATCGCA GGCGCTTTCC GGGTCGGCTG GCGCTGTTGC GTCTGTGCGC AATGACCTTG
ATCCTCCCGG TGCTGGTTGC TGTTTTCGGC ATTCTTAGCG TCTATGGTCG CCAGGGCTGG
CTGGCATCGC TCTGCCAATC GCTCGGTCTG GAGTGGACCT TTTCGCCCTA CGGCCTGCAA
GGTATTTTGC TGGCGCACGT ATTTTTTAAT CTGCCGATGG CGAGCCGCTT ATTACTCCAG
GCACTGGAAA ACATTCCCGA CGAACAACGT CAGCTTGCCG CCCAGCTTGG GATGCGCGGC
TGGCATTTTT TCCGCTTCGT CGAATGGCCG TGGTTACGGC GACAAATCCC GCCGGTTGCT
GCACTTATCT TTATGCTCTG TTTCGCCAGC TTCGCCACCG TGCTATCACT GGGCGGCGGT
CCGCAGGCGA CCACTATCGA GCTGGCTATT TATCAGGCGC TGAGTTACGA CTACGATCCT
GCCCGCGCGG CGATGCTGGC GCTGATCCAG ATGGTGTGTT GCCTTGGGCT GGTACTGTTG
AGTCAGCGAT TGAGTAAGGC CATTGCGCCA GGCACCACGC TGCTGCAAGG CTGGCGCGAC
CCGGACGATC GTCTGCATAG CCGCATTTGC GACACGGTGT TAATAGTGCT GGCGCTGCTG
CTGTTGTTGC CACCGTTGCT GGCGGTGATC GTCGATGGGG TAAATCGCCA GTTGCCGGAA
GTGCTGGCAC AACCGGTGCT GTGGCAGGCG CTGTGGACCT CGTTGCGTAT TGCGCTGGCG
GCAGGTGTAT TGTGCGTAGT GCTGACCATG ATGCTGCTAT GGAGCAGTCG CGAACTTCGG
GCGCGGCAGA AAATGCTGGC GGGTCAGGCG CTGGAGATGA GCGGCATGTT GATTCTCGCC
ATGCCGGGGA TTGTGCTGGC TACCGGCTTC TTTTTACTGC TCAACAACAC TATCGGCCTG
CAACAATCTG CTGACGGCAT TGTGATTTTC ACCAATGCGT TAATGGCGAT CCCTTATGCG
CTGAAAGTAC TGGAAAACCC GATGCGCGAT ATCACCGCCC GCTACAGTAT GTTGTGTCAG
TCGCTGGGCA TTGAAGGCTG GTCGCGCTTA AAAGTGGTGG AGCTGCGCGC CCTGAAACGT
CCACTGGCGC AGGCGCTGGC CTTTGCCTGC GTGCTGTCGA TTGGTGATTT TGGCGTGGTG
GCGTTGTTCG GTAACGATGA TTTCCGTACC CTGCCGTTTT ATCTCTACCA GCAAATTGGC
TCCTATCGCA GCCAGGACGG CGCGGTCACC GCGTTAATTC TGCTGCTACT CTGTTTTCTG
CTGTTTACCG TGATTGAAAA ACTACCGGGG CGAAATGTTA AAACTGACTG A
 
Protein sequence
MATRRQPLIP GWLIPGVSAA TLVVAVALAA FLALWWNAPQ GNWVAVWQDS YLWHVVRFSF 
WQAFLSALLS VVPAIFLARA LYRRRFPGRL ALLRLCAMTL ILPVLVAVFG ILSVYGRQGW
LASLCQSLGL EWTFSPYGLQ GILLAHVFFN LPMASRLLLQ ALENIPDEQR QLAAQLGMRG
WHFFRFVEWP WLRRQIPPVA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDP
ARAAMLALIQ MVCCLGLVLL SQRLSKAIAP GTTLLQGWRD PDDRLHSRIC DTVLIVLALL
LLLPPLLAVI VDGVNRQLPE VLAQPVLWQA LWTSLRIALA AGVLCVVLTM MLLWSSRELR
ARQKMLAGQA LEMSGMLILA MPGIVLATGF FLLLNNTIGL QQSADGIVIF TNALMAIPYA
LKVLENPMRD ITARYSMLCQ SLGIEGWSRL KVVELRALKR PLAQALAFAC VLSIGDFGVV
ALFGNDDFRT LPFYLYQQIG SYRSQDGAVT ALILLLLCFL LFTVIEKLPG RNVKTD
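
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation sketch. This is a minimal illustration, not the method the database itself uses: it builds the codon table by hand (the amino acid assignments of translation table 11 match the standard code), strips the spacing used in the 10-base blocks, and stops at the first stop codon. All function and variable names are hypothetical. Only the first and last stretches of the gene sequence are used here to keep the example short.

```python
# Codon table in the usual TCAG ordering; amino acid assignments for
# translation table 11 are the same as for the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
first_block = "ATGGCAACGC GCCGTCAGCC GTTAATTCCC"
print(translate(first_block))  # MATRRQPLIP
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.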