Gene YpAngola_A2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2943 
SymboltbpA 
ID5801415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3097844 
End bp3098890 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content49% 
IMG OID641340790 
Productthiamine transporter substrate binding subunit 
Protein accessionYP_001607320 
Protein GI162421366 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR01276] thiamine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.683707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAACTA CCGCCTCAGA TACCTTTGCC ACCTTTATGT TCAAGGAGTG CAAAGTGTTT 
AAACACATTA TTTCCTGCTT ATTACTGATA TCAGCCACCT CCGCGCTTGC CGCAGAAAAA
CCTACGCTGA CGGTCTATAC CTACGACTCC TTTGCTGCTG ACTGGGGCCC AGGCCCGGCG
ATTAAGCAGG CCTTTGAAGC TGAATGCGAT TGCCAGCTAA AATTTGTGGC ACTGGAAGAT
GGCGTTTCAC TGCTGAACCG CCTGCGGATG GAAGGTAAAA ACAGCCAGGC CGATGTGATT
TTAGGGTTGG ATAACAATCT GGTACAGGCG GCAGAACAAA CCGGCTTGTT TACCCCAAGC
CAGGTTGATA CCCGTAACCT GACCCTACCA GAGCCGTGGC AGAATAAGAC ATTTGTCCCT
TACGATTATG GCTATTTTGC TTTTGTGTAT AACAAAGAAA AACTGAAAAA CCCGCCAAAA
AGCCTGCACG AATTAATCAG CAGTAAAGAA CCGTGGAAAG TGATTTATCA AGACCCACGT
ACCAGCACTC CAGGTCTGGG TCTGATGCTG TGGATGCAAA AAGTTTATGG CGATCAAGCC
CCGCAAGCAT GGCAACAACT GGCGCAGAAA ACGGTCACAG TCACCAAAGG CTGGAGCGAG
GCGTATGGCT TGTTCCTCAA GGGGGAAGCG GATTTAGTGC TGAGCTACAC CACCTCTCCG
GCTTATCATT TAATTGCAGA AAAGAATAAC AACTATGCAG CCGCCGATTT CAGTGAAGGC
CACTATTTAC AAGTAGAAGT CGCCGGCCAA CTGGCCGCCA GCAAACAACC TGAACTGGCG
CAGCGCTTTA TGCAATTTAT CGTGACCCCT GCGTTCCAAA ACCACATTCC AACCGGCAAC
TGGATGTATC CAGTGATCAA AATGGATCTA CCCGCCGGGT TCGAGACACT GGCCGTGCCA
CAAACAGCGT TGCAATTTGA TGCTAAAGAC GTGGCGGATA ACCGCAGTAA ATGGATTCAG
GCATGGCAAT CCGCCGTCAG CCGTTAA
 
Protein sequence
MLTTASDTFA TFMFKECKVF KHIISCLLLI SATSALAAEK PTLTVYTYDS FAADWGPGPA 
IKQAFEAECD CQLKFVALED GVSLLNRLRM EGKNSQADVI LGLDNNLVQA AEQTGLFTPS
QVDTRNLTLP EPWQNKTFVP YDYGYFAFVY NKEKLKNPPK SLHELISSKE PWKVIYQDPR
TSTPGLGLML WMQKVYGDQA PQAWQQLAQK TVTVTKGWSE AYGLFLKGEA DLVLSYTTSP
AYHLIAEKNN NYAAADFSEG HYLQVEVAGQ LAASKQPELA QRFMQFIVTP AFQNHIPTGN
WMYPVIKMDL PAGFETLAVP QTALQFDAKD VADNRSKWIQ AWQSAVSR