Gene BAS3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3470 
Symbol 
ID2850559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3444408 
End bp3446408 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content41% 
IMG OID637506712 
Producttransketolase 
Protein accessionYP_029725 
Protein GI49186473 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000202879 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACATT CAATCGAACA ACTTTCTATC AACACGATTC GCACATTATC CATCGATGCG 
ATTGAAAAAG CAAACTCTGG TCACCCAGGA ATGCCAATGG GTGCAGCACC AATGGCTTAT
ACATTATGGA CTCAATTTAT GAAACACAAT CCAAATAACC CAACGTGGTT TAACCGTGAT
CGTTTCGTAT TATCTGCAGG TCATGGTTCA ATGTTATTAT ACAGCCTACT TCACCTATCT
GGTTATGATG TAACAATGGA TGACTTAAAG AACTTCCGTC AATGGGGAAG CAAAACTCCA
GGGCATCCTG AGTACGGTCA TACAGCTGGT GTAGATGCAA CTACTGGTCC ACTTGGACAA
GGTATTGCAA CTGCTGTAGG TATGGCAATG GCAGAAAGAC ATTTAGCTGC TAAATATAAC
CGTGATGCGT ATAATATAGT AGATCATTAT ACATACGCTA TTTGTGGTGA TGGAGATTTA
ATGGAAGGCG TTTCTGCTGA AGCATCTTCA TTAGCTGCTC ATTTACAATT AGGTCGTCTT
GTTGTGCTTT ATGATTCAAA CGATATTTCA TTAGATGGCG ATTTAAATCG TTCATTCTCT
GAAAGTGTAG AAGATCGTTA CAAAGCATAC GGATGGCAAG TAATCCGTGT TGAGGATGGA
AACGATATTG AAGCTATCGC GAAAGCAATC GAAGAAGCGA AAGCTGACGA AAAACGCCCA
ACGCTAATTG AAGTAAGAAC GACAATTGGT TTCGGTTCTC CAAACAAATC AGGAAAATCA
GCTTCACATG GTTCTCCACT TGGTGTAGAA GAAACAAAGT TAACGAAAGA AGCATACGCT
TGGACTGCTG AACAAGACTT CCATGTAGCA GAAGAAGTAT ATGAAAACTT CCGTAAAACA
GTACAAGATG TTGGTGAAAC TGCACAAGCT GAGTGGAATA CTATGCTAGG TGAATATGCA
CAAGCATATC CAGAATTAGC AAACGAACTG CAAGCAGCAA TGAACGGTCT TCTTCCAGAA
GGTTGGGAGC AAAACTTACC AACTTATGAA TTAGGATCAA AAGCAGCAAC TCGTAATTCT
TCAGGTGCTG TAATTAATGC AATTGCAGAG TCTGTACCAT CATTCTTCGG TGGATCTGCT
GACCTTGCTG GTTCTAACAA AACATACATG AATAACGAAA AAGACTTTAC AAGAGATGAT
TACAGCGGTA AAAACATTTG GTACGGTGTA CGTGAGTTCG CAATGGGTGC AGCAATGAAC
GGTATTGCAC TACATGGTGG TTTAAAAACT TACGGTGGTA CGTTCTTCGT ATTCTCTGAC
TACTTACGCC CAGCAATTCG TCTTGCAGCA TTAATGCAAT TGCCGGTAAC GTATGTATTC
ACACACGACA GTATCGCTGT TGGTGAAGAT GGTCCAACAC ATGAACCAAT CGAGCAATTA
GCAGCGCTAC GTGCAATGCC AAATGTATCT GTTATTCGTC CAGCTGACGG TAACGAATCT
GTTGCAGCTT GGAGACTAGC TCTAGAATCT ACAAACAAAC CAACTGCTTT AGTATTAACT
CGTCAAGATC TTCCAACATT AGAAGGTGCA AAAGACGATA CGTATGAAAA AGTAGCAAAA
GGTGCGTATG TAGTTTCTGC AAGCAAGAAA GAAACAGCTG ATGTAATCTT ACTTGCAACT
GGATCTGAAG TAAGTCTAGC TGTTGAAGCT CAAAAAGCAT TAGCAGTAGA CGGCGTTGAT
GCATCTGTTG TCAGCATGCC ATCTATGGAT CGCTTTGAAG CTCAAACAGC TGAGTACAAA
GAATCTGTAT TACCAAAAGC AGTAACAAAA CGTTTCGCAA TCGAAATGGG TGCTACATTC
GGATGGCACC GTTACGTAGG TCTTGAAGGA GATGTGTTAG GTATCGATAC ATTCGGTGCT
TCTGCTCCTG GTGAGAAGAT TATGGAAGAG TATGGATTTA CTGTAGAGAA CGTTGTTCGT
AAAGTAAAAG AAATGCTTTA A
 
Protein sequence
MSHSIEQLSI NTIRTLSIDA IEKANSGHPG MPMGAAPMAY TLWTQFMKHN PNNPTWFNRD 
RFVLSAGHGS MLLYSLLHLS GYDVTMDDLK NFRQWGSKTP GHPEYGHTAG VDATTGPLGQ
GIATAVGMAM AERHLAAKYN RDAYNIVDHY TYAICGDGDL MEGVSAEASS LAAHLQLGRL
VVLYDSNDIS LDGDLNRSFS ESVEDRYKAY GWQVIRVEDG NDIEAIAKAI EEAKADEKRP
TLIEVRTTIG FGSPNKSGKS ASHGSPLGVE ETKLTKEAYA WTAEQDFHVA EEVYENFRKT
VQDVGETAQA EWNTMLGEYA QAYPELANEL QAAMNGLLPE GWEQNLPTYE LGSKAATRNS
SGAVINAIAE SVPSFFGGSA DLAGSNKTYM NNEKDFTRDD YSGKNIWYGV REFAMGAAMN
GIALHGGLKT YGGTFFVFSD YLRPAIRLAA LMQLPVTYVF THDSIAVGED GPTHEPIEQL
AALRAMPNVS VIRPADGNES VAAWRLALES TNKPTALVLT RQDLPTLEGA KDDTYEKVAK
GAYVVSASKK ETADVILLAT GSEVSLAVEA QKALAVDGVD ASVVSMPSMD RFEAQTAEYK
ESVLPKAVTK RFAIEMGATF GWHRYVGLEG DVLGIDTFGA SAPGEKIMEE YGFTVENVVR
KVKEML