Gene BAS3361 details

Gene Information

Locus tag: BAS3361
Symbol:
ID: 2848225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3329316
End bp: 3330332
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 37%
IMG OID: 637506605
Product: thiamine/molybdopterin biosynthesis MoeB-like protein
Protein accession: YP_029618
Protein GI: 49186366
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
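
The start, end, gene length, and protein length fields are mutually consistent, which can be checked with a minimal Python sketch. It assumes inclusive 1-based coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


length = gene_length(3329316, 3330332)
print(length)                  # 1017
print(protein_length(length))  # 338
```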


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGC GGTATTCACG ACAACAGTTG TTCAAACCGA TTGGGGATAG AGGACAAGAA 
AAGATTCGAA ATAAACATGT GTTAATTGTA GGGGCAGGCG CATTAGGAAG TGCAAGTGCT
GAAAGTTTCG TACGTGCAGG CATTGGGAAG TTGACGATTA TTGATCGTGA TTATGTTGAA
TGGAGTAATT TACAAAGACA ACAACTGTAC TCTGAAGAAG ATGCGAGAGA GAAATTGCCA
AAAGCAATCG CTGCTAAAAA TCGGCTAGAA AAACTTAATT CGGAAGTACA AATAGATGCT
TTCGTAATGG ATGCATGTGC AGAAAACTTG GAAGGACTAT TAGAAAATGT TGATGTAATA
ATTGATGCAA CAGATAATTT CGATATCCGA TTTATAATAA ATGATTTATC ACAAAAATAT
AATATCCCGT GGGTATATGG TTCTTGCGTT GGCTCGTACG GTATGAGTTA TACAATTATT
CCGCAAGAGA CACCGTGTTT ACATTGTGTG CTGAAGAACG TTCCAGTTAC AGGTGTGACG
TGTGATACAG CTGGAATTAT TAGTCCGACT GTTCAAATCG TTGCAGCATA TCAAGTGGCG
GAAGCACTAA AAATTTTAGT AGAAGATTTT GCAGCAATTA GAAAAACATT TTTTATGTTT
GATATATGGA GTAATCAAAA CCATTTTATA AAACTAGGAA AAATCAAGAC AGACGATTGC
CCTTCGTGCG GTTTGAATCG AACTTATCCT TATTTATCAT ACGAAAATCA AACGAAGGTA
GCCGTTTTGT GCGGAAGAAA TACAGTTCAA ATTAGAACGG TAGAAAGTAG ACAGTACAAT
TTTGATGATA TAGAAAAAGT ATTAAAAAAA CTGGGGGAAG TAGATCGGAA TCCGTATTTA
CTATCTTGCC AACTAGATGA GTACCGCGTC GTTATTTTTC GAGATGGTCG TGTTTTCATT
CATGGTACAA ATGATATTTC AAAAGCGAAA CAGTTATATT ATCGCGTATT CGGTTAA
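
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal Python sketch under the assumption that GC content means the percentage of G and C bases over all bases, with the spacing between 10-base blocks ignored. The function name `gc_content` is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used only as a short demo input.
print(gc_content("ATGGCTGAGC"))  # 60.0
```

Passing the full gene sequence, pasted as a single string, gives the gene-level value to compare against the reported figure.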
 
Protein sequence
MAERYSRQQL FKPIGDRGQE KIRNKHVLIV GAGALGSASA ESFVRAGIGK LTIIDRDYVE 
WSNLQRQQLY SEEDAREKLP KAIAAKNRLE KLNSEVQIDA FVMDACAENL EGLLENVDVI
IDATDNFDIR FIINDLSQKY NIPWVYGSCV GSYGMSYTII PQETPCLHCV LKNVPVTGVT
CDTAGIISPT VQIVAAYQVA EALKILVEDF AAIRKTFFMF DIWSNQNHFI KLGKIKTDDC
PSCGLNRTYP YLSYENQTKV AVLCGRNTVQ IRTVESRQYN FDDIEKVLKK LGEVDRNPYL
LSCQLDEYRV VIFRDGRVFI HGTNDISKAK QLYYRVFG
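
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal Python illustration, not the database's own pipeline. It builds the codon table from the compact 64-letter form used for genetic code tables; the elongation assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCTGAGC GGTATTCACG ACAACAGTTG TTCAAACCGA TTGGGGATAG AGGACAAGAA"
print(translate(first_line))  # MAERYSRQQLFKPIGDRGQE
```

The output matches the first 20 residues of the protein sequence listed above.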