Gene GBAA_3624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_3624 
Symbol 
ID2816584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3328749 
End bp3329765 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content37% 
IMG OID637790363 
Productthiamine/molybdopterin biosynthesis MoeB-like protein 
Protein accessionYP_020257 
Protein GI47528908 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGC GGTATTCACG ACAACAGTTG TTCAAACCGA TTGGGGATAG AGGACAAGAA 
AAGATTCGAA ATAAACATGT GTTAATTGTA GGGGCAGGCG CATTAGGAAG TGCAAGTGCT
GAAAGTTTCG TACGTGCAGG CATTGGGAAG TTGACGATTA TTGATCGTGA TTATGTTGAA
TGGAGTAATT TACAAAGACA ACAACTGTAC TCTGAAGAAG ATGCGAGAGA GAAATTGCCA
AAAGCAATCG CTGCTAAAAA TCGGCTAGAA AAACTTAATT CGGAAGTACA AATAGATGCT
TTCGTAATGG ATGCATGTGC AGAAAACTTG GAAGGACTAT TAGAAAATGT TGATGTAATA
ATTGATGCAA CAGATAATTT CGATATCCGA TTTATAATAA ATGATTTATC ACAAAAATAT
AATATCCCGT GGGTATATGG TTCTTGCGTT GGCTCGTACG GTATGAGTTA TACAATTATT
CCGCAAGAGA CACCGTGTTT ACATTGTGTG CTGAAGAACG TTCCAGTTAC AGGTGTGACG
TGTGATACAG CTGGAATTAT TAGTCCGACT GTTCAAATCG TTGCAGCATA TCAAGTGGCG
GAAGCACTAA AAATTTTAGT AGAAGATTTT GCAGCAATTA GAAAAACATT TTTTATGTTT
GATATATGGA GTAATCAAAA CCATTTTATA AAACTAGGAA AAATCAAGAC AGACGATTGC
CCTTCGTGCG GTTTGAATCG AACTTATCCT TATTTATCAT ACGAAAATCA AACGAAGGTA
GCCGTTTTGT GCGGAAGAAA TACAGTTCAA ATTAGAACGG TAGAAAGTAG ACAGTACAAT
TTTGATGATA TAGAAAAAGT ATTAAAAAAA CTGGGGGAAG TAGATCGGAA TCCGTATTTA
CTATCTTGCC AACTAGATGA GTACCGCGTC GTTATTTTTC GAGATGGTCG TGTTTTCATT
CATGGTACAA ATGATATTTC AAAAGCGAAA CAGTTATATT ATCGCGTATT CGGTTAA
 
Protein sequence
MAERYSRQQL FKPIGDRGQE KIRNKHVLIV GAGALGSASA ESFVRAGIGK LTIIDRDYVE 
WSNLQRQQLY SEEDAREKLP KAIAAKNRLE KLNSEVQIDA FVMDACAENL EGLLENVDVI
IDATDNFDIR FIINDLSQKY NIPWVYGSCV GSYGMSYTII PQETPCLHCV LKNVPVTGVT
CDTAGIISPT VQIVAAYQVA EALKILVEDF AAIRKTFFMF DIWSNQNHFI KLGKIKTDDC
PSCGLNRTYP YLSYENQTKV AVLCGRNTVQ IRTVESRQYN FDDIEKVLKK LGEVDRNPYL
LSCQLDEYRV VIFRDGRVFI HGTNDISKAK QLYYRVFG