Gene BAS4020 details

Gene Information

Locus tag: BAS4020
Symbol:
ID: 2850961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3956591
End bp: 3957784
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 40%
IMG OID: 637507257
Product: bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein
Protein accession: YP_030270
Protein GI: 49187018
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
        [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
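
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that check follows. It assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3956591, 3957784)
print(length_bp, protein_length(length_bp))  # prints: 1194 397
```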


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCATC GTATTGAAGA AGCTCTAGAA GATTTAAAAC AAGGAAAAGT CGTTATCGTA 
TGTGATGATG AAAACCGTGA AAACGAAGGC GATTTTATCG CTTTAGCAGA GTATATTACG
CCCGAAACAA TTAACTTTAT GATTACACAT GGACGCGGTC TCGTTTGTGT ACCGATTACG
GAAGGATACG CAGAACGTCT ACAATTAGAA CCAATGGTAT CTCATAATAC AGATTCGCAT
CATACTGCAT TTACAGTGAG CATTGACCAT GTTTCTACAA CAACAGGCAT TAGCGCTCAC
GAACGTGCAA CTACAATACA ACAATTGTTA AATCCTGCAT CAAAAGGTGC TGATTTCAAT
CGACCTGGAC ATATCTTTCC ATTAATCGCG AAAGAAGGCG GTGTCCTGCG TCGTGCTGGT
CATACAGAAG CTGCTGTCGA TTTAGCACAG CTTTGCGGAG CAGAACCAGC TGGAGTCATT
TGCGAGATTA TCAATGAGGA CGGTACGATG GCACGCGTCC CTGATTTACT ACAGTGTGCA
AAACAATTTG ATATAAAAAT GATTACAATA GAAGATTTAA TTGCTTATCG CCGCCATCAT
GAAACACTTG TGACGAGAGA AGTGGAAATT ACATTACCTA CAGATTTCGG TACTTTTCAA
GCAATTGGCT ATTCTAACTC ATTAGATACG AAAGAACATA TTGCACTCGT AAAAGGTGAT
ATTTCAACAG GTGAGCCTGT ACTTGTACGC GTTCATTCAG AGTGCTTAAC AGGAGATGTA
TTTGGCTCGT GCCGCTGTGA TTGCGGACCA CAACTCCATG CTGCACTTGC TCAAATTGAA
CGTGAAGGAA AAGGCGTTCT TCTTTATATG AGACAAGAAG GACGAGGCAT TGGCCTTCTT
AATAAGCTTC GCGCTTATAA GTTACAAGAA GAAGGCTTCG ATACTGTAGA AGCAAACGAA
AAACTTGGGT TTCCCGCTGA CCTTCGTGAT TACGGTATCG GCGCTCAAAT ATTAAAAGAT
TTAGGTTTAC AACATTTACG ATTATTAACG AATAATCCAA GAAAAATCGC TGGCTTACAA
GGTTACGATT TAACCGTTAC GGAGCGCGTA CCGTTGCAAA TGCCAGCAAA AGAAGAGAAT
AAAACGTATT TACAAACGAA AGTAAACAAA TTAGGACATT TATTAAACTT ATAA
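
The GC content reported in the gene information can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as plain text with the space-separated 10-base grouping shown here. The helper name `gc_content` is hypothetical and not part of any tool referenced by this record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Example on the first three 10-base groups of the gene sequence only;
# pass the full sequence to compare against the reported value.
first_groups = "ATGTTTCATC GTATTGAAGA AGCTCTAGAA"
print(f"{gc_content(first_groups):.1%}")
```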
 
Protein sequence
MFHRIEEALE DLKQGKVVIV CDDENRENEG DFIALAEYIT PETINFMITH GRGLVCVPIT 
EGYAERLQLE PMVSHNTDSH HTAFTVSIDH VSTTTGISAH ERATTIQQLL NPASKGADFN
RPGHIFPLIA KEGGVLRRAG HTEAAVDLAQ LCGAEPAGVI CEIINEDGTM ARVPDLLQCA
KQFDIKMITI EDLIAYRRHH ETLVTREVEI TLPTDFGTFQ AIGYSNSLDT KEHIALVKGD
ISTGEPVLVR VHSECLTGDV FGSCRCDCGP QLHAALAQIE REGKGVLLYM RQEGRGIGLL
NKLRAYKLQE EGFDTVEANE KLGFPADLRD YGIGAQILKD LGLQHLRLLT NNPRKIAGLQ
GYDLTVTERV PLQMPAKEEN KTYLQTKVNK LGHLLNL