Gene GBAA_0733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_0733 
Symbol 
ID2814929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp758341 
End bp759360 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content37% 
IMG OID637787730 
Productthiamine/molybdopterin biosynthesis ThiF/MoeB-like protein 
Protein accessionYP_017366 
Protein GI47526017 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAATC GATATTCTCG CCAAGAATTA TTTTCTCCGA TTGGGGAGGA AGGCCAGCAA 
AAGATAAGAA AAAAGCATGT ACTCATTATC GGTGCAGGCG CACTAGGTAG TGCAAATGCA
GAAATGTTTG TAAGAGCAGG TGTTGGCACA GTAACGATTG TTGATCGTGA TTATGTCGAT
TGGAGTAATT TACAAAGGCA GCAATTGTAT GTAGAGAGTG ATGTGGAAAA TAATCTTCCG
AAGGCTGTAG CAGCAAAGAA GCGTCTAGAA GAGATTAATA GTGAAGTAAG AGTAAAAGCG
CTCGTTCAAG ATGTAACAGC TGAGGAATTA GAAGAGCTTG TTACAAACGT TAATGTAATG
ATTGATGCAA CTGATAATTT CGAAACGCGT TTCATTGTGA ATGATATAGC ACAAAAATAT
TCTATTCCAT GGATTTACGG AGCATGTGTA GGGAGTTACG GCCTTTCTTA CACAATCCTT
CCTAGTAAAA CGCCATGTTT ATCTTGTTTA TTACAATCGA TTCCGCTTGG CGGGGCGACA
TGTGATACAG CGGGGATTAT ATCGCCTGCT GTATCTCTCG TCGTTTCTCA TCAAGTAACG
GAAGCTCTTA AACTATTAGT GGAAGATTAC GAATCACTTC GAGATGGACT TGTATCGTTT
GATGTATGGA AGAATGAATA TTCATGTATG AATGTGCAAA AGCTTCGTAA ACATAATTGC
CCTTCGTGCG GAGAGAATGC ATTATATCCT TATTTAAACA AAGAAAATAC ATCGAAAACA
GCAGTTTTAT GCGGGAGAAA TACAGTTCAA ATTAGACCAC CTTATAAAGA GGAAATGGAT
TTTGAACGAT ACAAAGAGCT GCTGAATGAT CGTGTAAATG ATTTAAATGT AAATCCATAT
TTATTATCAT TTTCTGTGGA AGAAAAGAGA TTAGTTGCTT TTAAAGATGG TCGCGTACTT
GTACATGGAA CGAAAGATAT AAGTGAAGCA AAAACAGTTT ATCATCGTTA TTTTGGATAG
 
Protein sequence
MNNRYSRQEL FSPIGEEGQQ KIRKKHVLII GAGALGSANA EMFVRAGVGT VTIVDRDYVD 
WSNLQRQQLY VESDVENNLP KAVAAKKRLE EINSEVRVKA LVQDVTAEEL EELVTNVNVM
IDATDNFETR FIVNDIAQKY SIPWIYGACV GSYGLSYTIL PSKTPCLSCL LQSIPLGGAT
CDTAGIISPA VSLVVSHQVT EALKLLVEDY ESLRDGLVSF DVWKNEYSCM NVQKLRKHNC
PSCGENALYP YLNKENTSKT AVLCGRNTVQ IRPPYKEEMD FERYKELLND RVNDLNVNPY
LLSFSVEEKR LVAFKDGRVL VHGTKDISEA KTVYHRYFG