Gene GBAA_4899 details


Gene Information

Locus tag: GBAA_4899
Symbol: thiI
ID: 2820124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 4450418
End bp: 4451629
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 38%
IMG OID: 637791572
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_021542
Protein GI: 47778341
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
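
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 4450418
end_bp = 4451629
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # coordinate span agrees with Gene Length
print(span % 3 == 0)                  # CDS length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop agrees with Protein Length
```

Under those assumptions the three fields agree: 1212 bases give 404 codons, and dropping the stop codon leaves 403 residues.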


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATATG AATATATTTT AGTTCGTTAT GGAGAGATGA CGACTAAAGG TAAGAACCGT 
TCTAAATTTG TAAGCACATT AAAAGATAAC GTAAAGTTCA AACTGAAAAA ATTCCCAAAT
ATTAAAATCG ATGCAACACA TGATCGTATG TACATCCAAT TAAATGGCGA AGATCATGAA
GCGGTATCTG AAAGATTGAA AGATGTATTT GGTATTCATA AGTTTAACTT AGCGATGAAA
GTACCATCAG AATTAGAAGA CATTAAAAAA GGTGCATTAG CAGCTTTCTT ACAAGTAAAA
GGTGATGTGA AAACATTTAA AATTACTGTA CACCGTTCTT ATAAGCATTT CCCAATGAGA
ACGATGGAAT TATTACCTGA GATTGGTGGA CATATTCTAG AAAATACAGA AGATATTACT
GTGGATGTTC ATAATCCAGA TGTAAATGTA CGCGTAGAAA TCCGTAGCGG TTATAGCTAC
ATTATGTGTG ATGAGCGTAT GGGAGCTGGC GGTTTACCAG TTGGCGTTGG CGGAAAAGTA
ATGGTACTTC TTTCTGGCGG TATTGATAGC CCAGTAGCAG CGTACTTAAC GATGAAACGG
GGCGTATCTG TGGAAGCAGT TCACTTCCAT AGCCCGCCTT TCACAAGTGA GCGCGCGAAA
CAAAAAGTAA TCGATTTAGC ACAAGAATTA ACGAAATACT GTAAACGTGT AACACTTCAC
CTTGTTCCAT TTACAGAAGT GCAAAAAACG ATTAATAAAG AAATCCCATC TAGCTATTCA
ATGACAGTTA TGCGCCGTAT GATGATGCGT ATTACAGAGC GTATCGCAGA GGAGCGTAAC
GCACTAGCAA TCACGACTGG TGAAAGTCTT GGACAAGTAG CAAGTCAAAC GTTAGATAGT
ATGCATACAA TTAACGAAGT AACAAACTAC CCAGTTATTC GTCCGCTTAT TACGATGGAT
AAATTAGAGA TTATTAAAAT CGCTGAAGAG ATCGGCACAT ATGATATTTC AATTCGTCCG
TACGAAGATT GCTGTACAGT ATTCACACCA GCAAGCCCAG CGACGAAGCC GAAGCGTGAA
AAAGCGAATC GTTTTGAAGC GAAATATGAT TTCACACCAT TAATCGATGA AGCTGTAGCG
AACAAAGAAA CAATGGTATT ACAAACGGTA GAAGTAGTAG CGGAAGAAGA AAAATTCGAA
GAACTTTTCT AA
 
Protein sequence
MTYEYILVRY GEMTTKGKNR SKFVSTLKDN VKFKLKKFPN IKIDATHDRM YIQLNGEDHE 
AVSERLKDVF GIHKFNLAMK VPSELEDIKK GALAAFLQVK GDVKTFKITV HRSYKHFPMR
TMELLPEIGG HILENTEDIT VDVHNPDVNV RVEIRSGYSY IMCDERMGAG GLPVGVGGKV
MVLLSGGIDS PVAAYLTMKR GVSVEAVHFH SPPFTSERAK QKVIDLAQEL TKYCKRVTLH
LVPFTEVQKT INKEIPSSYS MTVMRRMMMR ITERIAEERN ALAITTGESL GQVASQTLDS
MHTINEVTNY PVIRPLITMD KLEIIKIAEE IGTYDISIRP YEDCCTVFTP ASPATKPKRE
KANRFEAKYD FTPLIDEAVA NKETMVLQTV EVVAEEEKFE ELF
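
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it builds the codon table by hand in NCBI codon order, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last stretches of the gene sequence are used here.

```python
from itertools import product

# Codons enumerated in NCBI order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
# Amino acids in the same order; '*' marks a stop codon.
# Assumption: translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_bases = "ATGACATATG AATATATTTT AGTTCGTTAT"
last_bases = "GAACTTTTCT AA"

print(translate(first_bases))  # MTYEYILVRY
print(translate(last_bases))   # ELF
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record. Passing the full gene sequence to `translate` would be the natural next check.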