Gene BAS4545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4545 
Symbol 
ID2850079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4450980 
End bp4452194 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID637507782 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_030792 
Protein GI49187539 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTAAAGT TCAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGATCGT ATGTACATCC AATTAAATGG CGAAGATCAT
GAAGCGGTAT CTGAAAGATT GAAAGATGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGTGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACGATGG AATTATTACC TGAGATTGGT GGACATATTC TAGAAAATAC AGAAGATATT
ACTGTGGATG TTCATAATCC AGATGTAAAT GTACGCGTAG AAATCCGTAG CGGTTATAGC
TACATTATGT GTGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG CGGTATTGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA
CGGGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCGC CTTTCACAAG TGAGCGCGCG
AAACAAAAAG TAATCGATTT AGCACAAGAA TTAACGAAAT ACTGTAAACG TGTAACACTT
CACCTTGTTC CATTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT
AACGCACTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACGTTAGAT
AGTATGCATA CAATTAACGA AGTAACAAAC TACCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAGATCGGCA CATATGATAT TTCAATTCGT
CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAGCGT
GAAAAAGCGA ATCGTTTTGA AGCGAAATAT GATTTCACAC CATTAATCGA TGAAGCTGTA
GCGAACAAAG AAACAATGGT ATTACAAACG GTAGAAGTAG TAGCGGAAGA AGAAAAATTC
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF