Gene Information |
Locus tag | BAS4545 |
Symbol | |
ID | 2850079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 4450980 |
End bp | 4452194 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637507782 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_030792 |
Protein GI | 49187539 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
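The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are common conventions but are not stated in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4450980
end_bp = 4452194

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases regardless of strand.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and its final codon is a stop codon, which yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1215
print(protein_length)  # 404
```

Under these assumptions the computed values agree with the listed Gene Length (1215 bp) and Protein Length (404 aa).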
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTAAAGT TCAAACTGAA AAAATTCCCA AATATTAAAA TCGATGCAAC ACATGATCGT ATGTACATCC AATTAAATGG CGAAGATCAT GAAGCGGTAT CTGAAAGATT GAAAGATGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA AAAGGTGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG AGAACGATGG AATTATTACC TGAGATTGGT GGACATATTC TAGAAAATAC AGAAGATATT ACTGTGGATG TTCATAATCC AGATGTAAAT GTACGCGTAG AAATCCGTAG CGGTTATAGC TACATTATGT GTGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA GTAATGGTAC TTCTTTCTGG CGGTATTGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA CGGGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCGC CTTTCACAAG TGAGCGCGCG AAACAAAAAG TAATCGATTT AGCACAAGAA TTAACGAAAT ACTGTAAACG TGTAACACTT CACCTTGTTC CATTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT AACGCACTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACGTTAGAT AGTATGCATA CAATTAACGA AGTAACAAAC TACCCAGTTA TTCGTCCGCT TATTACGATG GATAAATTAG AGATTATTAA AATCGCTGAA GAGATCGGCA CATATGATAT TTCAATTCGT CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAGCGT GAAAAAGCGA ATCGTTTTGA AGCGAAATAT GATTTCACAC CATTAATCGA TGAAGCTGTA GCGAACAAAG AAACAATGGT ATTACAAACG GTAGAAGTAG TAGCGGAAGA AGAAAAATTC GAAGAACTTT TCTAA
|
Protein sequence | MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF
|
| |
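As a rough consistency check between the two sequences above, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, it assumes the gene sequence is already given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the amino-acid assignments of translation table 11, which are the same as the standard code for the codons involved.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino-acid
# string is the one used for the standard code and for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' = stop)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence shown above.
first_bases = "ATGATGACAT ATGAATATAT TTTAGTTCGT"
last_bases = "GAAGAACTTTTCTAA"

print(translate(first_bases))  # MMTYEYILVR
print(translate(last_bases))   # EELF*
```

The translated fragments match the start (MMTYEYILVR) and end (EELF) of the listed protein sequence, with the final TAA codon acting as the stop.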