Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1402 |
Symbol | thiI |
ID | 4205931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1575993 |
End bp | 1577150 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642565956 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_698721 |
Protein GI | 110803991 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATT TAATTTTAGT AAAATATGCC TCAGAAATAT TTTTAAAGGG GCTTAATAAA AATAAGTTTG AGAGAAAATT AAAAGAAAAT ATAAGAAAAA AGTTAAAAGA TATAGATCAT GAATTTATAA CAGATCAAAA TAGATGGTTC ATAAAATCAG AAGACTTAGA TGGAGTTATT GAAAGAGTAA AAAAGGTTTT TGGAGTTAAA GAACTTTGTT TAGTTACTCA GGTTACAGGG GACTTTAATT CAATAAAAGA AGAGGGATTA AAGAAAATTA AAGAAAGCAA AGCTAAGAGT TTCAAAGTAG AAACAAATAG AGCTAATAAA AAATTCCCCA TGAATTCTAT GGAGGTTTCA AGAGCTGTTG GAGGATATAT CCTTTCAGAA CTTGGGGATG AAATAGAAGT TGATATACAT AATCCAGAGT GTAAGCTTTA TGTAGAAATA AGAGGAAATG CTTATGTATT TACTGATAAA GATAAAATAA AGGCTGTAGG AGGCTTACCA TATGGAATGA ACGGAAGTAC TATGGTTATG TTATCAGGAG GAATTGATTC ACCAGTAGCA GCTTACTTAA TGGCTAGAAG AGGAGTTGAA ACTCATTGTG TATATTATCA TTCTCATCCA TACACTTCAG AAAGAGCCAA GGATAAGGTT AAGGAATTAG CAAAAATAGT AGGAAGATAC ACAGAAAAAA TAACTCTTTA TGTGGTTCCT TTTACAGAAA TACAAATGGA TATAATAGAG AAGTGTAGAG AAGATGAATT AACAATAATA ATGAGAAGAT TCATGATGAG AGTTGCTTGT GAACTTTCTG AAAGAAAGAA AATACAGTCA ATAACTACTG GAGAAAGTAT AGGGCAAGTA GCATCTCAAA CTATGGAAGG ACTTATGGTA AGTAATGATG TTTCAGATAG ACCTGTATTT AGACCTCTAA TAGCTATGGA TAAAGAGGAT ATAATGGATA TAGCAAGAGA TATAGATACT TATGACACAT CAATACTTCC ATATGAAGAT TGCTGCACAA TATTTGTACC AAAACATCCA AAGACTAAGC CTAGAGTTAA GGACATGATA ATAGCAGAAA GAAAGCTTGA TATAGAAGCT TTAGTAAATA AGGCTATTGA TGAAATGGAA ACTTTCATAT TTGAATAA
|
Protein sequence | MNNLILVKYA SEIFLKGLNK NKFERKLKEN IRKKLKDIDH EFITDQNRWF IKSEDLDGVI ERVKKVFGVK ELCLVTQVTG DFNSIKEEGL KKIKESKAKS FKVETNRANK KFPMNSMEVS RAVGGYILSE LGDEIEVDIH NPECKLYVEI RGNAYVFTDK DKIKAVGGLP YGMNGSTMVM LSGGIDSPVA AYLMARRGVE THCVYYHSHP YTSERAKDKV KELAKIVGRY TEKITLYVVP FTEIQMDIIE KCREDELTII MRRFMMRVAC ELSERKKIQS ITTGESIGQV ASQTMEGLMV SNDVSDRPVF RPLIAMDKED IMDIARDIDT YDTSILPYED CCTIFVPKHP KTKPRVKDMI IAERKLDIEA LVNKAIDEME TFIFE
|
| |