Gene CPR_1402 details


Gene Information

Locus tag: CPR_1402
Symbol: thiI
ID: 4205931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1575993
End bp: 1577150
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 31%
IMG OID: 642565956
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_698721
Protein GI: 110803991
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
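
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1575993, 1577150)
print(length)                     # 1158
print(protein_length_aa(length))  # 385
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.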


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATT TAATTTTAGT AAAATATGCC TCAGAAATAT TTTTAAAGGG GCTTAATAAA 
AATAAGTTTG AGAGAAAATT AAAAGAAAAT ATAAGAAAAA AGTTAAAAGA TATAGATCAT
GAATTTATAA CAGATCAAAA TAGATGGTTC ATAAAATCAG AAGACTTAGA TGGAGTTATT
GAAAGAGTAA AAAAGGTTTT TGGAGTTAAA GAACTTTGTT TAGTTACTCA GGTTACAGGG
GACTTTAATT CAATAAAAGA AGAGGGATTA AAGAAAATTA AAGAAAGCAA AGCTAAGAGT
TTCAAAGTAG AAACAAATAG AGCTAATAAA AAATTCCCCA TGAATTCTAT GGAGGTTTCA
AGAGCTGTTG GAGGATATAT CCTTTCAGAA CTTGGGGATG AAATAGAAGT TGATATACAT
AATCCAGAGT GTAAGCTTTA TGTAGAAATA AGAGGAAATG CTTATGTATT TACTGATAAA
GATAAAATAA AGGCTGTAGG AGGCTTACCA TATGGAATGA ACGGAAGTAC TATGGTTATG
TTATCAGGAG GAATTGATTC ACCAGTAGCA GCTTACTTAA TGGCTAGAAG AGGAGTTGAA
ACTCATTGTG TATATTATCA TTCTCATCCA TACACTTCAG AAAGAGCCAA GGATAAGGTT
AAGGAATTAG CAAAAATAGT AGGAAGATAC ACAGAAAAAA TAACTCTTTA TGTGGTTCCT
TTTACAGAAA TACAAATGGA TATAATAGAG AAGTGTAGAG AAGATGAATT AACAATAATA
ATGAGAAGAT TCATGATGAG AGTTGCTTGT GAACTTTCTG AAAGAAAGAA AATACAGTCA
ATAACTACTG GAGAAAGTAT AGGGCAAGTA GCATCTCAAA CTATGGAAGG ACTTATGGTA
AGTAATGATG TTTCAGATAG ACCTGTATTT AGACCTCTAA TAGCTATGGA TAAAGAGGAT
ATAATGGATA TAGCAAGAGA TATAGATACT TATGACACAT CAATACTTCC ATATGAAGAT
TGCTGCACAA TATTTGTACC AAAACATCCA AAGACTAAGC CTAGAGTTAA GGACATGATA
ATAGCAGAAA GAAAGCTTGA TATAGAAGCT TTAGTAAATA AGGCTATTGA TGAAATGGAA
ACTTTCATAT TTGAATAA
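
The gene sequence is listed in blocks of 10 bases, so the spacing has to be removed before the sequence is used. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and how a GC fraction like the one reported in the record could be computed. It assumes GC content is simply the share of G and C bases over the total length.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; the full gene has 1158 bases.
first_row = "ATGAATAATT TAATTTTAGT AAAATATGCC TCAGAAATAT TTTTAAAGGG GCTTAATAAA"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))  # 0.2
```

The 0.2 printed here applies to the first 60 bases alone. Running the same functions over all rows of the listing would give the value to compare with the GC content field.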
 
Protein sequence
MNNLILVKYA SEIFLKGLNK NKFERKLKEN IRKKLKDIDH EFITDQNRWF IKSEDLDGVI 
ERVKKVFGVK ELCLVTQVTG DFNSIKEEGL KKIKESKAKS FKVETNRANK KFPMNSMEVS
RAVGGYILSE LGDEIEVDIH NPECKLYVEI RGNAYVFTDK DKIKAVGGLP YGMNGSTMVM
LSGGIDSPVA AYLMARRGVE THCVYYHSHP YTSERAKDKV KELAKIVGRY TEKITLYVVP
FTEIQMDIIE KCREDELTII MRRFMMRVAC ELSERKKIQS ITTGESIGQV ASQTMEGLMV
SNDVSDRPVF RPLIAMDKED IMDIARDIDT YDTSILPYED CCTIFVPKHP KTKPRVKDMI
IAERKLDIEA LVNKAIDEME TFIFE
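
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a hedged sketch rather than the database's own method: `translate_cds` is a hypothetical helper, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code. Table 11 differs in its set of permitted start codons, but this gene begins with ATG, so no special handling of the first codon is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence.
print(translate_cds("ATGAATAATT TAATTTTAGT AAAATATGCC"))  # MNNLILVKYA
print(translate_cds("ACTTTCATAT TTGAATAA"))               # TFIFE
```

Both fragments match the start and end of the listed protein sequence. The final row can be translated on its own because the 1140 bases before it are a multiple of 3, so it starts on a codon boundary.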