Gene CPF_1666 details


Gene Information

Locus tag: CPF_1666
Symbol: thiI
ID: 4201576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1882859
End bp: 1884016
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 31%
IMG OID: 638082541
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_696105
Protein GI: 110801269
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
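
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Consistency check for the record's coordinates and lengths.
# Assumption: Start bp / End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1882859, 1884016)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields in the record.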


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.357019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATT TAATTTTAGT AAAATATGCC TCAGAAATAT TTTTAAAGGG GCTTAATAAA 
AATAAGTTTG AGAGAAAATT AAAAGAAAAT ATAAGAAAAA AGTTAAAAGA TATAGATCAT
GAATTTATAA CAGATCAAAA TAGATGGTTC ATAAAATCAG AAGACTTAGA TGGAGTTATT
GAAAGGGTAA AAAAGGTTTT TGGAGTTAAA GAACTTTGCT TAGTTACTCA GGTTGAAGGG
GACTTTGATT CAATAAAAGA AGAGGGATTA AAGAAAATTA AAGAAAGCAA AGCTAAGAGT
TTCAAAGTAG AAACAAATAG AGCTAATAAA AAATTCCCGA TGAATTCTAT GGAGGTTTCA
AGAGCTGTTG GAGGATATAT CCTTTCAGAA CTTGGGGATG AAATAGAAGT TGATATACAT
AATCCAGAGT GTAAGCTTTA TGTAGAAATA AGAGGAAATG CTTATGTGTT TACTGATAAA
GATAAAATAA AGGCTGTAGG AGGCTTACCA TATGGAATGA ACGGAAGTAC TATGGTTATG
TTATCAGGAG GAATTGATTC ACCAGTAGCA GCTTATTTAA TGGCTAGAAG AGGAGTTGAA
ACTCATTGTG TATATTATCA TTCTCATCCA TACACTTCAG AAAGAGCTAA GGATAAGGTT
AAGGAATTAG CAAAAATAGT AGGAAGATAC ACAGAAAAAA TAACTCTTTA TGTGGTTCCT
TTTACAGAAA TACAAATGGA TATAATAGAG AAGTGTAGAG AAGATGAATT AACAATAATA
ATGAGAAGAT TCATGATGAG AGTGGCTTGT GAACTTTCTG AAAGAAAGAA AATACAGTCA
ATAACTACTG GAGAAAGTAT AGGGCAAGTG GCATCTCAGA CTATGGAAGG ACTTATGGTA
AGTAATGATG TTTCAGATAG ACCAGTATTT AGACCTCTAA TAGCTATGGA TAAAGAGGAT
ATAATGGATA TTGCAAGAGA TATAGATACT TATGAGACAT CAATACTTCC ATATGAAGAT
TGTTGTACAA TATTTGTACC AAAACATCCA AAGACTAAGC CTAGAGTTAA GGACATGATA
ATAGCAGAAA GAAAGCTTGA TATAGAAGCT TTAGTAAATA AAGCTATTGA TGAAATGGAA
ACTTTCATAT TTGAATAA
 
Protein sequence
MNNLILVKYA SEIFLKGLNK NKFERKLKEN IRKKLKDIDH EFITDQNRWF IKSEDLDGVI 
ERVKKVFGVK ELCLVTQVEG DFDSIKEEGL KKIKESKAKS FKVETNRANK KFPMNSMEVS
RAVGGYILSE LGDEIEVDIH NPECKLYVEI RGNAYVFTDK DKIKAVGGLP YGMNGSTMVM
LSGGIDSPVA AYLMARRGVE THCVYYHSHP YTSERAKDKV KELAKIVGRY TEKITLYVVP
FTEIQMDIIE KCREDELTII MRRFMMRVAC ELSERKKIQS ITTGESIGQV ASQTMEGLMV
SNDVSDRPVF RPLIAMDKED IMDIARDIDT YETSILPYED CCTIFVPKHP KTKPRVKDMI
IAERKLDIEA LVNKAIDEME TFIFE
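
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the standard codon-to-amino-acid map (translation table 11 uses the same assignments and differs only in which codons may act as starts) and translates the first and last blocks of the gene sequence. The `translate` helper is a hypothetical name introduced here.

```python
# Minimal codon-wise translation sketch (standard amino acid assignments,
# which translation table 11 shares). `translate` is an illustrative helper.
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # codons starting with T
         "LLLLPPPPHHQQRRRR"   # codons starting with C
         "IIIMTTTTNNKKSSRR"   # codons starting with A
         "VVVVAAAADDEEGGGG")  # codons starting with G
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAATAATT TAATTTTAGT AAAATATGCC"))  # MNNLILVKYA
print(translate("ACTTTCATAT TTGAATAA"))               # TFIFE
```

The two outputs match the first ten and last five residues of the protein sequence listed above.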