Gene Ccel_1683 details


Gene Information

Locus tag: Ccel_1683
Symbol:
ID: 7310426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2026988
End bp: 2028166
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 38%
IMG OID: 643608611
Product: thiamine biosynthesis/tRNA modification protein ThiI
Protein accession: YP_002506014
Protein GI: 220929105
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
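
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2026988, 2028166)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed 1179 bp and 392 aa.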


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AAATAATACT GGTACGTTAT GGAGAGATAA TATTAAAAGG TTTAAACAGG 
CCCGTTTTTG AAGATAAGCT TATTGGAAAT ATAAAGAGTG CTATTTTCAA ATTTGGAAAA
GCTAGGGTAA TCAAATCACA AGGCAGAATT TATATTGAAC CTCAAGAAGA GAACTATGAC
TTTGATTCAG TTCTTGTAAA AGTAACGAAA GTATTTGGTG TTGTTTCTGT AAGTCCTGTG
TGGAAAGTTG AAACAGACTA TGAAATAATC AAGGATACTT CCCTAAAACT GGCTTCTAAA
CTGGTAGAAG AAAAGAGCTA CAAGACATTC AAGGTAGAAA CAAAAAGAGG GAACAAGAGA
TTTCCAATGC AGTCACCTGA AATCAGTGCT GATGTAGGAG GCTTTATTTT AGAAAATATT
CCGCAGCTAT CAGTTGATGT CAAAAATCCT GATTTTATCA TATTTCTTGA AGTAAGAGAA
AGTACTTATA TCTATTCAGA AATGATGAAG GCACAGGGAG GTATGCCTCT TGGGTCTAAC
GGCAAAGCGA TGCTGCTTTT GTCGGGAGGA ATTGACAGTC CGGTTGCAGG TTGGATGATG
GGTAAAAGAG GTGTGGAGAT TGAAGCCGTT CATTTCTTTA GCTACCCTTA TACAAGTGAA
AGAGCAAAAC AAAAGGTAAT TGATCTGGCA CAAATAATGG CACAGTACTG CGGAAAAATT
CGTCTGCACG TTGTTCCGTT TACCGAGATT CAACTAAAAA TCAACGATAA TTGCCCTGAG
GAACAGCTTA CTATCATTAT GCGAAGGATT ATGATGAAAA TAGCGGAACA AATAGCTGTA
AAAGTAAATG CCATGGCACT TATTACCGGG GAAAGTATGG GGCAAGTTGC CAGCCAGACC
ATGCAGAGCC TTTACTGTAC GGATGCAGCA GTAAATATGC CGGTATTCAG GCCATTGATC
GGTATGGACA AGGTTGAAGT GGTGGATATA GCTAGGAGAA TTGATACTTT TGATACTTCT
GTTCTTCCAT ACGAAGATTG CTGTACTGTA TTTGTTGCAA AGCACCCTCA AACCAAGCCT
AAGCTTGATA GAATAATAGA GTCAGAGTCA GTTGTTGACT TTGAACCACT TATAAATACC
GCAATCGAAA ATACCGAAGT AATTGTTATA AAGCCATAG
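
The GC content field in the gene information can be recomputed from a sequence block like the one above. This is a minimal sketch, assuming the sequence is plain text with bases grouped in tens and separated by whitespace. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna):
    """Percentage of G and C bases, ignoring whitespace between groups."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 60 bases of the gene sequence, copied from the record.
first_line = "ATGAACAAAA AAATAATACT GGTACGTTAT GGAGAGATAA TATTAAAAGG TTTAAACAGG"
print(round(gc_percent(first_line), 1))
```

Passing the full 1179 bp sequence instead of the first line gives a value that can be compared with the listed GC content.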
 
Protein sequence
MNKKIILVRY GEIILKGLNR PVFEDKLIGN IKSAIFKFGK ARVIKSQGRI YIEPQEENYD 
FDSVLVKVTK VFGVVSVSPV WKVETDYEII KDTSLKLASK LVEEKSYKTF KVETKRGNKR
FPMQSPEISA DVGGFILENI PQLSVDVKNP DFIIFLEVRE STYIYSEMMK AQGGMPLGSN
GKAMLLLSGG IDSPVAGWMM GKRGVEIEAV HFFSYPYTSE RAKQKVIDLA QIMAQYCGKI
RLHVVPFTEI QLKINDNCPE EQLTIIMRRI MMKIAEQIAV KVNAMALITG ESMGQVASQT
MQSLYCTDAA VNMPVFRPLI GMDKVEVVDI ARRIDTFDTS VLPYEDCCTV FVAKHPQTKP
KLDRIIESES VVDFEPLINT AIENTEVIVI KP
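
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce that without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAACAAAA AAATAATACT GGTACGTTAT"))
```

The first ten codons give MNKKIILVRY, which matches the start of the protein sequence above.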