Gene Ccel_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1991 
Symbol 
ID7310702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2357340 
End bp2358650 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content42% 
IMG OID643608926 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002506319 
Protein GI220929410 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTACA AAACACAGAT GGACGCCGCA AAAAAAGGTA TTATTACAAA TGAAATGAAG 
ATTGTTTCTC GAAAGGAATC AATGGATGAA AATAAGCTTC GGGAGCTTGT TGCTGAGGGC
AGAATAGCTA TTCCGGCAAA TATAAATCAC AAGTCCTTAA GTCCGGAGGG AATAGGAGAA
GGTCTCAGAA CAAAAATCAA TGTAAACCTT GGTATATCAG GGGATTGTCC TGACTATACA
AAAGAAATGG AAAAGGCTGA TATGTCAATC AAATTTGGTG TTGAAGCCAT AATGGACCTT
AGTAATTACG GAAAGACAAA CACCTTTCGC AAAGAGCTTA TAAAGCGTTC TCCTGCCATG
ATAGGAACCG TTCCCATGTA TGATGCCATA GGCTACCTTG AAAAGGACCT GATGGATATT
AAAGCTTCGG ACTTTTTAAA AGTTGTTGAG GCTCATGCTG CAGAAGGTGT GGACTTTATG
ACAATTCATG CAGGAATAAA TAAACGTGCC GTTGAGTGTT TCAAACGCTC AAAGAGACTT
ACCAATATTG TTTCAAGAGG AGGTTCCCTT CTTTTTGCAT GGATGGAGAT GACTGGCAAT
GAAAATCCTT TTTTTGAGTA TTATGACGAT TTCCTTGGAA TACTCAGGGA ATATGATGTA
ACCATAAGTC TTGGAGATGC ATTGAGGCCC GGTAGTATCA ATGACAGCTC AGACGCCGGA
CAATTAAGTG AACTTATAGA ACTGGGCGAC CTGACCAAAC GTGCATGGGA AAAGGATGTA
CAGGTAATGG TTGAGGGGCC GGGACATATG GCTATGAACG AGATAGCCGC CAATATGACT
ATTCAAAAGA GACTATGTCA TGGAGCACCT TTTTATGTAC TGGGGCCTTT GGTGACAGAT
ATAGCACCGG GATATGACCA TATCACCTCG GCAATAGGCG GAGCAATTGC TGCGGCAAAC
GGTGCAGATT TCTTATGTTA TGTAACCCCT GCGGAGCATT TAAGGTTACC TGACCTTTCG
GATGTAAAAG AAGGAATAGT TGCTTCAAAA ATAGCCGCTC ATGCCGCCGA TATTGCAAAG
GGAATACCTA ATGCACGTGA AATAGATAAT AAAATGAGCG ATGCCAGACG CCGAATCGAC
TGGGAAGAAA TGTTCTCATA TGCTATTGAC GAAGATAAGG CAAGAGCATA TTTTGAAAGC
ACACCTCCCA CTGACAGACA TACCTGCTCA ATGTGCGGAA AAATGTGTGC TATGAGGACT
ACAAATAAGA TTTTAGCCGG TGAAAAGGTT GAGTTTGTAA CAGAGAAATA G
 
Protein sequence
MNYKTQMDAA KKGIITNEMK IVSRKESMDE NKLRELVAEG RIAIPANINH KSLSPEGIGE 
GLRTKINVNL GISGDCPDYT KEMEKADMSI KFGVEAIMDL SNYGKTNTFR KELIKRSPAM
IGTVPMYDAI GYLEKDLMDI KASDFLKVVE AHAAEGVDFM TIHAGINKRA VECFKRSKRL
TNIVSRGGSL LFAWMEMTGN ENPFFEYYDD FLGILREYDV TISLGDALRP GSINDSSDAG
QLSELIELGD LTKRAWEKDV QVMVEGPGHM AMNEIAANMT IQKRLCHGAP FYVLGPLVTD
IAPGYDHITS AIGGAIAAAN GADFLCYVTP AEHLRLPDLS DVKEGIVASK IAAHAADIAK
GIPNAREIDN KMSDARRRID WEEMFSYAID EDKARAYFES TPPTDRHTCS MCGKMCAMRT
TNKILAGEKV EFVTEK