Gene Ccel_1938 details


Gene Information

Locus tag: Ccel_1938
Symbol:
ID: 7310653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2287740
End bp: 2288873
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 38%
IMG OID: 643608872
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_002506266
Protein GI: 220929357
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
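
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above.
start_bp = 2287740
end_bp = 2288873


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1134 377
```

Both values agree with the Gene Length and Protein Length fields listed above.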


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TTAATTTGGT AAGATGTGCA TTTCTAGTTA TTGTTTCTGT TTTATGTACT 
GCAAATACCT TTGCCGCCGG AACTACCCTT AGGATAAATG GGCAGGAACT CATCGACGGG
GTAAAAACAA TAGAGGGCAG GCAGTATATA TCTGCCGATG CAATATCCTC TCATTTGGAA
GGGATTACGG TTACTCAGGG AAACAATACC ATTGAAATAA ATTCTGTGAA CAAAATTTCA
AATGTAGTCT CAAAAGTAAG CCCTTCTGTT GTTGGAATTA TTGGTAAATT AAAAGAGAGC
AGTTATGAAT ATGATGAAAC TTCAGATAAT ATCATATTCG GTACAGGAGT TATATATCGC
AGTAGTGGTT ACATAATAAC AAATGCCCAT GTTGTAAAGG ATATGGAGAG TATTGTTGTA
GTACTTTCAA ACAGTAAAGC ATACAAGGCC AGACTTAAGG CTATTGATGA AGATCTCGAT
CTGGCAGAGA TAAAAATAGA TAAGGGCGGC TTGCAGCCTG CAAAATTTGG TGATATTTCG
CAAGTGGCAG TAGGGGATGA AGTCGTTGCA ATAGGAACAC CATTGTCCTT CGGACTTAGA
AATTCCGCGA CAAGGGGAAT AATAAGCGGA ATGAACAGGT CAGAGAACAG ACAGTATAGG
TTTATACAGA CAGATGCTGC TATCAATTCT GGAAACAGCG GCGGCCCACT GGTCAATATG
AAAGGTGAGG TTGTAGGGAT AAATTCATGG GTTTATGCTG GAATAGGCGT GCAGGGTATG
AGCTTTTCAA TACCTATAGA CTCTGTAAGA TACGCAATAA ACCAGTTTGA AAAGTTTGGA
AAGATAAGAC GACCCTACCT AGGTTTGGCT TTTTCCGATA GTATAACCTC AATATACGGA
CTACCGAATA CGGTGTCAGG GGTGACTGTA AAATCAATAG AAAAAGGTTC TCCTGCACAG
AAATACAATA TTAAAGTTGA TGATAGACTG ATTTCCATTA ATGGAATTAA GGTAAATTCG
ACAACAGATT ACAATGAAGA AATGAAGAAG TATCTGCCTG GAGACATTGC TGAATTCAAA
TTACAGCGTG ACAACAGGGA ATTCAGTATT TCAGTTACTT TTGGAGAAAA ATAA
 
Protein sequence
MKKFNLVRCA FLVIVSVLCT ANTFAAGTTL RINGQELIDG VKTIEGRQYI SADAISSHLE 
GITVTQGNNT IEINSVNKIS NVVSKVSPSV VGIIGKLKES SYEYDETSDN IIFGTGVIYR
SSGYIITNAH VVKDMESIVV VLSNSKAYKA RLKAIDEDLD LAEIKIDKGG LQPAKFGDIS
QVAVGDEVVA IGTPLSFGLR NSATRGIISG MNRSENRQYR FIQTDAAINS GNSGGPLVNM
KGEVVGINSW VYAGIGVQGM SFSIPIDSVR YAINQFEKFG KIRRPYLGLA FSDSITSIYG
LPNTVSGVTV KSIEKGSPAQ KYNIKVDDRL ISINGIKVNS TTDYNEEMKK YLPGDIAEFK
LQRDNREFSI SVTFGEK
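
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in T, C, A, G order; the 64-letter string lists the
# amino acid for each codon from TTT to GGG ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGAAAAAAT TTAATTTGGT AAGATGTGCA TTTCTAGTTA TTGTTTCTGT TTTATGTACT"
last_line = "TTACAGCGTG ACAACAGGGA ATTCAGTATT TCAGTTACTT TTGGAGAAAA ATAA"

print(translate(first_line))  # MKKFNLVRCAFLVIVSVLCT
print(translate(last_line))   # LQRDNREFSISVTFGEK
```

The two outputs match the first 20 and the last 17 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way should give the complete protein.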