Gene Ccel_2643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2643 
Symbol 
ID7311286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3183812 
End bp3185200 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content50% 
IMG OID643609568 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_002506947 
Protein GI220930038 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGC GTATATTTTT CGCTGTAGTG CTGGCCGCCA TGCTTTTGAC CGTATCTGCT 
TTGGCAGCCG AGGCTGAGGC GATACGGGTG GCTGTCATTG ATACGGGCAT TTCTGAAACA
GCCATCCCCA AGACAAACTT GTCCTCTGGG CGAAATTATA TCCTACCCAA TAAAACCACG
ACTGACACAG TTGGACACGG TACGGCAATA GCTTCTATCA TTGTAGGCAG CGAGACGGCA
GGGATAAAGG GGATTTGCCC TGAGGCTATG CTGGTTCCAT TGGTCTATTA CGCAAAAAAC
GAAGATGGAG GTACCATTAA GGGTGACGGT GTCATGCTGG CAAAGATAAT TCGTGACGCG
GTGGAGGTCT TTGACTGTAA AATCCTCAAC ATAAGTTCCG GTGTTTTGAC CGACACACCA
GCTTTGCGGG ATGCGGTAGC ATGGGCGGAG AAACAGGGAG CATTGGTAAT TTCCAGTGTG
GGAAATGACG GGAATGACAC CGTTTACTAT CCTGGTGCAT ACAGCAGTTC TTTATGTGTA
GGTGCGGTGA ATGACGCGAA CAGTGCTCCG GCAGACTTCT CCAACCGTAA TGAGGCAGTG
GATCTATTGG CTCCCGGAGA AAAGTTGCCC ACAGCTACCA TGAAGGGCAA TCGCCTGCTG
GCTAGTGGCA CTAGCTTTTC CACGGCCTAT ATTTCAGGTG TTGCAGCTAA GCTGATGAAG
GAATATCCCG ACCTAACAGC GGCACAGATA CGGCAGATTC TCTATGCCTC TGCCACAGAT
TTCGGCACTA CCGGCTATGA CAGAGTTTCC GGTTGGGGCA TTTTGAATCT GGAGCAGGCA
CTTGACTACG CACGGCAGGG CTGTCTGTTT CGGGATGTGG ATTCATCAAA ATGGTACTTT
GAAGGTGTGA GAAAAGCTGC AAAACTCGGA CTTTTTCAAG GGACGAGTGC GGTTGAATTT
TCTCCGAATC AGCCAACGAC CCGTGCCATG CTGTGGATGA TGCTCTATCG CTTGCATGGA
CTCAAGCCTT CTGAAAGCAC CACAATCTGG TATAGGGATG CTAGGTTGTG GGTAACAGCG
AATGGCATTT CTGACGGAAC AAACCCTAAT TGCACGATTA CTCGGGAACA GATGGCGGTC
ATGCTGTATG GTTATGCCTC AGTTTTCGAT TATGATATAG GTAAACGGGC GGATTTAAGT
AAATTTACCG ATTCCGACAG CATTAGTTCC TACGCAAAGG ATGCCCTCTC CTGGGCCAAT
GCCAGCGGAC TTATCAGCGG AACGGGTACG CAGACCCTAT CACCACAAGG CAGTGCTACC
AGAGCTCAGG TAGCGGTGAC TGTAATAAAG TTTTATGATT TAGTATTTGG TGGAGTGAGG
GGCACGTGA
 
Protein sequence
MKLRIFFAVV LAAMLLTVSA LAAEAEAIRV AVIDTGISET AIPKTNLSSG RNYILPNKTT 
TDTVGHGTAI ASIIVGSETA GIKGICPEAM LVPLVYYAKN EDGGTIKGDG VMLAKIIRDA
VEVFDCKILN ISSGVLTDTP ALRDAVAWAE KQGALVISSV GNDGNDTVYY PGAYSSSLCV
GAVNDANSAP ADFSNRNEAV DLLAPGEKLP TATMKGNRLL ASGTSFSTAY ISGVAAKLMK
EYPDLTAAQI RQILYASATD FGTTGYDRVS GWGILNLEQA LDYARQGCLF RDVDSSKWYF
EGVRKAAKLG LFQGTSAVEF SPNQPTTRAM LWMMLYRLHG LKPSESTTIW YRDARLWVTA
NGISDGTNPN CTITREQMAV MLYGYASVFD YDIGKRADLS KFTDSDSISS YAKDALSWAN
ASGLISGTGT QTLSPQGSAT RAQVAVTVIK FYDLVFGGVR GT