Gene Ccel_2904 details

Gene Information

Locus tag: Ccel_2904
Symbol:
ID: 7311520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3458900
End bp: 3460147
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 37%
IMG OID: 643609804
Product: carboxyl-terminal protease
Protein accession: YP_002507178
Protein GI: 220930269
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
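
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3458900
END_BP = 3460147
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3460147 - 3458900 + 1 gives 1248 bp, and 1248 / 3 - 1 gives 415 residues, which agrees with the listed values.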


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0010841
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAATA TGAAGGACCA AGAAACGAAA CGTCTGTTTT CAGTAATAAC CATGACGGCA 
GTGGTGTGTT TTACAATATC AATCCTAGTG TATGGCGGCT TAATGTATTT TAACGGAAGT
TATTCACTGA TATTCAATAA AAATAGTGTA GACAGGGAAA CGATACAAAA ATTTAATGAA
GCAAGAAGTA TACTTCAGAA AGCCTATTAT GAAAATGTAG ACACTAACAA ACTGGTGGAG
GGTGCAATTA GCGGTATGAC AGAGTCACTG AATGATCCCT ATACAGTATA TTATAATAAG
CAGCAAATGA AGTGGTTTAC AGGTCTTCAG AACAATACAG AGAATGAGTA TGTGGGGGTT
GGACTGCCGA TAATGCTAGA TAAAAACGGA ATAGTAACCG TTTTAGAGCC TTACGATAAT
TCCCCTGCAA AAATTGCAGG AATCAAGCAA GGAGATAAAA TACTTAAAAT AGATGGTAAA
GACATAACAG GAATCAAGGA TGAAACACTG GTTGCCAGCA TGATTAAAGG ACCTGAGAAC
ACCGAGACGG TTCTGACTAT TCTTCGAGAA TCAGACAACA GTACCATTGA TATCCCAGTA
ATGAGAAAAA AGATTAAAGC CCTGGTGAAT ATAAGAAGTG AAATGTTGGA TGGAAATATT
GCATATATTA AGCTTAAAAT GTTTGATAAA AATATTAGCA AGAACTTTAT CAGTCAGTTA
AACAAATTGG TTAAGCAAGG TGCTAAAGGC TTAATAATAG ATGTGAGGGA CAATCCGGGG
GGATTATATG ATGAAGTAGT GACATTGGCA GACCGACTTC TTCCAAAGGG AACAATAGTA
TTTACAAAGG ATAAAAACGG TAAAAAAAGT GTGCAGTCGT CTGATGAAAA TGAACTTAAT
ATGCCCATAG CTGTAATTAC AAATGGTAAC AGTGCAAGTG CTTCGGAAAT TCTGGCAGGT
GCTGTTAAGG ACTTTAAAAA GGGAACACTA ATAGGAACTA AAACCTTTGG AAAAGGACTG
GTGCAGACAA CCTATTCTTT TAAGGACGGA ACAGGACTTA AGGTAACAAT AGCAAGGTAT
TATACACCTT CCGGTGTTTG TATACAGGGA CAGGGTATAA AACCTGAAAT CGAAGTAAAG
CTTCCCGAAA AGTACAAAGA CATCGATGTT GCAGCAATTC CCAAGGAAGA TGACTTACAA
CTTCAAAAGG GTATTGAAGT TATTAGCAAA AAAATAATAT CAGATTAG
 
Protein sequence
MLNMKDQETK RLFSVITMTA VVCFTISILV YGGLMYFNGS YSLIFNKNSV DRETIQKFNE 
ARSILQKAYY ENVDTNKLVE GAISGMTESL NDPYTVYYNK QQMKWFTGLQ NNTENEYVGV
GLPIMLDKNG IVTVLEPYDN SPAKIAGIKQ GDKILKIDGK DITGIKDETL VASMIKGPEN
TETVLTILRE SDNSTIDIPV MRKKIKALVN IRSEMLDGNI AYIKLKMFDK NISKNFISQL
NKLVKQGAKG LIIDVRDNPG GLYDEVVTLA DRLLPKGTIV FTKDKNGKKS VQSSDENELN
MPIAVITNGN SASASEILAG AVKDFKKGTL IGTKTFGKGL VQTTYSFKDG TGLKVTIARY
YTPSGVCIQG QGIKPEIEVK LPEKYKDIDV AAIPKEDDLQ LQKGIEVISK KIISD