Gene Ccel_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1890 
Symbol 
ID7312296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2245123 
End bp2246688 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content39% 
IMG OID643608824 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_002506218 
Protein GI220929309 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.206887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGACG ATAGAGACGA TAAGATAAAT GAAAATGGTT CCAGTATTGA CCAGCCGCAG 
AATGATAACG GCACTTATGA AGCAAACAGC AATTCAGAAT CACAGGAATC TGTAAACCAA
ACTCAGGAAG ACATCCAGAA CGGTAGTAGT TGGCAATACG AAACAGTAAC TCCTGTAAGC
GAAAACAAGG AGCCTGTTGA TAATGTAGTG GGTGAAAATA CTAGTCAAGA TACTGTTACC
GATAATTACA ATTTAGATCC ACAAGAATCA CAAACCGAAA CACAAGATGC CAATAAGTCA
GGGTTTGGCG AAGATTGCAA TTTGGAAACA CAAGTAGAAC CTAACTCTGT TTACAATAAT
TATTACAGGG AAAACTGTAA AAAAACAAAT ACTAAAAAGT CAAATGCATG GAAATACGTA
TTGGTATCAG CTGTCAGCTC ACTTGTTGGA GCGGCACTTC TTGCATTGTT AATGCTTTTT
GTTGCACCGT TTGTACAGCC CCAAATAAAA TCATACCTTG GAAATAATTT CCCTGGCCTT
AAAACTGAGA GTACCCAGCC CAATACCGGC GAACTAAAGA GAGTTGAAAT AGTTCAAAGT
GGAGAATCAG CCGTTACAAG TGTAGCAGAG AAAGTCGGGC CTTCAGTTGT AGGTATTAAA
ACATCTTATC AGAATACCAA TGAATTGTTT GGAGTACAGT CCGGGGGTGG TGAAGGATCA
GGTATAATAA TAAGTGCTGA TGGATATATT CTTACAAACC ATCACGTTAT AGAAGGTGCC
TTAAATGATA AAACAAGAAA TATAAGAAGT GATGCAAAAA TTGAAGTGTT CCTTCCCAAC
AAAATAGACA AGCCTTACTC TGCTATAGTT AAAGGGTACG ATGCAAAGAC AGATCTGGCT
GTATTAAAAA TCAATGATAC AAACCTCCCT GTTATAGAGT TCGGTAATTC GAATGATATA
AAGATCGGTG AGCCGGCCAT TGCAGTAGGT AATCCAGGAG GCCTTGAATA CATGGGTTCA
GTAACATACG GCGTTATAAG CGGTTTAAAC AGAACTGTTC AGTTGGATGG AGGCAAGAGG
ATAAGACTGT TGCAGACAGA TGCTGCCATA AACCCCGGAA ACAGTGGTGG TGCATTGGTA
AATATCAAAG GACAGCTTAT TGGTGTAAAT ACAGTAAAGA TGGTTGCAAC AGGGTTTGAA
GGACTTGGTT TTGCAATCCC CGTAAATGAG GCAAAAACAA TAGCAGATGA ACTTATCACA
AAAACCTACA TTGCAAAACC TTATCTGGGT ATTTCAGTTA ACACGCAGTA TACAGAAGAT
ATAGCAAAGG CAAATAATAT GCCTGCCGGA GTATACGTGG CAGATGTTGA ACTTTTTGGT
GCAGCTGCTA AAGCGGGCAT AATGCCGGGT GATGTAATAA CCAAATTTAA CAACAAAGTA
ATTAAGTCTT ATGATGAGCT TGAAGATACA AAGAATAAGA TGAAACCTGG GGATGTGGTT
AAGATAGAGA TTTTCAGAGA CGGAAGTACA AAAACCGTTC AGGCTAAACT AGGTGAAACA
AAGTAA
 
Protein sequence
MIDDRDDKIN ENGSSIDQPQ NDNGTYEANS NSESQESVNQ TQEDIQNGSS WQYETVTPVS 
ENKEPVDNVV GENTSQDTVT DNYNLDPQES QTETQDANKS GFGEDCNLET QVEPNSVYNN
YYRENCKKTN TKKSNAWKYV LVSAVSSLVG AALLALLMLF VAPFVQPQIK SYLGNNFPGL
KTESTQPNTG ELKRVEIVQS GESAVTSVAE KVGPSVVGIK TSYQNTNELF GVQSGGGEGS
GIIISADGYI LTNHHVIEGA LNDKTRNIRS DAKIEVFLPN KIDKPYSAIV KGYDAKTDLA
VLKINDTNLP VIEFGNSNDI KIGEPAIAVG NPGGLEYMGS VTYGVISGLN RTVQLDGGKR
IRLLQTDAAI NPGNSGGALV NIKGQLIGVN TVKMVATGFE GLGFAIPVNE AKTIADELIT
KTYIAKPYLG ISVNTQYTED IAKANNMPAG VYVADVELFG AAAKAGIMPG DVITKFNNKV
IKSYDELEDT KNKMKPGDVV KIEIFRDGST KTVQAKLGET K