Gene Ccel_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0520 
Symbol 
ID7309390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp598883 
End bp600103 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content36% 
IMG OID643607450 
Productpeptidase S11 D-alanyl-D-alanine carboxypeptidase 1 
Protein accessionYP_002504882 
Protein GI220927973 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA AAATACTTAT AGTTTTTGTG GCTTTAATAT TGATATTTAA TTGTCTCCCG 
GTTTTTGCAT GGACGCCGAC AGAAGGTCTT ACTTCCAGCT CTGTAGTTTT AATGGATACT
GTAAGAGGAC AGATTCTTTT CGAAAAAAAT GCTTATGCAC ATTTATCACC TTCAGTTATG
TGTAAGCTTA TGACTGCCCT TGTTACCATT GAGAAAACTG ATTTAAATGC AAAGGTTACA
ATCAGTAAAA ATGCCGCCAG CTTTAAGGGG CTGAATCTTG TGGTAGGGAA CCAGTATACG
GTTGAGGATT TGCTTTACGC CGTAATGCTT TCACAGGGAA ACGATGCGGC TGTGGCACTT
GCCGAATATG TAGGTGACGG TGATATCCAA AAATTTGTCA GATATATGAA TACAAAAGCT
AAGGAGCTGT CCCTTAAAGA TACATATTTT GTAAATCCTA CTGGCCTTTA CGAAAAGGAT
CAATATACAT CCGCCAAGGA CATTGCAGTG CTGGTAAAGG CAGCGATTTC TAATAATACT
TTTAATTTGA TGTTCGGTGC AAGAGGTTTT GGTTGGCTCA ACGGAAATAA TTCTTCAATT
TTAACTAACC AAAATACTCT TTTTTGGAGC TATAAAGGCG TTGACGGGGG AAAAATAGGT
ACCAATACAA ACCCTCAGAG TATTTCTGCT GTTACTACAG CTACCCTCAA TGAAAAAAGA
CTTATTGCTG TGGTATTTAA TACAAATGAG GAAAATGCCT TTTCCGAGAC GGCAAAACTC
TTTGATTATG GGTTCTCAAA ATTTTATACA GGAGTTCTTG TTCCTAAGAA TATTCCTCAA
AGAAGTATAG AAGTGGACAA TGTAAAGGTT GATCTTGTAA GTAAAATTGA TGTCTATTAT
ACCTATCCAG TGGGAGACAG CTTTATTAAT AATATCAGCT TTACACCTAA TGAAAAGCTT
AAACTTCCGC TTAATACCGA GACTATTGCA GGTGTTTTGA AATACACCCT GAACGATAAT
ACCGTAATTG AAGTTAATCT GTACTCCGAT AAGACAGTGT CAGCACCTGA GGATTACATA
TCAAAAATTA AAAACATTGT TACTGAAAAT CGGGATCTTG TAATAATCGT AGGTATTCTT
GCAATTATTG AATTAGTTTT GATCGCTAAA AATCTTTTAA AATTCATATT TAAAGCTAAA
AAAACAAGAA CCCAGAAATA G
 
Protein sequence
MSKKILIVFV ALILIFNCLP VFAWTPTEGL TSSSVVLMDT VRGQILFEKN AYAHLSPSVM 
CKLMTALVTI EKTDLNAKVT ISKNAASFKG LNLVVGNQYT VEDLLYAVML SQGNDAAVAL
AEYVGDGDIQ KFVRYMNTKA KELSLKDTYF VNPTGLYEKD QYTSAKDIAV LVKAAISNNT
FNLMFGARGF GWLNGNNSSI LTNQNTLFWS YKGVDGGKIG TNTNPQSISA VTTATLNEKR
LIAVVFNTNE ENAFSETAKL FDYGFSKFYT GVLVPKNIPQ RSIEVDNVKV DLVSKIDVYY
TYPVGDSFIN NISFTPNEKL KLPLNTETIA GVLKYTLNDN TVIEVNLYSD KTVSAPEDYI
SKIKNIVTEN RDLVIIVGIL AIIELVLIAK NLLKFIFKAK KTRTQK