Gene Ccel_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1474 
Symbol 
ID7310243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1789155 
End bp1790375 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content42% 
IMG OID643608400 
Productpeptidase U32 
Protein accessionYP_002505808 
Protein GI220928899 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00296183 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TTGAGCTTCT GGCTCCGGCC GGAAATCTTG AAAAACTAAA AATGGCAATT 
ATGTATGGTG CAGACGCAGT ATATATTGGA GGACAAAAAT TCGGCTTAAG GGCGTCTGCT
GATAATTTCT CACTTGAGGA CATAAAGGCG GGACTTGTAT TTGCACACGA TCGGGAGTGT
AAGGTCTATG TGACCGTTAA CATAATTCCT CATAATGAAG ATCTTGTGGG ATTGCCCGAA
TATATTAGAC AGCTTGATGA ACTGGGAGTG GATGCTCTTA TAGTATCTGA CCCCGGAATT
TTTGATATTG TACGTGAAAA CGCTCCTGAT ATGGAAATAC ATGTGAGTAC TCAGGCTAAT
AATACAAATT ATGCAAGTGC AATGTTCTGG TACAGGCATG GTGCCAAGCG AATTGTTACT
GCTAGGGAGC TTTCTCTTAT AGAAATAAGT GAAATTAAGG AAAAAATCCC GGATGACTTG
GATTTAGAGG CCTTTATCCA TGGTGCGATG TGTATTTCAT ATTCGGGAAG GTGCCTGCTC
AGCAATTACA TGGCAGGAAG AGATTCAAAC AGAGGAGCCT GTTCCCATCC GTGCAGATGG
AAGTACCATC TGGTTGAAGA AAAGCGTCCA GGAGAGTACT ACCCTGTTTA TGAGGATGAA
AAGGGGACAT ATATATACAA TTCAAAGGAT TTATGCACCA TTGAGTATAT CCCCGAACTT
GTGAAAGCAG GTATCTACAG TTTTAAAATA GAGGGCAGGA TGAAAAGCTC CTTTTACGTA
GCCACCGTTG TAAGTGCCTA TAGACAGGCT ATCGATGCAT ATATGGCCGA TCCTGAAAAC
TACAGGTTTG ACCCGGAATG GTTAAAAGAG TTGTCAAAGG CAAGTCACAG GGAGTATACA
ACGGGATTTT ATTTTAATAA AACTAGCGGT GCCGATCAGA TATACAATAC AAGCTCCTAT
ATCAGAGAAT ATGACTTTGT GGGCGTGGTT CTGGAGTATG ACAAAGAAAC TGGTATTGCT
AAGATTGAGC AGAGAAACCG TATGATTGTG GGAGATGAAA TAGAGGTGGT AAGCCCCCGT
AAAGGCTATT TTACCCAAAC TATTAAAGAG ATGAAGAATG AGGATGGAGA AAGCATCAGA
ACTGCACCTC ATGCACAGAT GATCGTATAT ATGCCTATGG ATCAGGAAGT GGGGCCTTAT
AGTATTTTGA GAAGGAAGTA G
 
Protein sequence
MKKVELLAPA GNLEKLKMAI MYGADAVYIG GQKFGLRASA DNFSLEDIKA GLVFAHDREC 
KVYVTVNIIP HNEDLVGLPE YIRQLDELGV DALIVSDPGI FDIVRENAPD MEIHVSTQAN
NTNYASAMFW YRHGAKRIVT ARELSLIEIS EIKEKIPDDL DLEAFIHGAM CISYSGRCLL
SNYMAGRDSN RGACSHPCRW KYHLVEEKRP GEYYPVYEDE KGTYIYNSKD LCTIEYIPEL
VKAGIYSFKI EGRMKSSFYV ATVVSAYRQA IDAYMADPEN YRFDPEWLKE LSKASHREYT
TGFYFNKTSG ADQIYNTSSY IREYDFVGVV LEYDKETGIA KIEQRNRMIV GDEIEVVSPR
KGYFTQTIKE MKNEDGESIR TAPHAQMIVY MPMDQEVGPY SILRRK