Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0814 |
Symbol | |
ID | 7309662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 935299 |
End bp | 936327 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643607756 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002505172 |
Protein GI | 220928263 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0751065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACAC TTATACTCGG GATAGAAAGC AGCTGTGATG AAACTGCTGC TTCGGTTGTG AAAAACGGAA GATATATAAT GTCAAATGTT ATTTCATCTC AAATAGATTT GCATAAAAAG TATGGTGGTG TTGTTCCGGA AATTGCATCA CGAAAACATG TTGAACTGAT TATGCCGGTT GTACATCAGG CACTGGAAGA GGCGGGTGTT TCCTTAAAGG AAATTGATGC CGTAGGTGTT ACCTATGGAC CTGGATTGGT AGGAGCTCTC CTTGTTGGGT TGACTGCTGC AAAGGCGATT GCATTTGCTG CCGATAAACC GTTGGTTGGA GTGCATCATA TTGAGGGGCA TATTGCGGCA AACTACCTTC AGGAGCCGGA GTTGGAGCCG CCTTTCATAT GCCTTGTAGC TTCCGGCGGA CACAGTCATA TAGTCCATGT TAAAAGCTAC AGTGAATTTG AGATTCTTGG ACAAACCAGG GATGATGCCG CCGGAGAGGC TTTTGATAAG ATATCCAGAG CTATCGGATT GGGATATCCG GGAGGCCCTT TAATTGATAA AAATGCACTG ACAGGAAACA GTAAGGCTAT ACAATTTCCA CGTGTAACCT TCGATGACGG TTCACTTGAC TTCAGCTTCA GTGGTCTCAA GACTGCAGTT CTAAATTATC TTAACAGGAT GGAGCAAACC GGGGAGAAAA TTAATATTCC TGATGTAGCG GCAAGTTTTC AGCAAGCGGT TGTAGACGTT TTGGTCAGAA ATACCATCGC TGCAGCTAAT ATGAAGAATA TTGACAAAAT TGCCTTAGCC GGTGGGGTTG CAGCAAATAC ACAGTTGAGA AACGACATGA AGGTCACAGC GGAAAAGCAA GGTATAAAGG TAATGTATCC GGGATTGACA CTTTGTACCG ATAATGCTGC TATGATTAGC TGTGCAGCTT ACTACGAGTA TAAAAGTGGA AAGAGGGCAG GAATGGATTT GAATGCTGTC CCGGGATTAA AGTTATCCAC AGGTATTTCC AAAGATTAA
|
Protein sequence | MDTLILGIES SCDETAASVV KNGRYIMSNV ISSQIDLHKK YGGVVPEIAS RKHVELIMPV VHQALEEAGV SLKEIDAVGV TYGPGLVGAL LVGLTAAKAI AFAADKPLVG VHHIEGHIAA NYLQEPELEP PFICLVASGG HSHIVHVKSY SEFEILGQTR DDAAGEAFDK ISRAIGLGYP GGPLIDKNAL TGNSKAIQFP RVTFDDGSLD FSFSGLKTAV LNYLNRMEQT GEKINIPDVA ASFQQAVVDV LVRNTIAAAN MKNIDKIALA GGVAANTQLR NDMKVTAEKQ GIKVMYPGLT LCTDNAAMIS CAAYYEYKSG KRAGMDLNAV PGLKLSTGIS KD
|
| |