Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2097 |
Symbol | |
ID | 7310798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2454156 |
End bp | 2454821 |
Gene Length | 666 bp |
Protein Length | 221 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643609031 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_002506422 |
Protein GI | 220929513 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.284323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACA CTACCGTACT ATTTGATTTG GACGGAACAC TTATAAACTC ACTTGAGGAT TTGGCTGAAA GTGCTAATGA GGCTTTAACA AAGCATGGAT TCAAAGCTCA TCCTCTGGAG GCATACAAAA AATTTGTTGG AAACGGAGTA CGGAACCTTA TTAAAAGTGC TACACCTGAC GGAACAGAGG ACAGCGTCGT TGATATGATA CTCGAAGATT ACCGGAAAAT ATATAACAAA AACTACGTAA ATAAAACCAG AGTTTATGCC GGAATACATG AAATGTTGGA AAATCTTAAA AAAGTGGGGG TAAAAATGGG AGTTTGCTCA AACAAACCTC ACAAACCTAC AAATGAGATA GTGGAAAAAC TACTGGGAAA TAAGTATTTT GACGTAGTAT TCGGTGAACG GGAGGGAATA CCCCGCAAAC CGGACCCGGC TTCACTGATA GAGGCGGCTG AAAAACTTGG GGTTGTACCG AGTCAAACCA TATATGTGGG GGATTCTGGT GGGGATATGG AATCGGCAAA CAGGGCAGGG ATGCTGGCAG TAGGTGTATT ATGGGGATTT AGGGAACAAG ATGAATTAAA ATCCTGCGGT GGAAAGATAC TGATTGCTTC GCCATCTGAA TTGGTGGACT TTGTAACTGG GGATAACAGG GGTTAA
|
Protein sequence | MKYTTVLFDL DGTLINSLED LAESANEALT KHGFKAHPLE AYKKFVGNGV RNLIKSATPD GTEDSVVDMI LEDYRKIYNK NYVNKTRVYA GIHEMLENLK KVGVKMGVCS NKPHKPTNEI VEKLLGNKYF DVVFGEREGI PRKPDPASLI EAAEKLGVVP SQTIYVGDSG GDMESANRAG MLAVGVLWGF REQDELKSCG GKILIASPSE LVDFVTGDNR G
|
| |