Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0798 |
Symbol | |
ID | 7309649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 918624 |
End bp | 919814 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643607740 |
Product | amidohydrolase |
Protein accession | YP_002505156 |
Protein GI | 220928247 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.286142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTAA TATATAATGG AAAAATAATA ACAATGGCAG AAGTTGATTA CGAAAATGGA TACATACTTA CTGATAATGG TAAAATAATA GCTGTTGGCA ATGACTTATC TGATGTAAAG GAACAGCTGA AGCCGGATAT ACATAGGATA GATGCAAATG GAGGCTACGT ATTGCCGGGA TTAATAGATC CACACAGCCA TATAGGAATG TGGGAGGATT CAGTTGGTTT TGAAGGTGAT GACGGAAACG AGTCAACTGA CCCGGTAACA CCTCAGATGA GAGCAATAGA TGCTATTTAC CACGCAGACA GGTCTTTTGT GGAAGCTTAT GAAAGCGGCG TGACGACGGT TGTTACAGGC CCGGGCAGTG CAAATGTTAT AGGCGGACAG TTTGCTGCTC TCAAAACCTA CGGACGGTGC GTAGACGAGA TGGTTGTAAA GCATCCTGTA GCAATGAAGG TGGCATTTGG CGAAAATCCA AAAACAGTTT ATAATGATAA ACATCAGACG CCAATGACTA GAATGGCTAC TGCTGCTATA CTCCGTGAAA GTTTGTTTAA GGCAAAAGAA TATCAGGAAG TATGGGAAGA CTATAAAAAG AACCCGGAGG AATATGATAA GCCGGATTTT GATTTTAAAA TGGAAGCTCT GCTTCAGGTG TTGAACCGAC AAATACCTTT AAAAGCTCAT ACTCACAGGG CGGACGATAT AATAACGGCT ATCAGGGTAG CCAGGGAATT TAATGTGGAC ATTACACTTG ACCACTGTAC GGAGGGCTAT CTTATAAAGG ATATTCTTGC AGAAGCAGGT GTCCCTGTAA TTATAGGACC AATGCTTAGC GACAGGTCAA AAATAGAATT AAAAAACTAT AACTTGAAAA CCCCTGGAAT TCTGTCCAAG TCGGGTATCA AAGTAGCCAT AATGACTGAC CATCCATGTA TGCCTGAGCA ACATCTGTGT CTGTCTGCCG CAATTGCTGC CCGTGAAGGT ATGAGTGAAA AAGAGGCATT ACGGGCGATA ACAATAAATG CGGCTGAAAT AACAGGTATC AGTGACAGGG TAGGTTCTCT GGAGAAGGGG AAGGATGCCG ATATTGTAAT ATTTGACGGA AATCCTTTGG AATTGAAATC AACTGTACAG AAGACTATAA TCAACGGTGT TGTAGTTTAT GAAAGGAAAC AGAATGAATA A
|
Protein sequence | MLLIYNGKII TMAEVDYENG YILTDNGKII AVGNDLSDVK EQLKPDIHRI DANGGYVLPG LIDPHSHIGM WEDSVGFEGD DGNESTDPVT PQMRAIDAIY HADRSFVEAY ESGVTTVVTG PGSANVIGGQ FAALKTYGRC VDEMVVKHPV AMKVAFGENP KTVYNDKHQT PMTRMATAAI LRESLFKAKE YQEVWEDYKK NPEEYDKPDF DFKMEALLQV LNRQIPLKAH THRADDIITA IRVAREFNVD ITLDHCTEGY LIKDILAEAG VPVIIGPMLS DRSKIELKNY NLKTPGILSK SGIKVAIMTD HPCMPEQHLC LSAAIAAREG MSEKEALRAI TINAAEITGI SDRVGSLEKG KDADIVIFDG NPLELKSTVQ KTIINGVVVY ERKQNE
|
| |