Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1771 |
Symbol | |
ID | 7310505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 2122291 |
End bp | 2123541 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643608703 |
Product | protein of unknown function DUF795 |
Protein accession | YP_002506102 |
Protein GI | 220929193 |
COG category | [R] General function prediction only |
COG ID | [COG1323] Predicted nucleotidyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.251831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTAC TTGGAATAGT CGCTGAATAC AACCCTTTTC ATAACGGACA TATGTATCAT ATAGAAGAAT CAAAAAAACT TACCGGCTGT GATGCAGTGG TTTGCGTAAT GAGCGGTAAC TTTATACAAA GAGGAGAACC TGCTATAATA AATAAATTTG CACGAACCGA AATTGCACTT GGCAACGGTG TGGATTTAAT AATTGAGCTA CCTGTTCCCT TTGCAGTATC AAGTGCGGAG TTTTTTTCTT ATGGTGCTGT CAGCATCTTG AACAATATAG GTATAGTTGA TTGTATCTCT TTCGGAAGTG AGTCAGGAGA CATTATTTCC CTTCAAAAAA TAGCCGAAAT CCTTGTTTCT GAGCCACAAA GCTATAAAGC TGAATTGAAA AAGCAGTTGT CCGCAGGGCT GTCCTTCCCT GTTTGCAGGC AACGAGCTTT GGATAAATAT CTTAAAATAC AGAATGACTC CAATGAGTCT CTCTCCTCTT TACTTGAAAC CTCCAATAAC ATACTCGCTT TGGAATATTT GAAGGCTCTT TCAAGGCTTA ATAGTCCTAT ACAGCCGTAT ACAGTTAAAA GGATTTCCAA CTGCTATAAC ACTCCACAGC TTACAGGTAG TATTTCAAGT GCAACTGCTA TCAGAAACAG TATTTATAAG AGTGAAATTG ATGTCAGCAG GCAAGCCCTT CCAACACTGG CACAACAGAT TATGGACAGG GAATTTTCCT TAGGCAGAGG GCCGAACAGC TTGTATTCTT TTGAAGATAT AATTCTTGCC TTTCTGCGTC ACGCTACACC ACAGGAACTG GAGAAAATAC AGGACGTTTC AGAGGGCTTG GAGTACAGAA TTAAAAATGC CGCAGATAAT TCAGGCTCCT TTGACGACCT GCTTGCCAAT ATATGTACAA AGCGTTATCC GAAAACACGT ATACAAAGAA TACTTATTTC ACTCCTTGCC GGAATGAAAA GGTTTGATAT GGAACAATTT ATGGCATGTG GCGGACCACA ATACGCCAGA ATATTAGGGT TTAACGAAAT TGGACGTCAG CTTCTGTCAC TTATGAAAAA GAAATCGTCA ATCCCGGTTA TCACCAAGGC ATCCCATTAC AAAACATCTG ATGACAGTCC GATTTTAAGA ATGTTGGAAA TCGAAGCAAG AGCAACGGAC ACATATGTTT TGGCATATAA AAACCCGGCT TTTAAAAAAG CCGGGCAGGA GTTTACTCAA AATATTATCA TTTGCAGGTA A
|
Protein sequence | MKVLGIVAEY NPFHNGHMYH IEESKKLTGC DAVVCVMSGN FIQRGEPAII NKFARTEIAL GNGVDLIIEL PVPFAVSSAE FFSYGAVSIL NNIGIVDCIS FGSESGDIIS LQKIAEILVS EPQSYKAELK KQLSAGLSFP VCRQRALDKY LKIQNDSNES LSSLLETSNN ILALEYLKAL SRLNSPIQPY TVKRISNCYN TPQLTGSISS ATAIRNSIYK SEIDVSRQAL PTLAQQIMDR EFSLGRGPNS LYSFEDIILA FLRHATPQEL EKIQDVSEGL EYRIKNAADN SGSFDDLLAN ICTKRYPKTR IQRILISLLA GMKRFDMEQF MACGGPQYAR ILGFNEIGRQ LLSLMKKKSS IPVITKASHY KTSDDSPILR MLEIEARATD TYVLAYKNPA FKKAGQEFTQ NIIICR
|
| |