Gene Information |
Locus tag | Ccel_1804 |
Symbol | |
ID | 7310535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2158134 |
End bp | 2159279 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608736 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_002506134 |
Protein GI | 220929225 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
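The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 2158134
end_bp = 2159279
reported_gene_length = 1146     # bp
reported_protein_length = 381   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length. The strand field does not affect this check, since the length depends only on the span of the coordinates.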
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00796527 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATGGA GACAGCTAAG AGAACAGCAG GAAAATTTAC TGAGTCCCTA TGCAGCAAGA AGTGTTAATT CTTTTGGCAG GATAATCAAT GAAGAAAAAT GTCCTATCAG AACAGATTAC GAACGTGATG GGAACAGGAT TTTATACTCC ATGGAATTCA GACGATTAAG ACACAAAACA CAGGTGTTTT TTAATGCAAA GAACGACCAC ATATGTACAC GAATGGAGCA TGTATTAAAT GTAGGCTCAA TAGCCGTCAC TATAGCCAGA ACGCTGAATT TGAATCAGGA CCTTACATAT GCAATAGCTT TAGGGCATGA TCTTGGGCAT GCTCCATTCG GCCACAGCGG TGAACGTGTA TTAGATAAAT GCATGAAGAA AGTAAATTTA GAGTTCGGTT TTCAACATGA GCTCCACAGC CTTAGGGTTG TAGAAAAGCT TGCAACACGT ATCTCTAAAG AAAAAATACA TGAAAAATGC GGACTCAATC TAACTTTTGA AGTAAGGGAC GGGATAGTAT CACACTGTGG TGAAAACTAT AACGAATATT CTTTAAAAAG AGATTTAGGG AAAACACCAC AGTCACTGTG TAATGTAAAA GACAGAGGAG GTCTTCCGTT TACCCTTGAA GGCTGCATTG TGAGACTGGT AGATAAAATA GCATACGCCG GAAGGGATAT TGAGGACGCT GTACGTGTCA ACCTGATGAA CGTCCATGAT ATACCCAAGG ATATAAGAAA TGAGTTGGGT AACACAAATG GGGAAATAAT AAATACTCTG GTTTGTGATA TGGTTGAGAA CAGCTACGGC AGGGATTGTA TCCGGCTTAG CCCGGATAAA GGCCAGGCAC TTGAAAAACT CATAAATGAA AATATAAGGT TAATTTATAA GGCTGATAAA ATAACCCGAT ATGAAAAAAT TGCCGAAAAC ACTCTTGAAG GTTTGTTTGA CAGCCTGCTC GACAGCCTAT CTGATTTTGA AAAGCTTCAG ACAAATGAAA ATAAGGTTTA CAGAATGTTT TATAACTTTA TAGCAGACAA GGCATACGAC GAGTCTGAAA GTGATGCTCA AAAAGTAATT GATTTTATAG CCGGGATGAC AGACCAGTTT GCACAAAGCT GCTTTGAAGA AATATACTGG ATGTAG
|
Protein sequence | MIWRQLREQQ ENLLSPYAAR SVNSFGRIIN EEKCPIRTDY ERDGNRILYS MEFRRLRHKT QVFFNAKNDH ICTRMEHVLN VGSIAVTIAR TLNLNQDLTY AIALGHDLGH APFGHSGERV LDKCMKKVNL EFGFQHELHS LRVVEKLATR ISKEKIHEKC GLNLTFEVRD GIVSHCGENY NEYSLKRDLG KTPQSLCNVK DRGGLPFTLE GCIVRLVDKI AYAGRDIEDA VRVNLMNVHD IPKDIRNELG NTNGEIINTL VCDMVENSYG RDCIRLSPDK GQALEKLINE NIRLIYKADK ITRYEKIAEN TLEGLFDSLL DSLSDFEKLQ TNENKVYRMF YNFIADKAYD ESESDAQKVI DFIAGMTDQF AQSCFEEIYW M
|
| |