Gene Information |
Locus tag | Acel_0802 |
Symbol | |
ID | 4486180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 885608 |
End bp | 887014 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639729573 |
Product | putative deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_872561 |
Protein GI | 117928010 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
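
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but this reading reproduces the listed 1407 bp) and that the last codon of the CDS is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(885608, 887014)
print(length, protein_length_aa(length))  # 1407 468
```

Because the gene is not spliced, the protein length follows directly from the gene length: 1407 / 3 = 469 codons, one of which is the stop codon.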
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.178296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.148231 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGACG TAGCGGTGCC GCGTCAGATT GCCGCGTATG ACGCGGCGGC CATGGAGCGG CGGGTCGACG AAGCGCCGAA GCACTCCGCC CGCTCGGCGT TCGCCAGGGA CCGGGCTCGG GTGCTGCACA GTTTCGCGTT GCGGCGGCTC GGCGCTAAAA CCCAGGTGGT CGGTCCCACC GATGTCGAGA GCGATTTCCC GCGTACCCGG CTCACGCATT CGTTGGAGTG CGCGCAGATC GGTCGGGATC TGGGTGCAGC CCTCGGTTGT GATCCTGACA TCGTGGAGAC GGCCTGCCTC GCGCATGACC TCGGTCATCC GCCGTTCGGG CACAACGGCG AATCGGCGTT GGCGCAGATT GCCCGGCATA TCGGCGGCTT TGAGGGAAAC GCCCAGTCCT TCCGCCTGCT GACCCGTCTG GAGGCCAAGA TTCTCGACGC CGCCGGGCAC AGTGTCGGTT TGAATCTGAC CCGGGCCAGT CTGGACGCCG CGACGAAATA CCCGTGGGGC CCGGATACGT CCGATGCCGA CGCGGCATCT CATTCCGATG CCGGGGCGGC GTCCGTGCCT GATGCCGGGG TGTCTCCTCC CGGTGGGGTT GCATCCGTGC CCGATGCCGG GGCGGCGTCC GGCGGTGCGA GTGGCCGGAA ATTCGGCGCC TACCGGGAGG ATCTAGCGGT TCTGGACTGG GTGCGGGCCG GGGCGCCGGC GCGGCGTCCC TGCCTTGAGG CGCAGGTCAT GGATTGGGCG GACGACGTCG CCTACTCCGT CCACGACCTG GAGGACGGGA TTCACGCCGA GTTGGTGCCG CTTCGCCGAC TGCGGAATCC GGCCGGGTGG GCGGACGTCG TCGACGTCGC CGCCGAGCGG TACCTCATCG GGGTCGAACG CGCCGAAATC GACGACGCCG TACGCCGCTT GCTGAGCTTC CCATGGTGGT TGACGGAATA CACCGGCAGC CGGCGTGAGT TGGCGGCATT GAAGAACATG ACGAGCGAGC TGATCGGCCG CTTCTGTTCG GCGGCCGAAA CCGCAACGCG CCAGGCGTAT GGAGCCGCGC CGGTGAACCG GTACGCCGCC GAACTCGTGG TTCCTCGGGA GGCGCGGATC GAGTGCGGCC TGCTCAAGGC GGTCACCGCG CACTTCGTGA TGGCCCGTCA TGGCGCGGAG GAGACGCGGG TCCGGCAACG GGAGGTGCTG GCGGATCTCG TGGAGGCGCT GGTGGCATTG GACGGCACGG TCCTCGATCC GGTCTTCGCG GAGGAATGGC GGGAGGCGGC GGACGATGCC GGGCGGTTGC GGGCGGTCGT CGACCAAGTG GCGGCGTTGA CCGACACCTC GGCGCTTGCG TGGCATCGCC GGCTCTGCCC ACGCAACGCG GTGGTGATTG CCGGCACAAC GGGATAG
|
Protein sequence | MSDVAVPRQI AAYDAAAMER RVDEAPKHSA RSAFARDRAR VLHSFALRRL GAKTQVVGPT DVESDFPRTR LTHSLECAQI GRDLGAALGC DPDIVETACL AHDLGHPPFG HNGESALAQI ARHIGGFEGN AQSFRLLTRL EAKILDAAGH SVGLNLTRAS LDAATKYPWG PDTSDADAAS HSDAGAASVP DAGVSPPGGV ASVPDAGAAS GGASGRKFGA YREDLAVLDW VRAGAPARRP CLEAQVMDWA DDVAYSVHDL EDGIHAELVP LRRLRNPAGW ADVVDVAAER YLIGVERAEI DDAVRRLLSF PWWLTEYTGS RRELAALKNM TSELIGRFCS AAETATRQAY GAAPVNRYAA ELVVPREARI ECGLLKAVTA HFVMARHGAE ETRVRQREVL ADLVEALVAL DGTVLDPVFA EEWREAADDA GRLRAVVDQV AALTDTSALA WHRRLCPRNA VVIAGTTG
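
The gene starts with GTG while the protein starts with M, which is expected under translation table 11: GTG is one of the alternative start codons and is read as methionine at the initiation position. The sketch below shows one way to strip the 10-base grouping from the sequence, translate it, and compute a GC fraction. It is an illustration under stated assumptions rather than the pipeline that produced this record: the helper names are hypothetical, ambiguous bases are not handled, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon assignments as the standard code; it differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # any start codon initiates with methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_30 = "GTGTCCGACG TAGCGGTGCC GCGTCAGATT"
print(translate_cds(first_30))  # MSDVAVPRQI
print(gc_fraction(first_30))
```

The translated fragment matches the first ten residues of the protein sequence listed above. Running the same functions on the full gene sequence is one way to check the record's protein sequence and its stated 69% GC content.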