Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1729 |
Symbol | |
ID | 5713296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1796409 |
End bp | 1797608 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267647 |
Product | putative deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_001533072 |
Protein GI | 159044278 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0827588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.630859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCAAA ACGCAGAACA AGAACAGGCA GATCATATGA CAGCGCCCTT TGCGTGCGGA CCCGCAGGCT CGCGCGGCAG GCTTGTGGCC GAACCCGAGA GCGATTTTCG CTCGTGCTTT CAGCGCGACC GCGACCGGAT CATCCATGCC AGTGCGTTCC GGCGGCTGAA ACACAAGACC CAGGTCTTCG TGGAACACGA GGGCGATTAT TTCCGCACCC GCCTGACCCA TTCGATCGAG GTGGCCCAGG TGGCGCGCAC GATCTGCGGC GCGCTGGGGC TGAACCCGGA TCTGACCGAA GCGGTGGCGC TGGCCCATGA TCTGGGCCAC ACGCCGTTCG GGCATACGGG CGAGGATGCG CTCAACGCGC TGATGGCCCC CTATGGCGGG TTCGATCACA ACGCCCAGGC GCTGAAGATC GTCACCTCGC TCGAACGTCA TTACGCGGCT TTCGACGGGC TCAACCTGAC TTGGGAAACG CTGGAAGGCA TCGCCAAGCA TAACGGCCCG GTCACGGGCG AGTTGCCCCA TGCGCTGGCC AGCTACAACG CCCGCCACGA TCTCGAACTG CAAACCCATG CCAGCGCCGA GGCGCAGGTG GCCGCCCTGG CCGACGACAT CGCCTATAAC AACCACGACC TGCAGGACGG GCTGCGCGCG GGACTCTTCA GCCAGGCCGA TATCGCCGAC CTGCCGCTGG TGGCCGAGGC CTATGCCGAG GTCGACGCCG TCTGGCCCGA TCTCGACCCC GCGCGGCGCA AACACGAAGC CCTGCGCCGG GTGTTCGGGA TGATGGTGGC GGACGTCATC GACACCTCCC GCGCGCTGCT GGCCGAGGCC GCCCCGGCCG ACGCCCAGGC CGTGCGCGAC CTGGGCCGAC CGGTGATCCG GTTCTCCGAC GGGATGTTCG CCAGCCTGCG GCAGATCCGC GAGTTTCTCT TCACCCGCAT GTACCGCGCC CCCAGCGTGA TGGAGAAGCG CGCCGAGGTG ACCACGGTCA TCAACGACCT CTTCCCGCGC TATATGGCCG ATCCGAGCCT GCTGCCCGCG CGCTGGCAAC CCGACATCCT CGCCACCCGC ACCCGCACCG AACTGGCCCG TATCGTGGCC GACTACATCG CGGGCATGAC CGACCGCTAC GCGCTCCAGG CCCATGACCG GCTCACGGCG GGGGACCGCG CCCGCAGCGC GCGCGCCTGA
|
Protein sequence | MPQNAEQEQA DHMTAPFACG PAGSRGRLVA EPESDFRSCF QRDRDRIIHA SAFRRLKHKT QVFVEHEGDY FRTRLTHSIE VAQVARTICG ALGLNPDLTE AVALAHDLGH TPFGHTGEDA LNALMAPYGG FDHNAQALKI VTSLERHYAA FDGLNLTWET LEGIAKHNGP VTGELPHALA SYNARHDLEL QTHASAEAQV AALADDIAYN NHDLQDGLRA GLFSQADIAD LPLVAEAYAE VDAVWPDLDP ARRKHEALRR VFGMMVADVI DTSRALLAEA APADAQAVRD LGRPVIRFSD GMFASLRQIR EFLFTRMYRA PSVMEKRAEV TTVINDLFPR YMADPSLLPA RWQPDILATR TRTELARIVA DYIAGMTDRY ALQAHDRLTA GDRARSARA
|
| |