Gene Dshi_1729 details

Gene Information

Locus tag: Dshi_1729
Symbol:
ID: 5713296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1796409
End bp: 1797608
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 68%
IMG OID: 641267647
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001533072
Protein GI: 159044278
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
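
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record; the function names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1796409, 1797608)
print(length)                  # 1200
print(protein_length(length))  # 399
```

Both results agree with the Gene Length and Protein Length fields of the record.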


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0827588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.630859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCAAA ACGCAGAACA AGAACAGGCA GATCATATGA CAGCGCCCTT TGCGTGCGGA 
CCCGCAGGCT CGCGCGGCAG GCTTGTGGCC GAACCCGAGA GCGATTTTCG CTCGTGCTTT
CAGCGCGACC GCGACCGGAT CATCCATGCC AGTGCGTTCC GGCGGCTGAA ACACAAGACC
CAGGTCTTCG TGGAACACGA GGGCGATTAT TTCCGCACCC GCCTGACCCA TTCGATCGAG
GTGGCCCAGG TGGCGCGCAC GATCTGCGGC GCGCTGGGGC TGAACCCGGA TCTGACCGAA
GCGGTGGCGC TGGCCCATGA TCTGGGCCAC ACGCCGTTCG GGCATACGGG CGAGGATGCG
CTCAACGCGC TGATGGCCCC CTATGGCGGG TTCGATCACA ACGCCCAGGC GCTGAAGATC
GTCACCTCGC TCGAACGTCA TTACGCGGCT TTCGACGGGC TCAACCTGAC TTGGGAAACG
CTGGAAGGCA TCGCCAAGCA TAACGGCCCG GTCACGGGCG AGTTGCCCCA TGCGCTGGCC
AGCTACAACG CCCGCCACGA TCTCGAACTG CAAACCCATG CCAGCGCCGA GGCGCAGGTG
GCCGCCCTGG CCGACGACAT CGCCTATAAC AACCACGACC TGCAGGACGG GCTGCGCGCG
GGACTCTTCA GCCAGGCCGA TATCGCCGAC CTGCCGCTGG TGGCCGAGGC CTATGCCGAG
GTCGACGCCG TCTGGCCCGA TCTCGACCCC GCGCGGCGCA AACACGAAGC CCTGCGCCGG
GTGTTCGGGA TGATGGTGGC GGACGTCATC GACACCTCCC GCGCGCTGCT GGCCGAGGCC
GCCCCGGCCG ACGCCCAGGC CGTGCGCGAC CTGGGCCGAC CGGTGATCCG GTTCTCCGAC
GGGATGTTCG CCAGCCTGCG GCAGATCCGC GAGTTTCTCT TCACCCGCAT GTACCGCGCC
CCCAGCGTGA TGGAGAAGCG CGCCGAGGTG ACCACGGTCA TCAACGACCT CTTCCCGCGC
TATATGGCCG ATCCGAGCCT GCTGCCCGCG CGCTGGCAAC CCGACATCCT CGCCACCCGC
ACCCGCACCG AACTGGCCCG TATCGTGGCC GACTACATCG CGGGCATGAC CGACCGCTAC
GCGCTCCAGG CCCATGACCG GCTCACGGCG GGGGACCGCG CCCGCAGCGC GCGCGCCTGA
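
The sequence above is laid out in blocks of 10 bases, so the whitespace has to be stripped before computing anything from it. The following Python sketch shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input; pasting the full block works the same way.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence in this record (6 groups of 10 bases).
first_line = "ATGCCTCAAA ACGCAGAACA AGAACAGGCA GATCATATGA CAGCGCCCTT TGCGTGCGGA"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # GC fraction of this first line only
```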
 
Protein sequence
MPQNAEQEQA DHMTAPFACG PAGSRGRLVA EPESDFRSCF QRDRDRIIHA SAFRRLKHKT 
QVFVEHEGDY FRTRLTHSIE VAQVARTICG ALGLNPDLTE AVALAHDLGH TPFGHTGEDA
LNALMAPYGG FDHNAQALKI VTSLERHYAA FDGLNLTWET LEGIAKHNGP VTGELPHALA
SYNARHDLEL QTHASAEAQV AALADDIAYN NHDLQDGLRA GLFSQADIAD LPLVAEAYAE
VDAVWPDLDP ARRKHEALRR VFGMMVADVI DTSRALLAEA APADAQAVRD LGRPVIRFSD
GMFASLRQIR EFLFTRMYRA PSVMEKRAEV TTVINDLFPR YMADPSLLPA RWQPDILATR
TRTELARIVA DYIAGMTDRY ALQAHDRLTA GDRARSARA