Gene Dshi_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1960 
Symbol 
ID5712954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2056060 
End bp2057664 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content66% 
IMG OID641267884 
Productphosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001533301 
Protein GI159044507 
COG category[S] Function unknown 
COG ID[COG3379] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.151896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00177875 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCACAGA CGAAAATTCA GACGGTTATC ATTGGCCTTG ATGGGGCGAC CTACGACATG 
CTCGACCACC TGGTCGCGGA GGGGGTCATG CCCAATTTGG GGCGTATCAT GGCCGAAGGC
GCGCGCGGGA TCCTGGCCTC GACGATCCAC CCGCTGACCC CGCCGGCCTG GGCCACGCTG
ATGACCGGGC GCAGCCCCGG CAATCACGGT GTGTTCGACT TCATCCGGGT CGACCGCGAA
GGGTCCAAGC CCAGCTACAC GCTGGCGACC TCGGCGGATG TGAAGGTGCC GACCATCTGG
CAGATCGCGA GCGCCGCCGG CAAGCGGGCG ACGACCCTGA ACTACCCGGT GATGTTCCCG
GCCAAGCCGA TCGACGGGGT GGTGATCCCG GGCTATGTGC CCTGGTCCTA TCTCGGCCGC
GCCATCCACC CGCGCGAGAC CTTCAAGATG CTGAAGGCCA AGGGGGTCTT CAAGGCCTCC
GAGATGTCCA CCGACTGGCA GCATGAGCGC AAGGCCGTGC AGGGCCTGTC GGAGAACCAG
CTGACCGATT GGGTGCAGTT CCACATCACG CGGGAACAGC GCTGGCAGGA CATCCTGCTG
ACCCTGATGG AGGAAGAGCC GTCAGAGCTG ACAGCCGTTC TGTTCGACGG CGTCGACCGG
ATCCAGCACC TCTGCTGGCA CCTGATCGAC CCGGTCAGCC GGGACGACTA CACCACGCCG
GAGTCGGTGG CGGCACGGGA GCTGGTGCTG CAATATTTCC GCAACGTGGA CGACTACCTG
GTGCAGATCA TCGACAAGGC CGGGCCCCAG GCGCAGGTCT TCATCGTGTC CGATCACGGC
TTTACCCGCT CGGGCACGCG GATCTTCTAT GCCAATACCT TCCTGGAACA TGCGGGGCTC
CTGACCTGGA ACGCGGGCGT GGCAATGGAC GACCAGGGCC GCGTGGCGCT CGACGAGAAT
ACCGAGGCCA GCACCCTGAT CGACTGGGCC GAGACCAAGG CTTATTCGCT CAGTTCCTCC
TCCAACGCGA TCTTCATCCG CCGCGCGGCC AAGCCGGGCG ATCCCGGCGT CACCGATGCG
GCGTACGAGG CGTTCCGCGA CGACCTGATC GCGCAGCTTC TGGCCTTCAC CGACCCCGAG
ACCGGCAAGC CGGTGATCAA GTCGGTGTTC AAGCGCGAAG ATGCCTTCCC CGGCACCCAG
ACCGAGCGCG CCCCGGACCT GACCCTGCAG CTCCATGATT ACAGCTTCCT GTCGGTGCTG
CGCGCGGACC AGCCGATCAA GGACCGGCGC GTGCCCTATG CCACCCACCA CCCGGACGGC
ATCTTCGTGG CCACCGGGCC GGGGATCGCG GCGGGCACGG CGCTCGACCG GCTGCAGATC
GCCGACGTGG CGCCCACGGC GCTCTATTCC TGCGGGGTCG AGGTGCCCTC GGAGATGGAG
GGCAAGGTGG CCGAGCAGGC CTTCGCCGAG GCCTACAAGG CTGACAACCC GATCCGTTAT
ACCGCGGGCG AGGGGGCGGC GGCGGGCGAT ACCGACGACG CGGCCCTGAC CGGCGACGCC
GAAGAGCAGA TCCGCGAACG CCTGAAATCC CTCGGGTATC TCTGA
 
Protein sequence
MSQTKIQTVI IGLDGATYDM LDHLVAEGVM PNLGRIMAEG ARGILASTIH PLTPPAWATL 
MTGRSPGNHG VFDFIRVDRE GSKPSYTLAT SADVKVPTIW QIASAAGKRA TTLNYPVMFP
AKPIDGVVIP GYVPWSYLGR AIHPRETFKM LKAKGVFKAS EMSTDWQHER KAVQGLSENQ
LTDWVQFHIT REQRWQDILL TLMEEEPSEL TAVLFDGVDR IQHLCWHLID PVSRDDYTTP
ESVAARELVL QYFRNVDDYL VQIIDKAGPQ AQVFIVSDHG FTRSGTRIFY ANTFLEHAGL
LTWNAGVAMD DQGRVALDEN TEASTLIDWA ETKAYSLSSS SNAIFIRRAA KPGDPGVTDA
AYEAFRDDLI AQLLAFTDPE TGKPVIKSVF KREDAFPGTQ TERAPDLTLQ LHDYSFLSVL
RADQPIKDRR VPYATHHPDG IFVATGPGIA AGTALDRLQI ADVAPTALYS CGVEVPSEME
GKVAEQAFAE AYKADNPIRY TAGEGAAAGD TDDAALTGDA EEQIRERLKS LGYL