Gene Dshi_3289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3289 
Symbolamn 
ID5712346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3455298 
End bp3456767 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content65% 
IMG OID641269217 
ProductAMP nucleosidase 
Protein accessionYP_001534623 
Protein GI159045829 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID[TIGR01717] AMP nucleosidase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.264464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCAGA CGCTGAAGAC CCCTCCGCGG ACCCTGCCGC GCAGCTTTGA CGATGCCGCT 
GCCGCGGTGG CCGAGTTGCA GCGCCTTTAT GACGAGGGTG TCCTGTTCTT GCAGACCAGC
TTCGCAGATG CGGTGCGCGA GGGCGCGGTT TCGGCACGCT ACCGGGCGTT TTACCCGGAG
ATCCGGGTAC GGGTGAGCAG CTTCGGGCAG GTGGACAGTC GGCTGTCCTT CGGCCATGTC
GCCGAGCCCG GCAGCTATGC CACGACCGTC ACGCGCCCGG ATCTGTTCGC GAACTACCTG
ACACAGCAGA TCGGCCTTCT GATCCGCAAT CACGGGGTGC CCATCGAGAT CGGGCAGTCC
GAGACGCCGA TCCCGCTGCA TTTCGCCATG GCGGGCGGGC CGGTGGCCTC GATCCCGCAG
GAAGGCGTTC TGGCCTTCAC CCTGCGCGAT GCCTTCGACG TGCCGGACCT GGCCACCACC
AACGACCAGA TCGTCAACGG CTACGGCTTC AAGCACGAAG ACGGCGCGGG CCCGCTGGCG
CCCTTCACGG CGCAGCGGAT CGACTATTCG CTGGCGCGGC TGCAGCATTA CACCGCCACC
AAGGCGGAGC ATTTCCAGAA CCATGTCCTG TTCACCAACT ACCAGTTCTA CGTGGACGAG
TTCATCGCCT TCGCCCGCAA GGCGCTGGCC GATCCCGACA GCGGGTATCA AAGCTTCGTG
GCCCCGGGCA ACGTGGAGAT CACCGACCCG GACGCGGAAT TGCCGGTGCT GCCGAAGGCA
CCGCAGATGC CGACCTATCA CCTGACCCGC AAGGGGCAGG CAGGGATCAC GCTGGTCAAT
ATCGGGGTGG GGCCGTCCAA CGCCAAGACC GCGACCGACC ATATCGCAGT GCTGCGGCCC
CATGCCTGGC TGATGCTGGG GCATTGCGCG GGGCTGCGGA ATACCCAGAG CCTGGGCGAT
TACGTTCTGG CCCATGCCTA TCTGCGCGAG GATCACGTGC TGGACGACGA TCTGCCGATC
TGGGTGCCGA TCCCGGCCCT GGCGGAGATC CAGATCGCGC TGGAGCAGGC GGTGGCGGAG
GTGACCAAGC TGGAGGGCTA TGACCTCAAG CGGATCATGC GCACGGGCAC GGTGGCCACC
ATCGATAACC GCAACTGGGA GCTGCGCGAC CAGTCCGGCC CGGTCCAGCG CCTCAGCCAG
TCCCGCGCCG TGGCGCTGGA CATGGAAAGC GCCACGATCG CCGCCAATGG ATTCCGGTTC
AGGGTGCCTT ACGGCACGCT GCTATGCGTG TCGGACAAGC CGCTCCATGG GGAGTTAAAG
CTGCCGGGCA TGGCGACGGA GTTCTACACC ACCCAGGTGG CCCGGCATCT GCTGATCGGG
ATCCGGGCAA TGGAGACGAT CCGCGACATG CCGCTGGAGC GTATCCATTC CCGGAAACTG
CGCAGTTTCG AGGAAACCGC GTTTCTTTAA
 
Protein sequence
MTQTLKTPPR TLPRSFDDAA AAVAELQRLY DEGVLFLQTS FADAVREGAV SARYRAFYPE 
IRVRVSSFGQ VDSRLSFGHV AEPGSYATTV TRPDLFANYL TQQIGLLIRN HGVPIEIGQS
ETPIPLHFAM AGGPVASIPQ EGVLAFTLRD AFDVPDLATT NDQIVNGYGF KHEDGAGPLA
PFTAQRIDYS LARLQHYTAT KAEHFQNHVL FTNYQFYVDE FIAFARKALA DPDSGYQSFV
APGNVEITDP DAELPVLPKA PQMPTYHLTR KGQAGITLVN IGVGPSNAKT ATDHIAVLRP
HAWLMLGHCA GLRNTQSLGD YVLAHAYLRE DHVLDDDLPI WVPIPALAEI QIALEQAVAE
VTKLEGYDLK RIMRTGTVAT IDNRNWELRD QSGPVQRLSQ SRAVALDMES ATIAANGFRF
RVPYGTLLCV SDKPLHGELK LPGMATEFYT TQVARHLLIG IRAMETIRDM PLERIHSRKL
RSFEETAFL