Gene Saro_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0473 
Symbol 
ID3918601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp515201 
End bp516508 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID640443202 
ProductPpx/GppA phosphatase 
Protein accessionYP_495755 
Protein GI87198498 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAAT CCATTCCGCC GGCCAGTGGG CAAGGCGCGC GTTCCGGCAG GAAGAAGGGG 
CAACGGAAGT CCCGTCCGCC ACGCAGCAAT GCCGGGAGCA GCCTCCCGCC CAGGTCTGCC
GGCAAGACAA GGCCGCCCTC GTCAAGGGGC GGCGCAGCGC AGGAATCACT ACCGGCACAG
GCGCGTCAGG CCGGACCGCA GGGCCTTGCC GAAACGGCCT TGCCGCCTGC CGAAACCGCA
TTGCCTCAGG CGGGGGAACG CACATCCGAA CGGCCGGGCC CGTGGCAGAA GCCCCTGCCA
CACCACCGCC AGGCCTATGC CGCGATCGAC CTGGGCACCA ACAACTGCCG CCTGCTCATC
GCGCGCCCAT CGGGCGAACA CTTCGTGGTG ATCGACGCGT TCAGCCGCGT GGTGCGACTG
GGCGAAGGCC TTGCCCAGAC CGGGCGCCTT TCCGACGCGG CGATGGACCG CGCGCTCGCC
GCGCTGCACG TGTGCGCCGA CAAGTTGCGC AAGCGCAACG TCCACCTCGC CCGCTCGGTC
GCCACCGAGG CATGCCGCCG CGCCACGAAC GGCCAGGCCT TCATCGACCG CGTGCGCGAG
GAAACCGGTA TCCGGCTCAA TATCATAACC GCGCAGGAGG AAGCCCGCCT CGCCGTGCTC
GGCTGCCACA TCCTGCTCGA ACAGGGCGAC GGGCCGGCGA TGATCTTCGA CATCGGCGGC
GGCTCGACGG AAATGGTGCT GGTCGAGACG GGCGAGACCG TCCCCCGCAT TCTCGACTGG
CAATCCGTGC CCTGGGGCGT GGTCTCGCTC ACCGAAAGCA TCGGCCATAT CGACGACGAA
CCCGTTGCCC GCGCCGTCGC CTATGCCGAA ATGCGTCGCC GGGTGGACGA GGGCTTCGCC
GACTTCGCCG AACGCGTCGC CCCGATGCGC CACGCGGCGC AAGGGCAGGG CCGCATCCGC
CTGCTCGGCA CCAGCGGCAC GGTGACGACG CTCGCCAGCC TCCACCTCGA ACTGCCGCAA
TATGACCGCC GCGCGGTGGA CGGCCTCGTC GTCCCGGCCG AATCGATGCG CGACATCAGC
CGACGCCTTT CGACCATGTC CCCGGCGGAG CGCATTTCCG TGCCCTGCAT CGGGCGCGAG
CGGTCGGACC TGGTCGTCGC GGGCTGCGCG ATACTCGAAT CGATTTTCGA CATATGGCCC
GCCGACCGGC TGGGCATCGC CGACCGCGGC ATCCGTGAAG GGATCCTGCG CAGTTTGATG
GCCGGGGGCG CCGACCCGCG CGCCAGGAAG AGGATCGAAG CCGCATGA
 
Protein sequence
MAESIPPASG QGARSGRKKG QRKSRPPRSN AGSSLPPRSA GKTRPPSSRG GAAQESLPAQ 
ARQAGPQGLA ETALPPAETA LPQAGERTSE RPGPWQKPLP HHRQAYAAID LGTNNCRLLI
ARPSGEHFVV IDAFSRVVRL GEGLAQTGRL SDAAMDRALA ALHVCADKLR KRNVHLARSV
ATEACRRATN GQAFIDRVRE ETGIRLNIIT AQEEARLAVL GCHILLEQGD GPAMIFDIGG
GSTEMVLVET GETVPRILDW QSVPWGVVSL TESIGHIDDE PVARAVAYAE MRRRVDEGFA
DFAERVAPMR HAAQGQGRIR LLGTSGTVTT LASLHLELPQ YDRRAVDGLV VPAESMRDIS
RRLSTMSPAE RISVPCIGRE RSDLVVAGCA ILESIFDIWP ADRLGIADRG IREGILRSLM
AGGADPRARK RIEAA