Gene Saro_0129 details


Gene Information

Locus tag: Saro_0129
Symbol:
ID: 3916015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 131784
End bp: 133289
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 63%
IMG OID: 640442854
Product: L-sorbosone dehydrogenase
Protein accession: YP_495412
Protein GI: 87198155
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
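
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(131784, 133289)
print(length, protein_length_aa(length))  # 1506 501
```

Under these assumptions the computed values agree with the listed 1506 bp and 501 aa.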


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCATT GCCCTTGCAA GGCACCGGGC ATAGATTTGC GCACCATGAA CCTCATCAAG 
AAGATCCTGA TCTCGCTCGT CGTGATCCTG GTGCTCGTCG GCGCCTATGT CGCCTGGTCC
GTTCGCGGCA CGCCCGCCCA GTTCGCCTTG AACGACACCA CAGGGCCCCG GCCGAAACTG
GCAGACCCCG ACGAGCAGAC CATCCCCACG ATCAAGACCG CCGATCCCAT TGGCTGGAAG
GACGGCGAGG CGCCGGTCGC TGCCGAAGGG CTGCAGGTCA CGCGCTTTGC CGACAAGCTC
GACCACCCCC GCACGGTCTT CACCCTGCCC AATGGCGATG TCCTCGTGGC CGAAACCAAT
TCGCCCCCGC GCAAAGTGGG AGGCGTGACC GGCATTGTGA TGAACTTCCT CATGAAGCAG
GTGGGCGCGG GCGGGCCTTC TCCGAACAAG ATTGTGCTGC TGCGTGACGG CGATGGTGAC
GGACGCGCCG AACAGCGCTT CGTGATGGAG AATCCGGCGC TCGATTCGCC CTTCGGCATG
GCCTTCCGCG ATGGCCGCCT GCTCGTCGCC AACCATAACG CGGTGCTGTC CTTCCCTTAC
CAGCTCGGCC AGACCTCGCT TTCCGGCAAG CCCGAGAAAC TGATGGACCT TCCCGGTGGA
GGCAATCACT GGGCGCGCAA CCTGCTGCTC TCGCCGGACG GCACCCAGCT TTTCGTGACG
GTGGGTTCCG CATCCAACAT TGCAGAAGGC GGGATCGATG CGGAGCGTGG GCGCGCGGCG
ATCCATGAGT ACGATTTCGC CAAAAAACGG TCACGCGAAT TTGCCGGTGG CCTGCGCAAT
CCCAACGGTC TCGATTTCAA CCCGAACAGC GAAGAGCTGT GGACCGTGGT CAACGAGCGA
GACCAGCTTG GTTCCGACCT GGTGCCCGAC TACCTGACCA ACGTACCGTT CGGTTCGAAC
TATGGCTGGC CCTGGGCGTA CTGGAAGAAG AACATCGACT GGCGCGTAAA GGAACCGATG
CCCGAATACC TGATGGATTA CGTGCGCAAG CCTGAATACG GCCTCGGCGC GCACGTAGCG
CCGTTGGGAC TGGCATTCGC TCGGGGCGGG AACCTTTTGG GCGACAAGTT CCAGCAGGGC
GCCTTCATCG CACGGCACGG CTCGTGGAAC CGACGTCCGC TTTCGGGCTA TGACGTGGTC
TTCGTGAAGT TCGACGCGCT TGGCAATGTC CTGCCAAAGT CGCCGGTGCC GGTACTCACC
GGGTTCCTGA CCGAGGACCA GAAGGCACGC GGACGGCCGA CTTGGGTAGC CTTCGCCAAG
GACGGCGCCT TGCTCGTCAG CGACGATACG GGCGGTGTAA TCTGGCGCGT CACGGCCCCG
GGCGCGAAAC CCGCCGCAGA GGTGAAGCCG CTGCCGAAGC GTGCCGCACC TCCGCAGCCG
AAAGGCACCG GGAAGTTCAT CATGAAGCCC AACGCCGATT CAGACTTGGT CAAGCCGCAG
GACTAA
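
A GC content figure such as the one listed under Gene Information is the share of G and C bases in the sequence. As a rough sketch of how it could be recomputed from the block-formatted text above (the helper names are hypothetical, and the example input is only the first two 10-base blocks, not the full gene):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
print(gc_percent("ATGTCCCATT GCCCTTGCAA"))  # 50.0
```

Passing the complete gene sequence to `gc_percent` would give the value for the whole gene.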
 
Protein sequence
MSHCPCKAPG IDLRTMNLIK KILISLVVIL VLVGAYVAWS VRGTPAQFAL NDTTGPRPKL 
ADPDEQTIPT IKTADPIGWK DGEAPVAAEG LQVTRFADKL DHPRTVFTLP NGDVLVAETN
SPPRKVGGVT GIVMNFLMKQ VGAGGPSPNK IVLLRDGDGD GRAEQRFVME NPALDSPFGM
AFRDGRLLVA NHNAVLSFPY QLGQTSLSGK PEKLMDLPGG GNHWARNLLL SPDGTQLFVT
VGSASNIAEG GIDAERGRAA IHEYDFAKKR SREFAGGLRN PNGLDFNPNS EELWTVVNER
DQLGSDLVPD YLTNVPFGSN YGWPWAYWKK NIDWRVKEPM PEYLMDYVRK PEYGLGAHVA
PLGLAFARGG NLLGDKFQQG AFIARHGSWN RRPLSGYDVV FVKFDALGNV LPKSPVPVLT
GFLTEDQKAR GRPTWVAFAK DGALLVSDDT GGVIWRVTAP GAKPAAEVKP LPKRAAPPQP
KGTGKFIMKP NADSDLVKPQ D