Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0129 |
Symbol | |
ID | 3916015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 131784 |
End bp | 133289 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640442854 |
Product | L-sorbosone dehydrogenase |
Protein accession | YP_495412 |
Protein GI | 87198155 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCATT GCCCTTGCAA GGCACCGGGC ATAGATTTGC GCACCATGAA CCTCATCAAG AAGATCCTGA TCTCGCTCGT CGTGATCCTG GTGCTCGTCG GCGCCTATGT CGCCTGGTCC GTTCGCGGCA CGCCCGCCCA GTTCGCCTTG AACGACACCA CAGGGCCCCG GCCGAAACTG GCAGACCCCG ACGAGCAGAC CATCCCCACG ATCAAGACCG CCGATCCCAT TGGCTGGAAG GACGGCGAGG CGCCGGTCGC TGCCGAAGGG CTGCAGGTCA CGCGCTTTGC CGACAAGCTC GACCACCCCC GCACGGTCTT CACCCTGCCC AATGGCGATG TCCTCGTGGC CGAAACCAAT TCGCCCCCGC GCAAAGTGGG AGGCGTGACC GGCATTGTGA TGAACTTCCT CATGAAGCAG GTGGGCGCGG GCGGGCCTTC TCCGAACAAG ATTGTGCTGC TGCGTGACGG CGATGGTGAC GGACGCGCCG AACAGCGCTT CGTGATGGAG AATCCGGCGC TCGATTCGCC CTTCGGCATG GCCTTCCGCG ATGGCCGCCT GCTCGTCGCC AACCATAACG CGGTGCTGTC CTTCCCTTAC CAGCTCGGCC AGACCTCGCT TTCCGGCAAG CCCGAGAAAC TGATGGACCT TCCCGGTGGA GGCAATCACT GGGCGCGCAA CCTGCTGCTC TCGCCGGACG GCACCCAGCT TTTCGTGACG GTGGGTTCCG CATCCAACAT TGCAGAAGGC GGGATCGATG CGGAGCGTGG GCGCGCGGCG ATCCATGAGT ACGATTTCGC CAAAAAACGG TCACGCGAAT TTGCCGGTGG CCTGCGCAAT CCCAACGGTC TCGATTTCAA CCCGAACAGC GAAGAGCTGT GGACCGTGGT CAACGAGCGA GACCAGCTTG GTTCCGACCT GGTGCCCGAC TACCTGACCA ACGTACCGTT CGGTTCGAAC TATGGCTGGC CCTGGGCGTA CTGGAAGAAG AACATCGACT GGCGCGTAAA GGAACCGATG CCCGAATACC TGATGGATTA CGTGCGCAAG CCTGAATACG GCCTCGGCGC GCACGTAGCG CCGTTGGGAC TGGCATTCGC TCGGGGCGGG AACCTTTTGG GCGACAAGTT CCAGCAGGGC GCCTTCATCG CACGGCACGG CTCGTGGAAC CGACGTCCGC TTTCGGGCTA TGACGTGGTC TTCGTGAAGT TCGACGCGCT TGGCAATGTC CTGCCAAAGT CGCCGGTGCC GGTACTCACC GGGTTCCTGA CCGAGGACCA GAAGGCACGC GGACGGCCGA CTTGGGTAGC CTTCGCCAAG GACGGCGCCT TGCTCGTCAG CGACGATACG GGCGGTGTAA TCTGGCGCGT CACGGCCCCG GGCGCGAAAC CCGCCGCAGA GGTGAAGCCG CTGCCGAAGC GTGCCGCACC TCCGCAGCCG AAAGGCACCG GGAAGTTCAT CATGAAGCCC AACGCCGATT CAGACTTGGT CAAGCCGCAG GACTAA
|
Protein sequence | MSHCPCKAPG IDLRTMNLIK KILISLVVIL VLVGAYVAWS VRGTPAQFAL NDTTGPRPKL ADPDEQTIPT IKTADPIGWK DGEAPVAAEG LQVTRFADKL DHPRTVFTLP NGDVLVAETN SPPRKVGGVT GIVMNFLMKQ VGAGGPSPNK IVLLRDGDGD GRAEQRFVME NPALDSPFGM AFRDGRLLVA NHNAVLSFPY QLGQTSLSGK PEKLMDLPGG GNHWARNLLL SPDGTQLFVT VGSASNIAEG GIDAERGRAA IHEYDFAKKR SREFAGGLRN PNGLDFNPNS EELWTVVNER DQLGSDLVPD YLTNVPFGSN YGWPWAYWKK NIDWRVKEPM PEYLMDYVRK PEYGLGAHVA PLGLAFARGG NLLGDKFQQG AFIARHGSWN RRPLSGYDVV FVKFDALGNV LPKSPVPVLT GFLTEDQKAR GRPTWVAFAK DGALLVSDDT GGVIWRVTAP GAKPAAEVKP LPKRAAPPQP KGTGKFIMKP NADSDLVKPQ D
|
| |