Gene Information |
Locus tag | Saro_1543 |
Symbol | |
ID | 3917218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1593855 |
End bp | 1595447 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444284 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_496818 |
Protein GI | 87199561 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
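The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1593855
END_BP = 1595447


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1593
print(protein_length(length_bp))  # 530
```

Under these assumptions the computed values agree with the listed gene length (1593 bp) and protein length (530 aa).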
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCAAGC ATGAGGCATT CGACTATGTC ATCGTTGGTG CCGGTTCGGC CGGTTGCGTC CTCGCAAACC GGCTATCGGC CGATCCCGAC GTGAGTGTCC TGGTACTCGA GGCGGGCGGG CGCGACACCA GTCCGTTCAT CCACATGCCG GCGGGCTTCT TCCAGTTGTT GCAGAGTGGC AGCAATGCCT GGCACTACCA GACGGCGCCG CAGGAGCACC TGAACGGCCG CGTACTTGCC GACGCCCGAG GCAAGGTGCT GGGCGGCAGC AGTTCCATCA ACGGCATGTG CTACAGCCGC GGCAGTCCCG AGATATTCGA CCACTGGGCG GAACTCGGCA ACGATGGCTG GTCATACAAA GACGTGCTGC CCTGGTTCCG CAAGGCCGAG GGCAACCCGG GCGCCGATCC TTACTTCCAC GGCCAGGATG GTCCCTTGTC CGTTACCCAT GCTTCGGTCA CCAACCCGGC GCAGTTGGCC TGGCTTCGGG CTGCGCAGGA AGCAGGCTTT CCCTACAGCG ACGACCACAA CGGTGCCGCC CCGGAGGGCT TCGGGCCGGG GGAACACACA ATCCGCAACG GGCGGAGGAT CAGCACGGCT GTCGCGTATC TCAAACCGGC GATGCGACGT CGCAACCTCG TTGTACGAAC TCGCGCGCAT GCCACGCGGG TCTTGCTCGA GGGCGCGCGC GCAACAGGAG TGGAATATCG GCAGGGAAGG GCGCTGCAGA AGGTCCACGC CAGTCGCGAA GTGATCCTTT GTGGTGGCAC TTTCCAGTCG CCGCAATTGT TGATGCTGTC GGGCATCGGA GACGGCGCAC ATCTTCAGCC GCTCGGTATA CGTACGGTGG TCGACCTGAA AGGCGTGGGC CGCAACCTCC ACGATCACAT TGGCACGCAA GTCCAGATGA CCTGCCCAGA GCCCGTGTCC GACTTCTCGG TAGCGACGAA CCCGTTGCGG ATGGCGCTGG CGGGCCTTCA GTATCTCGTC GCGCGCAAGG GGCCTCTGGC CCGGAGCGGA ACCGACGTCG TTGCCTATCT GCGCTCGGGC GCGCCCGGGC ACGATGAACT CGATCTCAAG TTCTATTTCA TCCCGCTGCT GTTCAACGAG GGTGGCGGCA TTGCACGGCA GCATGGCTTC TCCAACCTGG TTATCCTGAC CCGGCCCGAA AGTCGCGGGG AGCTGCGCCT CCGCTCTGCC AACCCGGTGG ATCAGCCGCT GATCGATTCG AATTACCTGG CGGAAGGGCG CGACCGCGAT GCGCTGCGCC GCGGGGTTGG CATTGTTCGC CGGATCTTTG CCCAGCCTGC GTTTGCCCGC TTTCGCGGCG TCGAATGCAC GCCGGGCGCC GACATTGCCG ATGACGTTGC GCTCGATGGC TTCTTCCGCG AGACCTGCAA CGTCAATTAC GAGGCCGTGG GCACCTGTCG GATGGGTGAT GACGAACTCG CCGTGGTCGA TCCGGGGTTG CGAGTTCGGG GTGTGGAAGG TCTTCGCGTC GTTGACGGGT CGGTAATGCC CCGCATCACG ACCGGGGACC CCAATGCGAC GATCGTGATG ATCGCGGAAA AGGCCGCACA GATGATCCTC TGA
|
Protein sequence | MRKHEAFDYV IVGAGSAGCV LANRLSADPD VSVLVLEAGG RDTSPFIHMP AGFFQLLQSG SNAWHYQTAP QEHLNGRVLA DARGKVLGGS SSINGMCYSR GSPEIFDHWA ELGNDGWSYK DVLPWFRKAE GNPGADPYFH GQDGPLSVTH ASVTNPAQLA WLRAAQEAGF PYSDDHNGAA PEGFGPGEHT IRNGRRISTA VAYLKPAMRR RNLVVRTRAH ATRVLLEGAR ATGVEYRQGR ALQKVHASRE VILCGGTFQS PQLLMLSGIG DGAHLQPLGI RTVVDLKGVG RNLHDHIGTQ VQMTCPEPVS DFSVATNPLR MALAGLQYLV ARKGPLARSG TDVVAYLRSG APGHDELDLK FYFIPLLFNE GGGIARQHGF SNLVILTRPE SRGELRLRSA NPVDQPLIDS NYLAEGRDRD ALRRGVGIVR RIFAQPAFAR FRGVECTPGA DIADDVALDG FFRETCNVNY EAVGTCRMGD DELAVVDPGL RVRGVEGLRV VDGSVMPRIT TGDPNATIVM IAEKAAQMIL
|
| |
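The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is one of the permitted start codons and an initiating codon is read as methionine. A minimal sketch of that translation follows, using only the standard library. The helper name `translate_cds` is hypothetical, and the start codon set is the one listed for table 11 in the NCBI genetic code tables as far as I am aware; treat it as an assumption rather than an authoritative implementation.

```python
# Codon table in the conventional TCAG ordering.
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace (e.g. 10-base grouping) is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiating codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCGCAAGC ATGAGGCATT CGACTATGTC"))  # MRKHEAFDYV
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence through the same function is one way to cross-check the complete record.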