Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3651 |
Symbol | |
ID | 5077799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 280481 |
End bp | 282142 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481374 |
Product | choline dehydrogenase |
Protein accession | YP_001166036 |
Protein GI | 146275876 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGG CAGCGGGCGA GTTCGACTTC ATCGTAATCG GCGGCGGCAG CGCGGGGGCG GTGCTCGCCG CCCGCCTGTC GGAAGACGCG CAAAGCAGGG TCCTTCTGCT CGAGGCGGGC GGTGCCAACA CCTCGCTGCT GGTGCGCATG CCCGCTGGCG TCGGCACGCT GATCAAGAAG AAAAGCCGCC ACAACTGGGG CTTCTGGTCC GACCCCGAAC CCCACATGGA CGGGCGCCGC ATGTGGCATC CAAGGGGGCG CGGGCTGGGC GGCTCCTCGG CGATCAACGG GATGGTCTAT ATTCGCGGCC ACGCGCGCGA TTACGACCAG TGGCGGCAGA TGGGCCTCGA AGGCTGGTCC TTCGCCGAGG TCCTGCCCTA TTTCCGCCGC GCCGAGGACT TCTGCGACGG TGCGGATGCC TTCCACGGCG CGGGCGGCCC CTTGCGGGTA AGCTGGGGCG AGCGCTCGGA CCACCCGCTC TATCGCGGCG TGATCGAGGC AGGCCGCCAG GCCGGGCACA AGGTCACCCC CGATTTCAAT GGCGCTGACC AGGAAGGCTT TGGTCGCTAC CAGCTCACCA TCCACGATGG CGAGCGGTGG AGCGCCGCGC GCGGCTATCT CGCGCCGGTC GCGGGGCAGC GGGCGAACCT CACGATCGTC ACCGGGGCGC GCGTCCACCG TGTCGTGGTC GAGGGCGGAC GCGCCACCGG CGTCGAGTAC AGCCTTGGCA AGGGCAAGCC GGTGCGCCGC GCCCATGCCG CGCGCGAAGT GCTGGTCTGT GCGGGTGCCC TGCAATCGCC GCAGATCCTG CAGCTTTCGG GGATCGGCGA TCCGGAGGAA CTGGCAAGGC ATGGTATCGC GCCGGTCCAT CCCCTGCCCG GCGTGGGGGC CAATCTCCAG GACCACCTCG ACGTAACGCT CAACTGGGCC TGCACGCAGC CGATCACGAT CTACAACGAG ATCAAGGGGT TGGGCCAGCT CAAGGTCGGC CTGCAATACC TGCTGACCGG CAAGGGCGCG GGACGGCAGA ACGGGCTTGA GGCGGGAGCC TTCCTCAAGT CGCGGCCCGA TCTCGACCGT CCGGACCTCC AGATCCACTT CGTGCTGGCC ATCATGCAGG AACACGGCAA GCGTTCGGTC AAGCGCGACG GGTTCACGCT CCACGTCTGC CAGCTCCGGC CAGAAAGCCG GGGGCGGGTA TCGCTCGCCT CGGCGGACCC ATATGCCGAT CCCTCGATCC TGGCGAATTT CATGGCCGCC GAGGAAGACC GCCGCGCTGT CCGCGCGGGC ATCCGCATCG CGCGCGAGGT GGCGGCGCAG CCTGCGCTTG CACCCTATCG CGGCGAGGAG ATCTGGCCGG GCAACGACGT GCAGACCGAC GAAGAGATCG ACGCCTGGGT GCGCCGCACC GGCGAGACGA TCTATCACCC TGTCGGCACT TGCCGCATGG GCACGCAAGG CGATGCGATG GCGGTGGTCG ACAGCCAGTG CCGCGTCATC GGCCTTGAAG GGCTGCGCGT GGTCGATGCA TCGGTCATGC CGAACCTGAT CGGCGGAAAC ACCAACGCGC CCACGATCAT GATCGCCGAA AAGATCTCCG ACGCGATCCG GGGCAGGGCA CCGCTTGCGC CGGTCGAGAC GAGGACGGTG GACTTCGTCT GA
|
Protein sequence | MAEAAGEFDF IVIGGGSAGA VLAARLSEDA QSRVLLLEAG GANTSLLVRM PAGVGTLIKK KSRHNWGFWS DPEPHMDGRR MWHPRGRGLG GSSAINGMVY IRGHARDYDQ WRQMGLEGWS FAEVLPYFRR AEDFCDGADA FHGAGGPLRV SWGERSDHPL YRGVIEAGRQ AGHKVTPDFN GADQEGFGRY QLTIHDGERW SAARGYLAPV AGQRANLTIV TGARVHRVVV EGGRATGVEY SLGKGKPVRR AHAAREVLVC AGALQSPQIL QLSGIGDPEE LARHGIAPVH PLPGVGANLQ DHLDVTLNWA CTQPITIYNE IKGLGQLKVG LQYLLTGKGA GRQNGLEAGA FLKSRPDLDR PDLQIHFVLA IMQEHGKRSV KRDGFTLHVC QLRPESRGRV SLASADPYAD PSILANFMAA EEDRRAVRAG IRIAREVAAQ PALAPYRGEE IWPGNDVQTD EEIDAWVRRT GETIYHPVGT CRMGTQGDAM AVVDSQCRVI GLEGLRVVDA SVMPNLIGGN TNAPTIMIAE KISDAIRGRA PLAPVETRTV DFV
|
| |