Gene Saro_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3651 
Symbol 
ID5077799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp280481 
End bp282142 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content68% 
IMG OID640481374 
Productcholine dehydrogenase 
Protein accessionYP_001166036 
Protein GI146275876 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGG CAGCGGGCGA GTTCGACTTC ATCGTAATCG GCGGCGGCAG CGCGGGGGCG 
GTGCTCGCCG CCCGCCTGTC GGAAGACGCG CAAAGCAGGG TCCTTCTGCT CGAGGCGGGC
GGTGCCAACA CCTCGCTGCT GGTGCGCATG CCCGCTGGCG TCGGCACGCT GATCAAGAAG
AAAAGCCGCC ACAACTGGGG CTTCTGGTCC GACCCCGAAC CCCACATGGA CGGGCGCCGC
ATGTGGCATC CAAGGGGGCG CGGGCTGGGC GGCTCCTCGG CGATCAACGG GATGGTCTAT
ATTCGCGGCC ACGCGCGCGA TTACGACCAG TGGCGGCAGA TGGGCCTCGA AGGCTGGTCC
TTCGCCGAGG TCCTGCCCTA TTTCCGCCGC GCCGAGGACT TCTGCGACGG TGCGGATGCC
TTCCACGGCG CGGGCGGCCC CTTGCGGGTA AGCTGGGGCG AGCGCTCGGA CCACCCGCTC
TATCGCGGCG TGATCGAGGC AGGCCGCCAG GCCGGGCACA AGGTCACCCC CGATTTCAAT
GGCGCTGACC AGGAAGGCTT TGGTCGCTAC CAGCTCACCA TCCACGATGG CGAGCGGTGG
AGCGCCGCGC GCGGCTATCT CGCGCCGGTC GCGGGGCAGC GGGCGAACCT CACGATCGTC
ACCGGGGCGC GCGTCCACCG TGTCGTGGTC GAGGGCGGAC GCGCCACCGG CGTCGAGTAC
AGCCTTGGCA AGGGCAAGCC GGTGCGCCGC GCCCATGCCG CGCGCGAAGT GCTGGTCTGT
GCGGGTGCCC TGCAATCGCC GCAGATCCTG CAGCTTTCGG GGATCGGCGA TCCGGAGGAA
CTGGCAAGGC ATGGTATCGC GCCGGTCCAT CCCCTGCCCG GCGTGGGGGC CAATCTCCAG
GACCACCTCG ACGTAACGCT CAACTGGGCC TGCACGCAGC CGATCACGAT CTACAACGAG
ATCAAGGGGT TGGGCCAGCT CAAGGTCGGC CTGCAATACC TGCTGACCGG CAAGGGCGCG
GGACGGCAGA ACGGGCTTGA GGCGGGAGCC TTCCTCAAGT CGCGGCCCGA TCTCGACCGT
CCGGACCTCC AGATCCACTT CGTGCTGGCC ATCATGCAGG AACACGGCAA GCGTTCGGTC
AAGCGCGACG GGTTCACGCT CCACGTCTGC CAGCTCCGGC CAGAAAGCCG GGGGCGGGTA
TCGCTCGCCT CGGCGGACCC ATATGCCGAT CCCTCGATCC TGGCGAATTT CATGGCCGCC
GAGGAAGACC GCCGCGCTGT CCGCGCGGGC ATCCGCATCG CGCGCGAGGT GGCGGCGCAG
CCTGCGCTTG CACCCTATCG CGGCGAGGAG ATCTGGCCGG GCAACGACGT GCAGACCGAC
GAAGAGATCG ACGCCTGGGT GCGCCGCACC GGCGAGACGA TCTATCACCC TGTCGGCACT
TGCCGCATGG GCACGCAAGG CGATGCGATG GCGGTGGTCG ACAGCCAGTG CCGCGTCATC
GGCCTTGAAG GGCTGCGCGT GGTCGATGCA TCGGTCATGC CGAACCTGAT CGGCGGAAAC
ACCAACGCGC CCACGATCAT GATCGCCGAA AAGATCTCCG ACGCGATCCG GGGCAGGGCA
CCGCTTGCGC CGGTCGAGAC GAGGACGGTG GACTTCGTCT GA
 
Protein sequence
MAEAAGEFDF IVIGGGSAGA VLAARLSEDA QSRVLLLEAG GANTSLLVRM PAGVGTLIKK 
KSRHNWGFWS DPEPHMDGRR MWHPRGRGLG GSSAINGMVY IRGHARDYDQ WRQMGLEGWS
FAEVLPYFRR AEDFCDGADA FHGAGGPLRV SWGERSDHPL YRGVIEAGRQ AGHKVTPDFN
GADQEGFGRY QLTIHDGERW SAARGYLAPV AGQRANLTIV TGARVHRVVV EGGRATGVEY
SLGKGKPVRR AHAAREVLVC AGALQSPQIL QLSGIGDPEE LARHGIAPVH PLPGVGANLQ
DHLDVTLNWA CTQPITIYNE IKGLGQLKVG LQYLLTGKGA GRQNGLEAGA FLKSRPDLDR
PDLQIHFVLA IMQEHGKRSV KRDGFTLHVC QLRPESRGRV SLASADPYAD PSILANFMAA
EEDRRAVRAG IRIAREVAAQ PALAPYRGEE IWPGNDVQTD EEIDAWVRRT GETIYHPVGT
CRMGTQGDAM AVVDSQCRVI GLEGLRVVDA SVMPNLIGGN TNAPTIMIAE KISDAIRGRA
PLAPVETRTV DFV