Gene Saro_3680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3680 
Symbol 
ID5077828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp314150 
End bp315754 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content68% 
IMG OID640481403 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001166065 
Protein GI146275905 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGGT TCGAGTTCGA TTTCGTCATC ATCGGCGGCG GCGTCGCGGG GTGCATCCTC 
GCCAATCGCC TGTCCGCCGA TCCCGCCACC CGCGTCCTGC TGCTCGAGGC GGGCGGCTCC
GACAGGAGCC CGCTGATCGC CGCTCCCGGC GGGCTGCTTC CGATCATGAT GTCGGGCGCC
CACGCATGGC GCTACGTGTC CGCGCCCCAA CGCCATCTCG ACGACCGCGT GCTCTATCTC
CCGCGCGGCA AGGTGCTTGG TGGAGGCTCC TCCATCAACG GCATGACCTA TGACCGGGGC
TTCCATTCGG ACTACGATCG CTGGGCACAG GCGGGCAATC GGGGCTGGTC ATTCGAGGAT
GTCCTCCCCT ATTTCCGCAA GCTCGAGAAC TACCTGCCCA GCGAGGACGA ATGGCACGGT
AGGGGTGGCC CGATCCAGGT TACCCGCGCC GCGCAGGACC ATCCCTTTGC GAAGGCCTTC
CTCAAGGCGG GCGCCGAAGC CGGCTACCCC CTGACGCAAG ATCTCAACGG CGCCTCGCGC
GACGGCTTCG GCGCGGTGGA CCTTACCGTC GGCCGGGGTC GCCGTTCCAG CGCCTCGTCC
GCCTACCTGC GCCCTGCCAA GGGCAGGCCC AACCTCACCG TCCTGACCCA GGCGCATACC
CGCCGCATCG TGATCGAGAA CGGCCGCGCC ACCGGCGTGA TCTTCCGCCG CAAGGGCGCG
GACCGGCTGG CACTGGCCGC GCGCGAGGTG ATCCTTTCGG CAGGCGCGAT CAACAGCCCG
CAAATCCTCA TGCTCTCGGG ACTGGGCCCG GCCGCGCACC TTGCCGAACA CGGCATTCAG
GTCCTGCACG ATCTTCCCGG CGTCGGGCAG GGCCTGCAAG ACCATCTCGC CGCCCACGTA
AAGTACCGCT CGACCAAGCC CTGGTCGATG CTGCGCTATC TCAATCCCCT GCGCGGCGCG
CTCGCCATGG CCCAGTATGC CCTCCTGCGC CGAGGTCCAC TCGCCGATCC CGGCATGTCC
GTCGCCTGCA TGGTCCGCTC CGATCCCTCG CTGGATGAAC CCGACATCAA GATGCTGCTG
GTGAGCGCGC TCTTCGCGCA GAACGGGCGC GAGATGGTGC CGATGCACGG CTTCTACGCC
CATATCAACG TCGCCCGCCC GCAATCGCGG GGTTCGGTCA CGCTCGCCAG CGCCGATCCG
GAAGTGCCGC CGGTCATCGA CCAGAACTAC AACGCCGCTC AGGAAGACCG CCGCGCCATG
CGCGAAGGCG TGCGCATCGC CCGCCGCATC TTCGCCCAGC CCGCTTTCGA CATCATGCGC
GGAGAGGAAC TGGCGCCCGG CAGCGGGGTC GAATCCGATG CGCAGATCGA CGCCTATATC
CGCGCCACCG CCGAGGCCGA CTATCACTCC ACCAGCACCG CCCGCATGGG TCGCGATCCG
ATGGCCGTGG TCGATGACCG ACTGCGCGTC CACGGCGTTG CAGCCCTGCG GGTGGTCGAT
GCTTCGGTCA TGCCGCACCT TCCGGGCGGC AACACCGCCA TCCCCGTCGC GATGATCGCC
GAAAAGGCCG CCGACCTCAT TCTTTCGAAG GATTCCCGCC CATGA
 
Protein sequence
MERFEFDFVI IGGGVAGCIL ANRLSADPAT RVLLLEAGGS DRSPLIAAPG GLLPIMMSGA 
HAWRYVSAPQ RHLDDRVLYL PRGKVLGGGS SINGMTYDRG FHSDYDRWAQ AGNRGWSFED
VLPYFRKLEN YLPSEDEWHG RGGPIQVTRA AQDHPFAKAF LKAGAEAGYP LTQDLNGASR
DGFGAVDLTV GRGRRSSASS AYLRPAKGRP NLTVLTQAHT RRIVIENGRA TGVIFRRKGA
DRLALAAREV ILSAGAINSP QILMLSGLGP AAHLAEHGIQ VLHDLPGVGQ GLQDHLAAHV
KYRSTKPWSM LRYLNPLRGA LAMAQYALLR RGPLADPGMS VACMVRSDPS LDEPDIKMLL
VSALFAQNGR EMVPMHGFYA HINVARPQSR GSVTLASADP EVPPVIDQNY NAAQEDRRAM
REGVRIARRI FAQPAFDIMR GEELAPGSGV ESDAQIDAYI RATAEADYHS TSTARMGRDP
MAVVDDRLRV HGVAALRVVD ASVMPHLPGG NTAIPVAMIA EKAADLILSK DSRP