Gene Saro_3554 details


Gene Information

Locus tag: Saro_3554
Symbol:
ID: 5077703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 170377
End bp: 171999
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 70%
IMG OID: 640481278
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_001165940
Protein GI: 146275780
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
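
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(170377, 171999)
print(length, protein_length(length))  # prints "1623 540"
```

Under these assumptions the computed values match the Gene Length (1623 bp) and Protein Length (540 aa) fields.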


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAGG GCTGGGATTA CATCGTCGTC GGCGCGGGCT CCGCCGGATG CGTCGTGGCC 
GAAAGGCTGA GCGCCGATGG CCGCCACCGC GTGCTCGTGC TCGAGGCGGG CGGCGAGAAC
GACGGTTTCT GGGTGACCCT GCCCAAGGGT GTGGCCAGGC TCGTCACCAA TCCCGACCAC
ATCTGGGCCT ATCCGGTCGC TCAGCCCCGC GCGGCGGGGA TGCCCGCGAA CGAGGTCTGG
ATCAGGGGCA AGGGTCTTGG CGGTTCGTCC GCCGTCAACG GCATGATCTG GAGCCGGGGC
GAGCCGGCCG ACTATGATGC GTGGGAGCAG GCCGGCGCCA CCGGCTGGAA TGGCGCTGCC
ATGACCGAGG CCTTCCTCGC GCTCGAGGAT CACGCCGCCG GTCCCGGCCC GATGCGCGGC
AGCGGCGGCC TCGTCCACGT CGATCCGGCA ATCTACACCT ATCCGCTGGC CGATCGCATG
ATCGCGGCGG GCGAGTCCTG TGGCATGGCC CGCGTGGCCG ATCTCAACGA ACGCGGCGGT
CCGCGTGTCG GCCTCTACAG CCACAATATC CGGAAGGGCC GCCGCCAAAG CTCGGGCCGC
ACCTTCCTCG CCGCGGCGCG CCGCCGCGCC AACGTCAGGG TCGTGACCGG CGCGATTGCC
GAACGCGTCG TGACACGCGA TGGCCGCGCG GTCGCGGTGG AGGCGCGCGT CAACGGCGTC
CTCACCCGGT TCGATTGCGC GGGCGAGGTA ATCGTCAGCG GCGGCGCGAT GGAAAGCCCG
CTCCTGCTCC AGCGCTCGGG CATCGGCGAT GGCGCCCGCC TTCGCGCCGT CGGGATCGAA
CCTCTGGTCC AGTCGCCCGA CGTGGGCGAA CGCCTGGTCG AGCATCTCGG TTTTTCCATG
CCCCACCGCC TCGTCGGCGA GCGCGGTACC GGCGGCTTGC TGCGGGGGCC GGGCCTGGTC
GCGGCAGTCC TGCGCTATGC CCTGACGCGC GGCGGCATCA TGGCGACCGG TCCGTTCGAG
GTCGGCGCCT TCTGCAACGT CGCCCATCCC GATGGCCGCG TCGATGCCCA GCTCTATCTT
GGCGGCTATA CCTTCGAAGT CTCGGACGAT GGCAACCCGG TGCCGCTCGA CAAGATCTCC
TCGCGCCCCG GCATGACGAT CTACGGCCAG CTCCTGCGCC TGACCAGCGA GGCGTCGGTT
CGCGCCGCCG GCCCGGATGC CGCAACCGCG CCCGTGATCC TGCCCAACTG GCTCTCGACC
GAACACGACC GCAAGTCCGC CGTCGCCATG GTTCGCGCCA TGCGCCGTTT CGTGCGCACC
GCGCCGCTTG CCGAGGTGGT GGGAGAGGAA ATGATACCGG GCGCCGCGGT CGAAAGCGAC
GAGGCCATTC TCGACGCTTT CCGCCACCTC GCCAGTTGCG GCCTCCATGC CATAGGCTCC
TGCCGCATGG GCTCGGACCA GCGCGCGGTG GTCGATCCGC GCCTCAGGGT GCGCGGGGTC
GATGGCTTGC GCGTCGTCGA TTGTTCGGTC ATGCCCGGAC ACATCACCGG CAACACCAAC
GCGCCGGCCA TGGCCCTCGG TTATCGTGCC GGGAACCTGA TCCTCGAGGA TCGCAAGGGA
TGA
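
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage helper; the function names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of 10 bases).
first_row = "ATGGAACAGG GCTGGGATTA CATCGTCGTC GGCGCGGGCT CCGCCGGATG CGTCGTGGCC "
seq = clean_sequence(first_row)
print(len(seq))             # prints 60
print(seq.startswith("ATG"))  # prints True
```

Passing the full 1623 bp sequence through `clean_sequence` and `gc_percent` would allow a comparison against the GC content field reported above.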
 
Protein sequence
MEQGWDYIVV GAGSAGCVVA ERLSADGRHR VLVLEAGGEN DGFWVTLPKG VARLVTNPDH 
IWAYPVAQPR AAGMPANEVW IRGKGLGGSS AVNGMIWSRG EPADYDAWEQ AGATGWNGAA
MTEAFLALED HAAGPGPMRG SGGLVHVDPA IYTYPLADRM IAAGESCGMA RVADLNERGG
PRVGLYSHNI RKGRRQSSGR TFLAAARRRA NVRVVTGAIA ERVVTRDGRA VAVEARVNGV
LTRFDCAGEV IVSGGAMESP LLLQRSGIGD GARLRAVGIE PLVQSPDVGE RLVEHLGFSM
PHRLVGERGT GGLLRGPGLV AAVLRYALTR GGIMATGPFE VGAFCNVAHP DGRVDAQLYL
GGYTFEVSDD GNPVPLDKIS SRPGMTIYGQ LLRLTSEASV RAAGPDAATA PVILPNWLST
EHDRKSAVAM VRAMRRFVRT APLAEVVGEE MIPGAAVESD EAILDAFRHL ASCGLHAIGS
CRMGSDQRAV VDPRLRVRGV DGLRVVDCSV MPGHITGNTN APAMALGYRA GNLILEDRKG