Gene Saro_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3164 
Symbol 
ID3918206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3378071 
End bp3379762 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content65% 
IMG OID640445948 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_498433 
Protein GI87201176 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.115784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGACG CAATCGTAAT CGGATCGGGA ATGAGCGGCG GCATCGCCGC CAAGGAACTG 
TGCGAACGCG GCCTCAAGAC GCTCGTGATC GAGCGTGGCC GCAAGCTCGA GCACGGCGCG
TCCTATACTG ACTGGATGAA CCCCTGGGAC GTCCCCAACG CTGGCCTCAT TCCCGAAGAG
GAACTGGCCC GCGACTATGC CGTCCAGCGC AACTGCTATG CCGTGAACAC CGCTACGCAG
CAGTATTGGG TCAAGGATAG CGAGCATCCC TACACCACGC CGGAAGACAA GCCCTTCTGG
TGGATTCGCG GCTATCACCT GGGCGGCCGT TCGATCATGT GGGGTCGCCA GACCTATCGC
ATGTCGGAAA TGGACTTCGA GGCCAATGCG CGCGACGGGC ACGGCTCGGA CTGGCCGATC
CGCTATGCCG ATCTCGCGCC GTGGTACGAT CATATCGAGC GGTTCATCGG CGTTTCCGGA
TCGAAGGAGG GATTGCCGCA GCTTCCGGAC GGCGAATTCC TGCCCGCCAT GCCGATGAAC
GACGGCGAAA AGGCGTTCAA GTCGGCGGTG GAGCGCAACT ATCCCGATCG CAAGGTCATC
ATCGGCCGCT GTGCGCACCT GACCGAAGCG CGCGAGCATC ACACGGAACT GGGGCGCAAC
CCCTGCCAGT ACCGCTCGCT CTGCGAACGA GGTTGTTCCT ACGGGGCTTA TCACTCCAGC
CTGTCTTCGT CGCTCCCCGC GGCGGAAGCG ACCGGCAACC TTACCATCGT GACCGACGCC
ATCGCCCATT CGATCATCAC CGATCCCCGG ACGGGCAAGG CCACCGGCGT GCGGGTGATC
GACCAGAACA CCCGCGAAGG CCGGACCTAT GAGGCCAAGG TTGTGTTCCT GTGCGCCTCG
ACCATTCCCA CCGCGCAGAT CCTGCTCAAT TCGCGCAGCG AGGCGAACCC GCGCGGCCTT
GCCAATTCGT CGGACATGGT CGGACGCAAC CTGATGGATC ACCTCTACGG CCTCGGGTAC
GCGGCGCGCA TGCCGGGGCC GGAGACGACC TTTCGCGGGC GGCGCCCCAA CGGTCTCTAC
ATCCCGCGCT ATCGCAACCT GCCAGGCGCC GGCGACACTG CCGGCTTCCT GCGGGGCTAC
GGCTTCCAGG GTGCGGTTGA CCGTAGTCCG TGGCGGGCGG TTGCGAATGC CGCGCCGGGC
GTCGGTGCGG AACTCAAGGA GCGGGTCCGC CACCCCGGCG AATGGATGAC CTACTTCTCC
GGCTTCGGCG AAATGCTGCC GAACCCGGAG AACCGGGTGA CGCTCCATGC GACCAATGTC
GACAAGTGGG GCATGCCCAT CGCCCATATC GACTGCGCGC ACGGCGAGAA CGACCGCAAG
ATGGCGCAGG CGATCCTTGC CGACGGCAAG GCGATGATCG AGGCGGCGGG CGGCCAGATC
GTCATGGCCC GCACGGACCT CGTGCCGCCC GGCCTCGGCA TCCACGAAAT GGGCACCGCC
TGCATGGGCA AGGACCCGAA GACCTCGGTG CTCAACAAGT ACAACCAGGC CCATGACGTG
CCGAACCTGT TCGTCACAGA CGGCGCGGCA ATGGCGTCGG GCGGCTGCCA GAACCCGTCG
CTGACCTACA TGGCGCTTTC CGCCCGCGCG GCGCACCATG CCACGGAGTT CCTCAAGGCC
GGGACGATCT GA
 
Protein sequence
MFDAIVIGSG MSGGIAAKEL CERGLKTLVI ERGRKLEHGA SYTDWMNPWD VPNAGLIPEE 
ELARDYAVQR NCYAVNTATQ QYWVKDSEHP YTTPEDKPFW WIRGYHLGGR SIMWGRQTYR
MSEMDFEANA RDGHGSDWPI RYADLAPWYD HIERFIGVSG SKEGLPQLPD GEFLPAMPMN
DGEKAFKSAV ERNYPDRKVI IGRCAHLTEA REHHTELGRN PCQYRSLCER GCSYGAYHSS
LSSSLPAAEA TGNLTIVTDA IAHSIITDPR TGKATGVRVI DQNTREGRTY EAKVVFLCAS
TIPTAQILLN SRSEANPRGL ANSSDMVGRN LMDHLYGLGY AARMPGPETT FRGRRPNGLY
IPRYRNLPGA GDTAGFLRGY GFQGAVDRSP WRAVANAAPG VGAELKERVR HPGEWMTYFS
GFGEMLPNPE NRVTLHATNV DKWGMPIAHI DCAHGENDRK MAQAILADGK AMIEAAGGQI
VMARTDLVPP GLGIHEMGTA CMGKDPKTSV LNKYNQAHDV PNLFVTDGAA MASGGCQNPS
LTYMALSARA AHHATEFLKA GTI