Gene Saro_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1543 
Symbol 
ID3917218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1593855 
End bp1595447 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content64% 
IMG OID640444284 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_496818 
Protein GI87199561 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCAAGC ATGAGGCATT CGACTATGTC ATCGTTGGTG CCGGTTCGGC CGGTTGCGTC 
CTCGCAAACC GGCTATCGGC CGATCCCGAC GTGAGTGTCC TGGTACTCGA GGCGGGCGGG
CGCGACACCA GTCCGTTCAT CCACATGCCG GCGGGCTTCT TCCAGTTGTT GCAGAGTGGC
AGCAATGCCT GGCACTACCA GACGGCGCCG CAGGAGCACC TGAACGGCCG CGTACTTGCC
GACGCCCGAG GCAAGGTGCT GGGCGGCAGC AGTTCCATCA ACGGCATGTG CTACAGCCGC
GGCAGTCCCG AGATATTCGA CCACTGGGCG GAACTCGGCA ACGATGGCTG GTCATACAAA
GACGTGCTGC CCTGGTTCCG CAAGGCCGAG GGCAACCCGG GCGCCGATCC TTACTTCCAC
GGCCAGGATG GTCCCTTGTC CGTTACCCAT GCTTCGGTCA CCAACCCGGC GCAGTTGGCC
TGGCTTCGGG CTGCGCAGGA AGCAGGCTTT CCCTACAGCG ACGACCACAA CGGTGCCGCC
CCGGAGGGCT TCGGGCCGGG GGAACACACA ATCCGCAACG GGCGGAGGAT CAGCACGGCT
GTCGCGTATC TCAAACCGGC GATGCGACGT CGCAACCTCG TTGTACGAAC TCGCGCGCAT
GCCACGCGGG TCTTGCTCGA GGGCGCGCGC GCAACAGGAG TGGAATATCG GCAGGGAAGG
GCGCTGCAGA AGGTCCACGC CAGTCGCGAA GTGATCCTTT GTGGTGGCAC TTTCCAGTCG
CCGCAATTGT TGATGCTGTC GGGCATCGGA GACGGCGCAC ATCTTCAGCC GCTCGGTATA
CGTACGGTGG TCGACCTGAA AGGCGTGGGC CGCAACCTCC ACGATCACAT TGGCACGCAA
GTCCAGATGA CCTGCCCAGA GCCCGTGTCC GACTTCTCGG TAGCGACGAA CCCGTTGCGG
ATGGCGCTGG CGGGCCTTCA GTATCTCGTC GCGCGCAAGG GGCCTCTGGC CCGGAGCGGA
ACCGACGTCG TTGCCTATCT GCGCTCGGGC GCGCCCGGGC ACGATGAACT CGATCTCAAG
TTCTATTTCA TCCCGCTGCT GTTCAACGAG GGTGGCGGCA TTGCACGGCA GCATGGCTTC
TCCAACCTGG TTATCCTGAC CCGGCCCGAA AGTCGCGGGG AGCTGCGCCT CCGCTCTGCC
AACCCGGTGG ATCAGCCGCT GATCGATTCG AATTACCTGG CGGAAGGGCG CGACCGCGAT
GCGCTGCGCC GCGGGGTTGG CATTGTTCGC CGGATCTTTG CCCAGCCTGC GTTTGCCCGC
TTTCGCGGCG TCGAATGCAC GCCGGGCGCC GACATTGCCG ATGACGTTGC GCTCGATGGC
TTCTTCCGCG AGACCTGCAA CGTCAATTAC GAGGCCGTGG GCACCTGTCG GATGGGTGAT
GACGAACTCG CCGTGGTCGA TCCGGGGTTG CGAGTTCGGG GTGTGGAAGG TCTTCGCGTC
GTTGACGGGT CGGTAATGCC CCGCATCACG ACCGGGGACC CCAATGCGAC GATCGTGATG
ATCGCGGAAA AGGCCGCACA GATGATCCTC TGA
 
Protein sequence
MRKHEAFDYV IVGAGSAGCV LANRLSADPD VSVLVLEAGG RDTSPFIHMP AGFFQLLQSG 
SNAWHYQTAP QEHLNGRVLA DARGKVLGGS SSINGMCYSR GSPEIFDHWA ELGNDGWSYK
DVLPWFRKAE GNPGADPYFH GQDGPLSVTH ASVTNPAQLA WLRAAQEAGF PYSDDHNGAA
PEGFGPGEHT IRNGRRISTA VAYLKPAMRR RNLVVRTRAH ATRVLLEGAR ATGVEYRQGR
ALQKVHASRE VILCGGTFQS PQLLMLSGIG DGAHLQPLGI RTVVDLKGVG RNLHDHIGTQ
VQMTCPEPVS DFSVATNPLR MALAGLQYLV ARKGPLARSG TDVVAYLRSG APGHDELDLK
FYFIPLLFNE GGGIARQHGF SNLVILTRPE SRGELRLRSA NPVDQPLIDS NYLAEGRDRD
ALRRGVGIVR RIFAQPAFAR FRGVECTPGA DIADDVALDG FFRETCNVNY EAVGTCRMGD
DELAVVDPGL RVRGVEGLRV VDGSVMPRIT TGDPNATIVM IAEKAAQMIL