Gene Saro_3578 details


Gene Information

Locus tag: Saro_3578
Symbol:
ID: 5077727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 195813
End bp: 197438
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 69%
IMG OID: 640481302
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_001165964
Protein GI: 146275804
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
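
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(195813, 197438)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1626 bp) and Protein Length (541 aa).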


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGAAT TCGATTACAT CATCGTCGGC GCGGGTTCGG CGGGCTGCGT CCTGGCCAAC 
CGGCTCAGCG CCGATCCCGC CAACCGCGTC CTGCTGATCG AGGATGGCGG CGACAACCAG
CACCCGTTCA TCAAGATGGC CGGCGGCTTC ATCAAGATCA TGGGCAACCC GGACTATTTC
CGCGTGTTCC CGACAGAACC GCGCCCCGGG ATGCGCCCCG GCATCCACAC CTACGGGCGC
GGGCTCGGCG GATCATCGGC GATCAACGGC ACGTGGTATC TCACCGGCAT GCCCAAGGAC
TTTGACGGCT GGGCGCAATC CGGCCTTGCC GGATGGGGCT GGGACGAAAT CGCCCGCTGC
TACCGCAAGT TCGAGGACTA TCGCGAGCCC GGCGCCCATC CCGGACGCGG CCGCGGCGGC
GAGCTTCAGG TCACTGCCTC GACCTACGAA TCGCCGGTGT TCGATGCCCT CGCGCAAGGG
TTCGCCGCGC AGGGCATGCC CTGGCTGGAC GACATCACCA CGCCGGGCGT GCAGGGGGTA
GGCCGCAGCC AGTACACCGT GGACCGCAAG GGCGTGCGCG AAAGCACCTA CAAGGCTTTC
GTCATGCCGA TCCTGGGCCG CCACAACCTG ACGATCGCAC AGCACACCGC CGTCAAGCGC
GTGACGATCG AACAGGGCCG CGCCACGGGC GTCGTCACCG AGGCGCACGG GCAGGAAAGC
ACCCATGTCG CCAAGCGCGA AGTGATCCTC GCCGCCGGCG TCTATGGCTC GCCCCAACTC
CTTCAGCTCT CGGGCATCGG CGCGGGCGCG GTGTTGCAGG AGCTCGGCAT TCCGGTCCTC
AAGGCCCTGC CGATGGTCGG CCGCCAGCTT TGCGACCACA CCAAGTTCGG CGTCTCGTTC
GACCTCACCA ACCACCCCGG CACCAACCGC GAGTTCTTCG GCTGGCGGCT CTATCGCAAC
GCGCTGCAAT ACTTCCTTAC CGGCACCGGC CACCTCGCCC GCGTCGGCAT GCCCCTGACC
GGCCTATACG CCAGCGAGGG CACGGACAAG GACTGGCCCG ACCTCCAGGT CGCCGCCGCG
CCCTTCGCGA TGCGCACCGT CAACGAGATG GCCGCGCGTC CCGGCAGCCC GCTCACGCCG
AACCCGGGCC TCACCTTCTC GGGCTACCAC CTGCGCCCGA AGAGCCGCGG ATCGATCCGC
CTGGTCTCCC CCGATTTCCG CGATGCGCCC GTCGCCGATG CCGCGATCTG GGCAGATCCT
CACGACAAGG CCAAGAGCCT CGAACTGTTC CGCCTGTTCC GCGCCATCGC CGCATCCGAA
CCGCTGCGGC CCTTCATCGG CAAGGAGCGC ATGCCGGGCC CCGACGTGCA GGACGAAGCC
GCCATCCTCG CCGAACTCGG CAAGATGGTT GAGGTCGGCC TCCACGGGAC AGGCACCTGT
TCGATGGGCA CCGACGAGGC GACCTCCGTC ACCGACGCCC GCGCCCGCGT CCACGGCGTC
GGCGCGCTGC GCGTGGTCGA CTGCTCGATC ATGCCAACCC CCGTTTCGGG CAACACCAAC
GGCCCCGCCA TGGCCTTGGC CGAACGCGCC GCGGAACTGA TCCTCGAGGA CGCCCGCCGA
GGCTGA
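
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any analysis. As a rough sketch of how a GC content figure like the one in the Gene Information section could be computed, the Python below strips whitespace and counts G and C bases. The function names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(blocked)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not the full gene sequence.
example = "ATGC GGCC\nAATT"
print(clean_sequence(example), gc_fraction(example))
```

To apply this to the record, paste the full gene sequence into a string and pass it to `gc_fraction`.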
 
Protein sequence
MAEFDYIIVG AGSAGCVLAN RLSADPANRV LLIEDGGDNQ HPFIKMAGGF IKIMGNPDYF 
RVFPTEPRPG MRPGIHTYGR GLGGSSAING TWYLTGMPKD FDGWAQSGLA GWGWDEIARC
YRKFEDYREP GAHPGRGRGG ELQVTASTYE SPVFDALAQG FAAQGMPWLD DITTPGVQGV
GRSQYTVDRK GVRESTYKAF VMPILGRHNL TIAQHTAVKR VTIEQGRATG VVTEAHGQES
THVAKREVIL AAGVYGSPQL LQLSGIGAGA VLQELGIPVL KALPMVGRQL CDHTKFGVSF
DLTNHPGTNR EFFGWRLYRN ALQYFLTGTG HLARVGMPLT GLYASEGTDK DWPDLQVAAA
PFAMRTVNEM AARPGSPLTP NPGLTFSGYH LRPKSRGSIR LVSPDFRDAP VADAAIWADP
HDKAKSLELF RLFRAIAASE PLRPFIGKER MPGPDVQDEA AILAELGKMV EVGLHGTGTC
SMGTDEATSV TDARARVHGV GALRVVDCSI MPTPVSGNTN GPAMALAERA AELILEDARR
G
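
The protein sequence corresponds to the gene sequence read codon by codon. The Python sketch below shows one way to check that correspondence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation; it does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCGGAAT TCGATTACAT CATCGTCGGC"))
```

The first 30 bases of the gene translate to MAEFDYIIVG, which matches the first block of the protein sequence above.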