Gene Saro_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1208 
Symbol 
ID3916506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1255008 
End bp1256687 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content66% 
IMG OID640443945 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_496487 
Protein GI87199230 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.397312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTCG ATGCGATCGT GGTAGGCTCG GGTATCACCG GCGGCTGGGC AGCCAAGGAA 
CTGACGCAGG CGGGCCTAAA GGTCCTGATG ATCGAGCGCG GGCGCGAGAT CGTCCACGGC
GATTACCCGA CCGAGATGAA GACGCCCTGG GAGATGCCGT TTCGCGGCGT GGGCGATGCC
GCGCTTTATG CGCGCGAATA CCAGGTGCAG GCGCAGAACC GCCATTTCAA CGAGTTCACG
CAGGGGCACT TCGTCAACGA CAAGGAGAAC CCCTACGCCA CTGGTCCGGA CAGCGAGTTC
AACTGGCTGC GGTCCTATCA GCTTGGCGGA CGCTCGCTGA CCTGGGGGAG GCAGGCCTAT
CGCTGGTCGG ACTACGATTT CAGCGCCAAC AAGCGCGACG GCAACGGCAC TGACTGGCCG
ATCCGCTACG CCGACCTGGC TCCATGGTAC GACAAGGTCG AGGAGTTCAT CGGCGTTTCC
GGCGCGGCGG AGGGCCTGCC GCAGCTTCCC GACGGCCGGT TCCAGCCGCC GATGGCGCTG
AACGCCGTGG AGCGTCACGT CCGGCAGGTC GTGGCGGACA GGTATGGTCG CTGCATGACG
GTCGGTCGTG TCGCCAACAT GACCCAGGCC AAGCCGGACG AGGGCCGCTC CGCCTGCCAG
AACCGTTCGA TCTGTGCGCG CGGTTGCTCG TACGGGGCAT ATTTCTCGAC GCAATCGAGC
ACGCTGCCGG CGGCCAAGGC CACGGGCAAC CTGACCGTGG TCACCGATGC CATCGTCGAG
CATGTCGACT ACGATCCGGC GACGAAGCGC GTGACCGGCG TGCGCTATGT GAACACCAAG
GACGGTTCGC GCGGTAGCGC CACCGCGCGC ATGGTGTTCC TCAACGCCAG TGCATTCAAT
TCGGTGCACG TGCTGCTGAA TTCGCGTTCC GAGGCGATGC CGAACGGGCT GGGCAATTCG
AGCGGCGTTC TGGGCACGCA GATCATGGAC CACGCCAACA CGCTGTCGAC GATTGCGCTG
TTCCCGCAGT TCAACGGCCG CACCAGTTTC GGCAACCGGC CGACGGGCGT GGTCATCGCG
CGCTATCGCA ACATGGACGA GATGGACGGT GCGGGGCACA CGCGTGGCTA TTCGTACCAG
GGCGGCGCAT TGCAGAGCAA CTGGGGCGCG GGCAAGCGCG AGGCGGGCAT CGGCGCCGAC
TTCAAGGACA AGCTGCGTAC GCCGGGCATG TGGCGCATGG TGCTGGTCGC CTTCGCCGAC
TGCGTCCCGC GCGACAGCAA CCGCCTGACG CTGGACCCGG TGAAAACCGA CCGCTTCGGC
ATTCCCCAAC TCCGCATCGA CTTCGCCTAT GGCAAGGAAG AGCAGGCAGC ACTTGCCCAG
GCCAAGGCCG ATGCCGCCGA AATGATGACG GCGGCGGGCG GCATGGTCGT CATGGGTTCG
GACCAGCCCG GCACCGGTGG CATGGCGATC CACGAGATGG GCGGCGCGCG CATGGGCCAC
GACCCGAAGA CCTCGGTGCT CAACAAGTGG AGCCAGAGCC ACGACGTCGC CAACCTGTTC
GTCACCGACG GCGCGCAGAT GGCGTCCTCG GCCTGCCAGA ACCCTTCGCT CACCTACATG
GCGCTGACCG CACGTGCCTG CGATGCGGCG GTCAGGATGC TGCGCGAAGG TGCGATCTGA
 
Protein sequence
MQFDAIVVGS GITGGWAAKE LTQAGLKVLM IERGREIVHG DYPTEMKTPW EMPFRGVGDA 
ALYAREYQVQ AQNRHFNEFT QGHFVNDKEN PYATGPDSEF NWLRSYQLGG RSLTWGRQAY
RWSDYDFSAN KRDGNGTDWP IRYADLAPWY DKVEEFIGVS GAAEGLPQLP DGRFQPPMAL
NAVERHVRQV VADRYGRCMT VGRVANMTQA KPDEGRSACQ NRSICARGCS YGAYFSTQSS
TLPAAKATGN LTVVTDAIVE HVDYDPATKR VTGVRYVNTK DGSRGSATAR MVFLNASAFN
SVHVLLNSRS EAMPNGLGNS SGVLGTQIMD HANTLSTIAL FPQFNGRTSF GNRPTGVVIA
RYRNMDEMDG AGHTRGYSYQ GGALQSNWGA GKREAGIGAD FKDKLRTPGM WRMVLVAFAD
CVPRDSNRLT LDPVKTDRFG IPQLRIDFAY GKEEQAALAQ AKADAAEMMT AAGGMVVMGS
DQPGTGGMAI HEMGGARMGH DPKTSVLNKW SQSHDVANLF VTDGAQMASS ACQNPSLTYM
ALTARACDAA VRMLREGAI