Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1208 |
Symbol | |
ID | 3916506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1255008 |
End bp | 1256687 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443945 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_496487 |
Protein GI | 87199230 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.397312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTCG ATGCGATCGT GGTAGGCTCG GGTATCACCG GCGGCTGGGC AGCCAAGGAA CTGACGCAGG CGGGCCTAAA GGTCCTGATG ATCGAGCGCG GGCGCGAGAT CGTCCACGGC GATTACCCGA CCGAGATGAA GACGCCCTGG GAGATGCCGT TTCGCGGCGT GGGCGATGCC GCGCTTTATG CGCGCGAATA CCAGGTGCAG GCGCAGAACC GCCATTTCAA CGAGTTCACG CAGGGGCACT TCGTCAACGA CAAGGAGAAC CCCTACGCCA CTGGTCCGGA CAGCGAGTTC AACTGGCTGC GGTCCTATCA GCTTGGCGGA CGCTCGCTGA CCTGGGGGAG GCAGGCCTAT CGCTGGTCGG ACTACGATTT CAGCGCCAAC AAGCGCGACG GCAACGGCAC TGACTGGCCG ATCCGCTACG CCGACCTGGC TCCATGGTAC GACAAGGTCG AGGAGTTCAT CGGCGTTTCC GGCGCGGCGG AGGGCCTGCC GCAGCTTCCC GACGGCCGGT TCCAGCCGCC GATGGCGCTG AACGCCGTGG AGCGTCACGT CCGGCAGGTC GTGGCGGACA GGTATGGTCG CTGCATGACG GTCGGTCGTG TCGCCAACAT GACCCAGGCC AAGCCGGACG AGGGCCGCTC CGCCTGCCAG AACCGTTCGA TCTGTGCGCG CGGTTGCTCG TACGGGGCAT ATTTCTCGAC GCAATCGAGC ACGCTGCCGG CGGCCAAGGC CACGGGCAAC CTGACCGTGG TCACCGATGC CATCGTCGAG CATGTCGACT ACGATCCGGC GACGAAGCGC GTGACCGGCG TGCGCTATGT GAACACCAAG GACGGTTCGC GCGGTAGCGC CACCGCGCGC ATGGTGTTCC TCAACGCCAG TGCATTCAAT TCGGTGCACG TGCTGCTGAA TTCGCGTTCC GAGGCGATGC CGAACGGGCT GGGCAATTCG AGCGGCGTTC TGGGCACGCA GATCATGGAC CACGCCAACA CGCTGTCGAC GATTGCGCTG TTCCCGCAGT TCAACGGCCG CACCAGTTTC GGCAACCGGC CGACGGGCGT GGTCATCGCG CGCTATCGCA ACATGGACGA GATGGACGGT GCGGGGCACA CGCGTGGCTA TTCGTACCAG GGCGGCGCAT TGCAGAGCAA CTGGGGCGCG GGCAAGCGCG AGGCGGGCAT CGGCGCCGAC TTCAAGGACA AGCTGCGTAC GCCGGGCATG TGGCGCATGG TGCTGGTCGC CTTCGCCGAC TGCGTCCCGC GCGACAGCAA CCGCCTGACG CTGGACCCGG TGAAAACCGA CCGCTTCGGC ATTCCCCAAC TCCGCATCGA CTTCGCCTAT GGCAAGGAAG AGCAGGCAGC ACTTGCCCAG GCCAAGGCCG ATGCCGCCGA AATGATGACG GCGGCGGGCG GCATGGTCGT CATGGGTTCG GACCAGCCCG GCACCGGTGG CATGGCGATC CACGAGATGG GCGGCGCGCG CATGGGCCAC GACCCGAAGA CCTCGGTGCT CAACAAGTGG AGCCAGAGCC ACGACGTCGC CAACCTGTTC GTCACCGACG GCGCGCAGAT GGCGTCCTCG GCCTGCCAGA ACCCTTCGCT CACCTACATG GCGCTGACCG CACGTGCCTG CGATGCGGCG GTCAGGATGC TGCGCGAAGG TGCGATCTGA
|
Protein sequence | MQFDAIVVGS GITGGWAAKE LTQAGLKVLM IERGREIVHG DYPTEMKTPW EMPFRGVGDA ALYAREYQVQ AQNRHFNEFT QGHFVNDKEN PYATGPDSEF NWLRSYQLGG RSLTWGRQAY RWSDYDFSAN KRDGNGTDWP IRYADLAPWY DKVEEFIGVS GAAEGLPQLP DGRFQPPMAL NAVERHVRQV VADRYGRCMT VGRVANMTQA KPDEGRSACQ NRSICARGCS YGAYFSTQSS TLPAAKATGN LTVVTDAIVE HVDYDPATKR VTGVRYVNTK DGSRGSATAR MVFLNASAFN SVHVLLNSRS EAMPNGLGNS SGVLGTQIMD HANTLSTIAL FPQFNGRTSF GNRPTGVVIA RYRNMDEMDG AGHTRGYSYQ GGALQSNWGA GKREAGIGAD FKDKLRTPGM WRMVLVAFAD CVPRDSNRLT LDPVKTDRFG IPQLRIDFAY GKEEQAALAQ AKADAAEMMT AAGGMVVMGS DQPGTGGMAI HEMGGARMGH DPKTSVLNKW SQSHDVANLF VTDGAQMASS ACQNPSLTYM ALTARACDAA VRMLREGAI
|
| |