Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2870 |
Symbol | |
ID | 3915509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3092342 |
End bp | 3094459 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445649 |
Product | Pyrrolo-quinoline quinone |
Protein accession | YP_498140 |
Protein GI | 87200883 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.489222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTGA AGGTCCTGGG ACTTATGGCC GCACTGCTGC CGCTGGCGGC GTGCAACATC AAGAGCGAGG GCGGCGGGGA TGCCGTCGCC AACGCCGGCG TCACCGACGC CCTCATCGCC CAGGCGCCCG AAGGCGAATG GCTGAGCTAT GGCCGCGATT ATGGCGAGCA GCGCTTCTCC CCGCTCACCC AGATCAATGA CGGCAACGTC GGCCAGCTCG GTCTTGCCTG GTTCCATGAC CTCGAGACCG CGCGCGGGCA GGAAGCCACG CCGCTGATGC ATGACGGCAC GCTCTACATC TCGACCGCGT GGTCGATGGT GAAGGCGTTC GATGCCAAGA CCGGCGCGCT CAAGTGGTCC TACGATCCCG AAGTCCCGCG CGAGACGCTG GTCCGCGCCT GCTGCGACGC GGTCAATCGC GGCGTCGCGC TCTATGGCGA CAAGGTCTTC GTCGGCACGC TCGACGGTCG CCTCGTCGCG CTCGACCAGA AGACCGGAAA GGTCGTCTGG TCCAAGGTCG TCGTGCCCAA CCAGGAGGAC TACACCATCA CCGGCGCCCC GCGCGTGGTG AAGGGCAAGG TTCTGATCGG TAGCGGCGGC TCGGAGTACA AGGCGCGCGG CTATATCGCC GCCTACGACG TCAACACCGG CAACGAAGTG TGGAAGTTCC ACACCGTCCC CGGCAATCCA GCGGACGGGT TCGAGAACAA GGCGATGGAA AACGCCGCCC GCACGTGGGC TGGCGAATGG TGGAAGCTCG GCGGGGGCGG CACGGTGTGG GATTCCATCA CCTATGATCC CGCCACCAAC CTCGTCCTGT TCGGCACCGG CAATGCCGAG CCATGGAACC CGGCCGCAGC CGGGCGCGAG GGCGACAGCC TCTACACGTC CTCGATCGTC GCGGTGAATG CCGATACCGG CGACTATGTC TGGCATTTCC AGGAAACCCC GGAAGACCGC TGGGACTTCG ACTCCGCGCA GCAGATCACG CTGGCCGACC TGACCATCGA TGGGCAGCGG CGCCACGTGA TCCTTCATGC GCCCAAGAAC GGTCACGTCT ATGTGCTCGA CGCCAGGACC GGGCAGTTCC TGTCGGCAAC GCCCTTCGTG ATGGTCAACT GGGCGACCGG TATCGATCCC AAGACCGGCA AGGCCACCGT CAATCCCGAA GCCCGCTATG AAAAGACCGG CAAGCCCTTC GTCAGCCTGC CCGGTGCGGT CGGCGCACAC TCCTGGCAGC CGCAGAGCTT CAGCCCGAAG ACCGGCCTGC TCTACCTTCC GGTGAACAAC GCGGCATTCC CCTATGCCGC CGCCAAGGAC TGGAAGGCCA CCGACATCGG CTTCCAGACC GGCCTCGACG GCTATGTCAC CTCGATGCCC GCCGACGCCA AGGTCCAGGG CGCCGCGATG AAGGCGACCA CCGGTACGCT CGTGGCGTGG GACCCGGTTG CGAAGAAGGC CGCGTGGAAG GTCGAACTGC CGAGCCCGTC GAACGGCGGC ATTCTCTCGA CCGCTGGCAA TCTCGTGTTC CAGGGCACCG CGGGCGGCGA TTTCGTTGCC TACAACGCCG ACAAGGGCAA GCAGCTCTGG TCGTTCCCGG CGCAGAGCGG CATCCTTGCC GCGCCGATGA CCTATGCGAT CGACGGCGAA CAGTACGTCG CGGTCATGGT CGGCTGGGGC GGCGTGTGGG ACGTCGCCAC CGGCGTCCTC GCGCACAAGG CCAAGAAGCA GCGCAACATC AGCCGCCTGG TCGTGTTCAA GCTCGGCGGC AAGGCCACGC TGCCCGCTGC TCCCCCGATG GCCAAGATGG TCCTCGATCC GCCGCCGTTC ACCGGCACGC CCGAACAGGC CAAGGCCGGC GGCGAACTCT ACGGACGCTA CTGCAACGTC TGCCACGGCG ATGCGGCGGT TGCGGGCGGC GTGAACCCCG ATCTGCGTCA CTCCGCGGCG CTCAATGCCC CCGAGGCGAT CCGCTCGGTG GTGATCGAGG GCGCGCTGCA GCACAACGGC ATGGTCTCGT TCAAGTCGGC GCTGAAGCCC GAGGATGCGG ACAACATCCG CCACTACCTC ATCAAGCGCG CCAACGAGGA CAAGGCTCTC GAAGCCAAGG GCGGCTGA
|
Protein sequence | MRLKVLGLMA ALLPLAACNI KSEGGGDAVA NAGVTDALIA QAPEGEWLSY GRDYGEQRFS PLTQINDGNV GQLGLAWFHD LETARGQEAT PLMHDGTLYI STAWSMVKAF DAKTGALKWS YDPEVPRETL VRACCDAVNR GVALYGDKVF VGTLDGRLVA LDQKTGKVVW SKVVVPNQED YTITGAPRVV KGKVLIGSGG SEYKARGYIA AYDVNTGNEV WKFHTVPGNP ADGFENKAME NAARTWAGEW WKLGGGGTVW DSITYDPATN LVLFGTGNAE PWNPAAAGRE GDSLYTSSIV AVNADTGDYV WHFQETPEDR WDFDSAQQIT LADLTIDGQR RHVILHAPKN GHVYVLDART GQFLSATPFV MVNWATGIDP KTGKATVNPE ARYEKTGKPF VSLPGAVGAH SWQPQSFSPK TGLLYLPVNN AAFPYAAAKD WKATDIGFQT GLDGYVTSMP ADAKVQGAAM KATTGTLVAW DPVAKKAAWK VELPSPSNGG ILSTAGNLVF QGTAGGDFVA YNADKGKQLW SFPAQSGILA APMTYAIDGE QYVAVMVGWG GVWDVATGVL AHKAKKQRNI SRLVVFKLGG KATLPAAPPM AKMVLDPPPF TGTPEQAKAG GELYGRYCNV CHGDAAVAGG VNPDLRHSAA LNAPEAIRSV VIEGALQHNG MVSFKSALKP EDADNIRHYL IKRANEDKAL EAKGG
|
| |