Gene Information |
Locus tag | Saro_1611 |
Symbol | |
ID | 3918719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1679091 |
End bp | 1681526 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444351 |
Product | Beta-glucosidase |
Protein accession | YP_496885 |
Protein GI | 87199628 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
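The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with that convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1679091
end_bp = 1681526

# An inclusive interval contains (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 2436
print(remainder)       # 0
print(protein_length)  # 811
```

Both results agree with the Gene Length (2436 bp) and Protein Length (811 aa) rows of the table.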
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGC GAGCACACCT GCGGGGCCTG GCGCTAATCC TGCTGACCGG CAGCAGCCTC GCCGCAGCAG CCGACCGGGT TGATGTCCAG GCGGCGCCTT CAGGCATTGC CAACCCTGCG GCGTGGCCCG CCGCGCACAG CCCATCCGCC ATCACCGACC CCGCGACGGA GCGGGCAATT ACCCGCATTC TCAAGCGCAT GACGCTGGAG CAGAAGGTGG GGCAAGTCAT CCAGGGCGAC ATCAGCTCGA TCACGCCTGC CGATCTGGAG CGATATCCGC TGGGCTCGAT CCTCGCTGGC GGGAACAGCG GCCCCTACGG CAACGAACGC GCCGACGCCG CCACCTGGCT GCGCCTCGTC AACGAATTCC GCGCAGCTTC GCGCAAGGCG GGGGCGGGGG TGCCGATCCT GTTCGGTGTC GATGCCGTCC ACGGCCACTC CAACATTCCC GGAGCGACGA TCTTCCCGCA CAACGTCGGC CTCGGCGCGA CACGCGATGC GGATCTGATC CGCCGGATCG GCCAGGCAAC CGCTGCCGAG GTGGCGGGGT CGGGTATCGA GTGGACCTTT GCGCCGACCC TGGCGGTGCC TCAAGACTTG CGCTGGGGGC GCGCGTACGA GGGCTATTCA AGCGATCCGC AGGTCATCGC CCGCTACGCT CCGGCGATGG TAGAGGGCCT TCAGGGCACT TTAGGTGCCG TTCGGGTCTT GCCGTCGAAC CGCGTCGCGG CAAGTGCGAA GCACTTTCTG GCCGATGGCG GGACCGAGAA CGGCAAGGAT CAGGGTGATG CCAAGCTCTC TGAAGCCGAC CTCGTGCGTA TTCACGCACA GGGCTATCCC CCCGCCATCG ATGCCGGTGC GCTGACGGTA ATGGCAAGCT TTTCAAGCTG GAACGGGATC AAGAACCACG GCAACCGCTC GCTCCTGACC GATGTCCTCA AGAAGCGCAT GGGTTTCGAG GGGCTGGTCG TGGGTGACTG GAACGGTCAT GGTCAGATTC CGGGATGCAC CACCACCGAT TGCCCGTCGG CCCTCAACGC CGGCCTCGAT CTCTACATGG CGCCCGATAG CTGGAAAGGC TTGTTCGACA ATACCTTGCG GGAGGTTCGT GAAGGGAAGA TCAGCAAGAC CCGGCTGGAC GACGCCGTAC GCCGCATCCT GCGCGTGAAA TTCAAGCTGG GTCTGATGGG GCCCAGACTG GTCGAGCGCG GCGATCCCGC GGCGGTTGGC GCCGATGCCC ATCTCGAAAT CGCGCGGGAA GCAGTCGCCA AGTCATTGGT CTTGCTCAAG AATGAGGGTG GCGTCTTGCC CATCCGTCCC GGTGCGAGGG TGCTGGTCAC CGGGCCCGGG GCCGACAACA TGGCAATGCA GGCGGGCGGG TGGACGATCA CCTGGCAGGG TACGGACACC AGCGCCGCCG ATTTTCCCAA GGGCCGCACC ATCGGTCGGG CGATCTCGGA GACTGTCGCC GAGGCCGGGG GCAAGGCGGA GATTGCTTCC GATCTGCCTC CGGGGGCCAT GCCCGATGTC GCGGTCGTCG TTTTCGGTGA GCAACCCTAC GCCGAATTCC AGGGTGATGT GCCGAACCTC GATTTTCACG CGCGGGCGGG TGAACTGGAC CTGATCAAGC GTCTGAAAGC GCGGGGTATT CCCGTAGTGG CCCTGTTCCT TTCCGGGCGT CCGATGTTCG TCGGGCCTGA AATGAACCTT GCCGACGCAT TCGTGGCGGC GTGGCAGCCC GGGTCGCAGG GGCAGGGCGT TGCCGACGTC CTCGTTGCGC GCAAGGACGG CAAGCCAGCG 
CGCGATTTCA CTGGCACGTT GCCGTTCGCA TGGCCGCAGG ACGCGCGCTC TCCGCTGGTC GATCCGCTTT TCCCGCTGGG CTACGGCCTA TCCTATGCAA GGCCGGGCAA GGTCGGTCCG GTGAACGAGG ACGCCCGGGT CGAGCACGGG CCAGCCATGA GCGAGACCAC CTTCATCCGT CACGGCAAGG TGATCGAGCC GTGGCGGCTG GGCCTGGACA GTGTCGTGTC GACCCGCGCG GTCGACGTTG CCGCACAGGA GGATGCGCGC CAGTTCCGCT GGGCCGGGCA GGGCGCGATC GCGCTCGATG GTCCGCCGGT CGACATGGAG CGCCACCGCA ACGGCGGCTT CGTCATGCGG CTGGACTGGC GGATCGATGC GCGCGGAACC GGGCCGGTGA CCGTCGCTCT GGGAGGATCG CGGCTCGATG TGACCTCGAT CGTCGATGCC AGCGTGGCGG GCAAGCCGGT GGCGCTACGG ATACCGCTTC AGTGCTTTGC CGCGAACGGT GCGACGCTGA AGGAAGTGGG GCAGCCACTG CGCATCGGTG CCGACGCCGG CTTTACCGCA AGCATTCGCA ACGCCGGTAT CGAAGGCGTG GGCGAAAGCA TCCCGTGTCC GCGCCCGGCG CGCTGA
Protein sequence | MPMRAHLRGL ALILLTGSSL AAAADRVDVQ AAPSGIANPA AWPAAHSPSA ITDPATERAI TRILKRMTLE QKVGQVIQGD ISSITPADLE RYPLGSILAG GNSGPYGNER ADAATWLRLV NEFRAASRKA GAGVPILFGV DAVHGHSNIP GATIFPHNVG LGATRDADLI RRIGQATAAE VAGSGIEWTF APTLAVPQDL RWGRAYEGYS SDPQVIARYA PAMVEGLQGT LGAVRVLPSN RVAASAKHFL ADGGTENGKD QGDAKLSEAD LVRIHAQGYP PAIDAGALTV MASFSSWNGI KNHGNRSLLT DVLKKRMGFE GLVVGDWNGH GQIPGCTTTD CPSALNAGLD LYMAPDSWKG LFDNTLREVR EGKISKTRLD DAVRRILRVK FKLGLMGPRL VERGDPAAVG ADAHLEIARE AVAKSLVLLK NEGGVLPIRP GARVLVTGPG ADNMAMQAGG WTITWQGTDT SAADFPKGRT IGRAISETVA EAGGKAEIAS DLPPGAMPDV AVVVFGEQPY AEFQGDVPNL DFHARAGELD LIKRLKARGI PVVALFLSGR PMFVGPEMNL ADAFVAAWQP GSQGQGVADV LVARKDGKPA RDFTGTLPFA WPQDARSPLV DPLFPLGYGL SYARPGKVGP VNEDARVEHG PAMSETTFIR HGKVIEPWRL GLDSVVSTRA VDVAAQEDAR QFRWAGQGAI ALDGPPVDME RHRNGGFVMR LDWRIDARGT GPVTVALGGS RLDVTSIVDA SVAGKPVALR IPLQCFAANG ATLKEVGQPL RIGADAGFTA SIRNAGIEGV GESIPCPRPA R
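The relationship between the gene sequence, the protein sequence, and the GC content can be sketched in a few lines of Python. This is a hedged illustration, not the method used to generate the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, a detail this sketch ignores because the gene here begins with ATG). Only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCGATGC GAGCACACCT GCGGGGCCTG"
print(translate(prefix))  # MPMRAHLRGL
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence is the kind of calculation behind the GC content row, although the record does not state how that figure was rounded.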