Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1729 |
Symbol | |
ID | 3916304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1821701 |
End bp | 1823914 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444470 |
Product | Beta-glucosidase |
Protein accession | YP_497003 |
Protein GI | 87199746 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.830877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCTTT CCCGCCGCCT TCTCGTATCG GCTTCATCGG TTCTGGCCCT TTCCGCGACG GCCATCCACG CCGCGCCAGC CTCTCCCGCC GACATTGCCA AGGCCGACGC GCGCGCTGCC GCCACGGTCC GCCAGATGAC CGGCGAGGAA AAGGTCGTCC TCACGCACGG CATCATGCCG CTGCCGCTTG CAGGCCCCGT TCCCGGAATG CCCGCCGATG CGATTCCCGG CGCGGGCTAC ATCGCCGGCA TCGAGCGTCT CGGCATTCCT GCGCTGAAGG AAACCGACGC AAGCCTCGGC GTCTCCTACG TCATGGGCAT CCGCAAGGAC GGAGCCACCG CGCTCCCGTC GGGCGTCGCG CAGGCGGCTT CGTGGAACCC GGATCTCCTC TATCGCGGCG GTGCGATGAT CGGTTCGGAA GCGCGGGCCA AGGGCTTCAA CGTGCTGCTC GCCGGCGGCG TCAATCTCAT GCGCGACCCG CGCAACGGCC GCACCTTCGA ATATCTCGCT GAAGACCCCC TTCTGTCGGG AATGCTGGTC GGCGCCGCGA TCCGTGGCAT CCAGTCGAAC AACATCATTT CCACGATCAA GCACTTCGCA CTCAACGGCC AGGAAACCGG GCGCAAGTAC GTCGACGTGA AAATCGACGA AGGCGCCGCG CGCGAGAGCG ACCTGCTGGC CTTCCAGATC GGCATCGAGC AGGGCCAGCC GGGGTCGGTC ATGTGCGCCT ACAACCGCGT CTGGGGCGAG CAGGCTTGCG CCAACGACTG GCTGCTCAAC AAGGTGCTCA AGCAGGCCTG GGGCTACAAG GGCTTCGTCA TGTCCGATTG GGGCGCGGTC CCGAACATCG AGGCCGCGCT CAAGGGCCTC GACCAGCAAT CGGGCGAACA GCTCGATCCC GGCGTGTTCT TTGCCGACAA GCTCAAGGAA AAGGCGGCGT CCGATCCGGC GTACAAGGCG CGGCTCGATG ACATGAACCG CCGCATCCTG ACCGCGATCT ACGCCTCGGG CCTGGACAAG GCGCCGGCCA CGCCTGGCGG CAAGATCGAC TTCGCCGCCA ATGCCCAGGT GGCCGAGGAA GTGGCGAAGC AGGGCATCGT TCTGTTGAAG AACGACGGCG CGCTCCCGCT CGCCAAGTCG GCAAGGTCCA TCGCCATCAT CGGCGGCTAT GCCGACGGCG CGGTGCTTTC GGGTGCCGGC TCGAGCCAGG TACACGGCGA GGGCGGACCC GCCGTCGTCC GTCCGGTGGG CGGAAAGGGC GTCTGGGCGG GCTTCATTGC CCAGCAGTAC CACCGCTCCA GCCCGATGGA CGCGATCCAG GAGCTGGCGA AGGACGCGAA GGTCACGTTC CGCGATGGCC GCTACATCGC TGATGCCGTC GAGAAGGCGC GCCAATCGGA AGTCGCCATC GTCTTCGCCA CGCAGTGGCA GACCGAAGGC CTGGACGTGC CCGACCTCTC GCTTCCGGAC GGGCAGGACG AACTGATCGC TGCCGTTGCC GCCGCCAATC CCAGGACCAT CGTCGTCCTG GAAACCGGCG GCCCGGTCAA GATGCCGTGG CTCGACAAGG TCGCCGGCGT GATCGAGGCG TGGTATCCCG GCGCGCGCGG CGGCCCGGCC ATCGCTTCGG TCCTGTTCGG CGATACCAAC CCTTCCGGCC GCCTCCCGCT GACCTTCCCG AAGGACGAAA GCCAGCTCCC GCGTCCCCGG CTCGACGGCA GCGACTGGGT CGAGCCCGAC TTCTCGGGCA ACCCCAGTTC CGGAAGCGAC AAGCTGGTGT CCGACTACGA CATCGAGGGC TCTGATGTCG GCTATCGCTG GTTCGCGCGA AAGGGCCAGA AGGCGCTGTT CCCGTTCGGC CACGGCCTGT CCTACACCAC CTTCGAAAGC TCCGGCCTCA AGGTGAAGGG CCTCGCGGCC AGCTTCACCG TAAAGAACAC CGGCCAGCGC GCGGGCGACG ACGTGGCGCA GGTCTATCTG GTCAGCCGCA ACGGCCAGGC CAGGCAGCGC CTCGTCGCGT TTCAGCGCGT CAGTCTCGCA CCCGGCGCGT CGAAGTCCGT GACAGTCACC TTCGATCCGC GCATCCTGGC GGACTACAGG AACGGCGGCT GGGTCATGGG AGGAGGGGAA TATGCCTTCG CGCTCGGCAA GGACGCGGAG AACCTGTCCG CACCGGTGAA GGTCAAGCTC GCTGCAAAGA AGTGGAAGGA CTGA
|
Protein sequence | MFLSRRLLVS ASSVLALSAT AIHAAPASPA DIAKADARAA ATVRQMTGEE KVVLTHGIMP LPLAGPVPGM PADAIPGAGY IAGIERLGIP ALKETDASLG VSYVMGIRKD GATALPSGVA QAASWNPDLL YRGGAMIGSE ARAKGFNVLL AGGVNLMRDP RNGRTFEYLA EDPLLSGMLV GAAIRGIQSN NIISTIKHFA LNGQETGRKY VDVKIDEGAA RESDLLAFQI GIEQGQPGSV MCAYNRVWGE QACANDWLLN KVLKQAWGYK GFVMSDWGAV PNIEAALKGL DQQSGEQLDP GVFFADKLKE KAASDPAYKA RLDDMNRRIL TAIYASGLDK APATPGGKID FAANAQVAEE VAKQGIVLLK NDGALPLAKS ARSIAIIGGY ADGAVLSGAG SSQVHGEGGP AVVRPVGGKG VWAGFIAQQY HRSSPMDAIQ ELAKDAKVTF RDGRYIADAV EKARQSEVAI VFATQWQTEG LDVPDLSLPD GQDELIAAVA AANPRTIVVL ETGGPVKMPW LDKVAGVIEA WYPGARGGPA IASVLFGDTN PSGRLPLTFP KDESQLPRPR LDGSDWVEPD FSGNPSSGSD KLVSDYDIEG SDVGYRWFAR KGQKALFPFG HGLSYTTFES SGLKVKGLAA SFTVKNTGQR AGDDVAQVYL VSRNGQARQR LVAFQRVSLA PGASKSVTVT FDPRILADYR NGGWVMGGGE YAFALGKDAE NLSAPVKVKL AAKKWKD
|
| |