Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0789 |
Symbol | |
ID | 3915842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 837765 |
End bp | 839096 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640443519 |
Product | glycoside hydrolase family protein |
Protein accession | YP_496068 |
Protein GI | 87198811 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0245888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATC GTCGCAGCCT GATGGCTTCC GCCGTGGCTT TGAGCGCCTC AGTGATGACC AGCAAAAGCG CCTTGGCCCG GTCCGCCAAG CCGAGGCCGC TCGATCCGCA GTTTCCCGAA GGCTTCCTGT GGGGCGCAGC CACCGCCGCG CATCAGATCG AAGGCAACAA TCTCAACGCC GACCTCTGGG TGATCGAAAA CGTCCCCGGC ACCATTTTTG CAGAACGGTC GGGCGACGCC GCAAACAGCT TCGAACTGTG GCCGGTCGAC CTTGATCTCG TGAAGGGCAT GGGGCTCAAT TCCTATCGCT TCAGCCTCGA ATGGGCGCGG ATCGAGCCGG ATGAAGGGCA TTTCTCCAAT GCCATGCTCG ATCACTACAA GGCGATGATC GAGGGTTGCC GGGCGCGGGG GCTCAAGCCG GTCGTCACCT TCAACCATTT CACCACCCCG CGCTGGTTCG CAGCCAAGGG CGGATGGCAT AATCCGGAGT CATCGGCGCT CTTCGCCCGC TTCTGCGAAC GGGCGGCGCG CCATCTTGCG GCAGGAATCG AACTCGCCAC AACATTGAAC GAGCCCAATC TGGCTGGCGT GATCGGCGAG ATCCTGCCGC CGCCACTGGT GGCAGGCGAC AAGGCAACGC AAGAAGCAGC AGCGAAGCAG CTCGGCGTGG CGCTTTATAC GCCCGGCGTC GCGCTCTACA TCAAGGAGCC GAAGACCTAT CGCGCCAACA TGATGGAAGG CCATCGCCGT GGGGTCGCCG CAATCAAGGC GGTGCGGCCC GACCTGCCGG TTGGCGTAAG CCTGGCGATG ATCGACGATC AGGCGGTTGG CAAGAACTCG ATGCGCGACC GGATCCGCGA ACGCTACTAC AACGAGTGGC TGCGCCTTGC GGGCGAGACT TGCGATTTCA TCGGTGTGCA GAACTACGAA CGCAAGGTCT GGACCGACAA GGGCGAGTTG CCGTCCCCCG CCGATGCGCG GCGCAATACT GGCGGCGCTG AAGTCTGGCC CGGGTCGCTG GCGGGCGCGG TGCGCTATGC CCATGCAGTG ACCAAGCTGC CGGTCTATGT CACCGAACAC GGCGTCAATT CCGACGACGA CGCGCTGCGC CAGTGGTTGA TCCCCGAAGC GCTTACCGAA CTGAAGCGTG CGATCGACGA TGGTGTGCCG GTGCGCGGCT ATATCCACTG GTCGCTGATC GACAATTTCG AATGGGGCTT CGGCTACAAG TACCGCTTCG GCTTGCACTC GTTCGACCAA AGCACCTTCC AGCGAACCGC CAAACCCAGC GCAGCAATTC TGGGCAGAAT TGCACGGCGA AATAGGCTCT GA
|
Protein sequence | MIDRRSLMAS AVALSASVMT SKSALARSAK PRPLDPQFPE GFLWGAATAA HQIEGNNLNA DLWVIENVPG TIFAERSGDA ANSFELWPVD LDLVKGMGLN SYRFSLEWAR IEPDEGHFSN AMLDHYKAMI EGCRARGLKP VVTFNHFTTP RWFAAKGGWH NPESSALFAR FCERAARHLA AGIELATTLN EPNLAGVIGE ILPPPLVAGD KATQEAAAKQ LGVALYTPGV ALYIKEPKTY RANMMEGHRR GVAAIKAVRP DLPVGVSLAM IDDQAVGKNS MRDRIRERYY NEWLRLAGET CDFIGVQNYE RKVWTDKGEL PSPADARRNT GGAEVWPGSL AGAVRYAHAV TKLPVYVTEH GVNSDDDALR QWLIPEALTE LKRAIDDGVP VRGYIHWSLI DNFEWGFGYK YRFGLHSFDQ STFQRTAKPS AAILGRIARR NRL
|
| |