Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1719 |
Symbol | |
ID | 3916294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1807270 |
End bp | 1809192 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640444460 |
Product | glycoside hydrolase family protein |
Protein accession | YP_496993 |
Protein GI | 87199736 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGGCT TGAAGATTTC TGCACCAATG GTGCTGCTTG CCGGTGTCGG GGCGGTCGCG CTGGGGGCAT CCGCGTCGGC AAGGCCCGCC TTTCGCGACC TGAACCACAA TGGCCGGCTC GATCCATACG AGAACGTCGC GCTGCCGGTC GAACGGCGGC TCGACGATCT GCTCAAGCGG ATGACGCTGG AGGAAAAGGT CGCGCTCATG CTCCACGGAA CGCTTCAGGC CGAAGGCGGA CGCGGCATCG GGGTGGGCAA TGCCTACGAC ACGGCCCTTG CGCAGGAACT CCTTGCGCGC GGGGTCAACA GTTTCATCAC CCGCATCGCA CCCGAACCCC GCGCCTTCGC GACGCAGAAC AATGCGATCC AGCGGCTTGC CGAGGCGACC CGGCTCGGCA TTCCCGCCAC GATCAGCACC GATCCGCGCA ATCACTTCCA GGTAGTGGTC GGCGCAAGCA GCGATGCCAG CGGCTTCTCC AAATGGCCCG AGACGCTCGG CATGGCCGCC ATCGGCGACG AGAAGCTGGT TGAGCGGTTC GGCCGACTGG TCGCCGCCGA ATATCGCGCC GTCGGCATCC AGATGGCCCT GTCGCCACAG GCCGACCTCT ATACCGAGCC ACGCTGGCCG CGCGGCAACG CGACGTTCGG CTCGGACCCG GCCACGGTCT CGCGTCTTGC CGGTGCATAT GTGCGCGGGT TCCAGGGCGG CGCGGACGGA CTCGTCCGCT CTGGCGTCGC GACCGTGGTC AAGCACTGGG TCGGCTATGG CGCCGAGCCC GAGGGCTTTG ACGGCCACAA CTATTACGGC CGCATCGCCC GTCTCGACAA CGAAAGCTTC GCCCAGCACG TTGCCGCTTT CGAAGGCGCG CTCGCCGCGA AGTCGGCAGG CGTAATGCCC ACCTATGTCA TCGCGGAGGG CGTGAGCATC GATGGCAAGC CGCTGCCGCA GGTCGGCGCG GGCTTCAGCA AGCCTCTGAT CGAGGGACTT CTGCGCGGCA CCCACAAGTT CGGCGGCATC GTCATTTCCG ACTGGGGCAT CACCAACACC TGCCCCGAAC AGTGCAGCAA CCCGACGGCC GAAAAGCCCC AGGGCTTTGC GATCGCGATG CCGTGGGGCG TCGAGGGTCT GTCCGAGGAA GACCGCTACG CGCTGGGCGC GAATGCCGGG ATCGACCAGT TCGGCGGCGT CGACAATCCA GGCCCTCTGC TTGCCGCCGT CCGTGCCGGG AAGGTTTCGC CCGCCCGGGT CGATCAGGCC GCCCGCCGCG TGCTTCGCCT GAAGTTCGAG CTTGGCCTGT TCGACGATCC CTATGTCGAC GTCGAGAAGG CAGCGCAGAT CGTCGGCAAC AAGGCGACGC AGGCCGAGGC CGATGCAGCC CAGCGCGCCG CGCAGGTGCT GCTGTTGAAC CGCAACGCGC TTCTGCCGCT CGCGCCGGGC CGAAAGGTTT GGCTCTCCGG TGTCGACGCC AGTGCCGCGC GTGCTGCCGG GCTGGTCGTG GTGGACACCG CCGAACAGGC CGAGGTGGCC ATCGTTCGCG TTGCCACCCC GCACGAAGTC CTTCATCCTC ACCACTTCTT CGGCTCGCGC CAGAACGAGG GCCGCCTCGA CTTCCGTGCT GGAGACGAGG CGACCAGAAC CATCGCCGCT GCCGCTGCGC GCATCCCGAC CGTTGTGGCC GTCGACCTCG ATCGTCCTGC CGTGCTGACC GAGGTGAAGG ACAAGGCTAC CGCGCTCTTC GGCCTGTTCG GCGCCAGCGA TGCCGTGTTG CTGGATCTCG TCACCGGCAA GGCCCGGCCG GCGGGCAAGC TGCCGTTCGA ACTGCCGTCC TCGGCAAAGG CGGTCGAGGA TCAGCATCCG GGCCGACCGG ACGACAGCGC CAACCCGCTC TTCCGGCGCG GTGACGGGCT GACCTATCCC TGA
|
Protein sequence | MGGLKISAPM VLLAGVGAVA LGASASARPA FRDLNHNGRL DPYENVALPV ERRLDDLLKR MTLEEKVALM LHGTLQAEGG RGIGVGNAYD TALAQELLAR GVNSFITRIA PEPRAFATQN NAIQRLAEAT RLGIPATIST DPRNHFQVVV GASSDASGFS KWPETLGMAA IGDEKLVERF GRLVAAEYRA VGIQMALSPQ ADLYTEPRWP RGNATFGSDP ATVSRLAGAY VRGFQGGADG LVRSGVATVV KHWVGYGAEP EGFDGHNYYG RIARLDNESF AQHVAAFEGA LAAKSAGVMP TYVIAEGVSI DGKPLPQVGA GFSKPLIEGL LRGTHKFGGI VISDWGITNT CPEQCSNPTA EKPQGFAIAM PWGVEGLSEE DRYALGANAG IDQFGGVDNP GPLLAAVRAG KVSPARVDQA ARRVLRLKFE LGLFDDPYVD VEKAAQIVGN KATQAEADAA QRAAQVLLLN RNALLPLAPG RKVWLSGVDA SAARAAGLVV VDTAEQAEVA IVRVATPHEV LHPHHFFGSR QNEGRLDFRA GDEATRTIAA AAARIPTVVA VDLDRPAVLT EVKDKATALF GLFGASDAVL LDLVTGKARP AGKLPFELPS SAKAVEDQHP GRPDDSANPL FRRGDGLTYP
|
| |