Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4513 |
Symbol | |
ID | 4597032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4773409 |
End bp | 4774575 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639779124 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_925697 |
Protein GI | 119718732 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.662398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGT CGCCTCGTCC CGTCCTGGTC GCGGTCGCGG CCGCTGCCCT GGGGGCCGGG CTGCTGGCGC CCGCCAGCGG GTCCGCGCAC GCGCCTGCGG TGAGCGGCCG GCACCAGGAG ACCCGGACGG CGGCGTCGGT GTATCGGGAG ATGAGCCTGG CCCAGCGGGT CGGGCAGCTG TTCATGGTCG GCACCCCCGC GGACCGGGTC GACCGCCGTA CCCGCGCCCA GATCCATCGC TTCCACGTCG GCAACGTGAT GCTGACCGGC CGCAGCTATG ACGGGGTCCG CGCGCCGGCC CGGGTGTCCC GGGCGATGCG CGGCGAGGTC GACGGGAGGT CGACCGCCGG CGTCCGGCTC TTCGTCGCGA CCGACCAGGA GGGCGGCCAG GTCCGGGTGT TGCAGGGGCC CGGCTTCTCC GACATCCCCT CGGCCCTGGA GCAGGGCACC TGGCAGCCGC GCCGGCTGCG TGGCGCCGCG AAGTTGTGGG CCGGGCAGCT GCGCCGGGCC GGCGTGAACC TCGACCTGGC GCCGGTGATG GACACCGTTC CCAGCCGGCG GGCGGCTCGG CACAACCCGC CGATCGGCCG CTACGACCGC GAGTTCGGCT TCACGACCAA GGTCGTCGCC CGGCACGGGG TGGCGTTCCT CAACGGCATG GCCGACGGCG GCGTCGTACC GACGGCGAAG CACTTCCCCG GCCTGGGCCG GGTCCACGCG AACCCCGACA CCCACGCCGG CGTCACCGAC CGGGTCACGA CCCGGCACGA CGCCTACCTG CGGCCGTTCG GGGCGGCGAT CGACGCGGGC GTCCCGATCG TGATGATGTC GACGGCGTAC TACGAGCACC TCGACCCGCG GAACCCCGCG GCGTTCTCAC CGTTCGTGGT CGGCACCATG CTGCGCGGCG ACCTCGGGTT CCGCGGCGTG GTCATCTCCG ACGACCTGGC CCGGGCCCGG CAGGTCGCGG GCTTCAGCCC GGCCGGCCGG GCACTGCGGT TCATCGGCGC GGGTGGCGAC ATCGTGCTCA GCGTCGATGC CGACCCGGTG GGGGAGATGT ACCGCGCGGT CCTCGAGCGC GCCCGGACCA GCGAGCGGTT CCGCGCCAAG GTCGACGCGG CGGTGCTGCG GGTGCTGCGC GCCAAGCAGG ACCGGCACCT GCTGTGA
|
Protein sequence | MSSSPRPVLV AVAAAALGAG LLAPASGSAH APAVSGRHQE TRTAASVYRE MSLAQRVGQL FMVGTPADRV DRRTRAQIHR FHVGNVMLTG RSYDGVRAPA RVSRAMRGEV DGRSTAGVRL FVATDQEGGQ VRVLQGPGFS DIPSALEQGT WQPRRLRGAA KLWAGQLRRA GVNLDLAPVM DTVPSRRAAR HNPPIGRYDR EFGFTTKVVA RHGVAFLNGM ADGGVVPTAK HFPGLGRVHA NPDTHAGVTD RVTTRHDAYL RPFGAAIDAG VPIVMMSTAY YEHLDPRNPA AFSPFVVGTM LRGDLGFRGV VISDDLARAR QVAGFSPAGR ALRFIGAGGD IVLSVDADPV GEMYRAVLER ARTSERFRAK VDAAVLRVLR AKQDRHLL
|
| |