Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1586 |
Symbol | |
ID | 4896856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1667133 |
End bp | 1668146 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640112177 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001043468 |
Protein GI | 126462354 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.245014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGG CGGGCCGGAC GGCCGCGATT TTCGGCTGCT CCGGCCCGGT CCTCACGGAT GCCGAGCGGC AGTTCTTCCG CGAGGCCGAT CCCTTCGGCT TCATCCTCTT CGCACGCAAC ATCGACGACC CCGCGCAACT CCTTGCCCTG ACGCAGGAAA TGCGTTCCAC CGTGGGGCGC GACGCGCCCG TCTTCGTCGA TCAGGAAGGC GGCCGCGTCC AGCGTCTCCG CGCGCCGTAC TGGCGCGAGT GGCTGCCGCC CCTCGAGGCG GTGGAGCGCG CCGGAGACCG GGCCGCCCGG ATGCTCTGGC TGCGCTACCG GCTCATCGCC GAGGACCTGC GGGCGGTGGG CATCGACGGC AACTGCGCGC CCGTGGCCGA CATCCGCACC GCGGCGACCC ATCCGTTTCT CGCCAACCGC TGCCTCGCCG ACGAGGCCGC GCGCGTGGCG GAGCTTGCCC GCGCGGCGGC CGAGGCGCAT CTGGCGGGCG GGGTCCTGCC GGTGATGAAG CATCTGCCGG GCCACGGACG GGCCGCGGCC GACACGCACC ACGACCTGCC CACGGTGACC GCCAGCCGCG AGGAGCTGGC CGCCACCGAC TTCGCGGCCT TCCGGGCGCT TGCGGATCTG CCCTTGGCCA TGACGGGCCA TGTGGTCTTC TCCGCCTATG ATGCGCAGCC TGCGACCCTC TCGGCGCCCA TGGTCGGCGT CATCCGCGAG GAGATCGGCT TTTCCGGCCT TCTCATGACG GACGATCTGT CGATGCAGGC CCTCTCGGGC GGGATCGGCG CGCGGGCGGG GGCGGCCATC GCGGCCGGCT GCGATCTGGC GCTCCATTGC AACGGCGAAC TGGCCGAGAT GGAGGCCGTG GCCGCCGCCG CGGGCGCGAT GGGGCCCGGG GCGCTGGAGC GCGCCGCAGC GGCGCTGGCC CGCCGCAGGC CGCCCGAGCC GGTTGACAGC CGGGCGCTCG AGGCCGAACA TTCCGTCCTT CTGGGCGGGC ATGGGCATGG CTGA
|
Protein sequence | MSEAGRTAAI FGCSGPVLTD AERQFFREAD PFGFILFARN IDDPAQLLAL TQEMRSTVGR DAPVFVDQEG GRVQRLRAPY WREWLPPLEA VERAGDRAAR MLWLRYRLIA EDLRAVGIDG NCAPVADIRT AATHPFLANR CLADEAARVA ELARAAAEAH LAGGVLPVMK HLPGHGRAAA DTHHDLPTVT ASREELAATD FAAFRALADL PLAMTGHVVF SAYDAQPATL SAPMVGVIRE EIGFSGLLMT DDLSMQALSG GIGARAGAAI AAGCDLALHC NGELAEMEAV AAAAGAMGPG ALERAAAALA RRRPPEPVDS RALEAEHSVL LGGHGHG
|
| |