Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2793 |
Symbol | |
ID | 4023291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3109846 |
End bp | 3110871 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962991 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_569922 |
Protein GI | 91977263 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0138787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.035915 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCGC GCGCTTTCAT CACCGGCATC TCCGGCCTGG GCCTTACCGA CGACGAGCGC GAGTTTTTGC GCGAAACCCG TCCGTGGGGC TTCATCCTGT TCAAACGCAA CGTCGACAAT CCCGCGCAAG TCGCTGGATT GGTTCGTGAA CTGCGCAAGG AAGTCGGCCA GCCCGACGCC CCGGTCCTGA TCGATCAGGA AGGCGGGCGG GTGCAGCGTC TCGGCCCGCC GCATTGGCCC GTCTATCCGC GCGGCGCGGT GTTCTCGGCG CTATACGATA TCGATTCGCA GCTCGGGCTC ACCGCGGCGC GGCTCAGCGC GCGTCTGATC GCAGCCGATC TGGCCGATCT CGGCATCACC GTCGACTGCC TGCCGCTCGC CGACGTGCCG ATCGCCGGCG CCGACGCGGT GATCGGCGAT CGCGCTTACG GTACCGAACC CGCCAAGGTG GCGGCGATCG CCCGCGCGGT GACCGAGGGC CTGGAGCAGG GCTCGGTGTT GCCGGTTCTC AAACACATTC CCGGGCACGG GCGGGCGACG GCGGACAGCC ATTTCCGGCT TCCGACCGTG GACACGGCGC GGCAGGAGCT TGAGCGTTCC GACTTCGCGG CGTTCCAGCC GCTCGCCGAT CTGCCGATGG CGATGACTGC ACATGTTGTG TTCAGCGATC TCGATCCCGC CCAACCCGCG ACGACATCTG CGACAATCAT CGATCAGGTG ATTCGCGGAC GAATCGGGTT CCAGGGACTG CTGATGAGCG ACGACGTCTC GATGAATGCG CTGGAAGGGA CGATCGCCGA TCGCACCCGG GCCATCGTCG CGGCGGGATG CGACGTCGTC CTGCATTGCA ACGGCAAGCT CGACGAGATG CGGCAGGTCG CCGCCGAGAC GCCGGAACTG GCAGGCAAGG CGCTGCAACG CGCCGAGGCG GCGCTGGCGT CGCGCAAGCC GCCGCAGATT TTCGACCGCG CCGCCGCGCG CGCCGAACTC GAAGCCTTGA TCGGGCGCGC AGGGATAAGC GCATGA
|
Protein sequence | MTSRAFITGI SGLGLTDDER EFLRETRPWG FILFKRNVDN PAQVAGLVRE LRKEVGQPDA PVLIDQEGGR VQRLGPPHWP VYPRGAVFSA LYDIDSQLGL TAARLSARLI AADLADLGIT VDCLPLADVP IAGADAVIGD RAYGTEPAKV AAIARAVTEG LEQGSVLPVL KHIPGHGRAT ADSHFRLPTV DTARQELERS DFAAFQPLAD LPMAMTAHVV FSDLDPAQPA TTSATIIDQV IRGRIGFQGL LMSDDVSMNA LEGTIADRTR AIVAAGCDVV LHCNGKLDEM RQVAAETPEL AGKALQRAEA ALASRKPPQI FDRAAARAEL EALIGRAGIS A
|
| |