Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2748 |
Symbol | |
ID | 3910541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3131522 |
End bp | 3132547 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884648 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_486361 |
Protein GI | 86749865 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.429307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.273595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCGC GCGCCTTCAT CACCGGGATA TCCGGCCTTG GCCTCACAGA CGACGAGCGC GGATTTATCC GTGAATCGCG TCCGTGGGGT TTCATCCTGT TCAAGCGCAA CGTCGACAAT CCGGCGCAAG TTTCCGGATT GGTTCAGCAA TTGCGCACCG AGGTGGGCCA GCCGGACGCC CCGGTCCTGA TCGATCAAGA GGGCGGGCGG GTGCAGCGGC TCGGCCCGCC GCATTGGCCG GTATATCCGC GCGGCGCGCT GTTCTCGGCA CTTTACGATA TTGATTCGCA GCTCGGCCTC GCAGCGGCGC GTCTCAGCGC CCGGCTGATC GCGTCCGACC TGTTCGATCT CGGCATCACC GTCGACTGCT TGCCGCTGGC CGACGTGCCG GTTTCGGGCG CCGATGCCGT GATCGGCGAT CGCGCCTATG GCTTCGCGGC CGCCAAGGTG GCGGCGATCG CCCGTGCGGT CACCGAGGGC CTGGAGCAGG GCTCCGTGCT GCCGGTTCTC AAACATATCC CCGGCCACGG ACGTGCCACT GCAGACAGCC ATTTCCGCCT GCCGACGGTC GATACGTCGC GAGACGAGCT CGAACGCACC GATTTCTCCG CGTTCCAGCC GCTCGCCGAT CTGCCGATGG CGATGACCGC ACATGTTGTG TTTAGCGCCC TCGATCCCGC CCAACCCGCG ACCACTTCTG CGACAATCAT CGATCAGGTG ATTCGCGGAC GAATCGGGTT CCAGGGACTG CTGATGAGCG ACGACGTCTC GATGAATGCG CTGGAGGGCA CGATTGCGGA GCGCGCGCGC GCGAGCATCG CGGCGGGCTG CGACATCGTC CTGCATTGTA ACGGTAAGCT CGACGAAATG CGGCAGGTGG CCTCCGAGGC CCCGGAATTG ACCGGGCTGG CGCTGCAACG CGCTACGGCG GCGCTGGCGT CGCGCAAGCC GCCGCAACCG CTCGATCGCC GCGCTGCGCG CGCCGAACTC GAAACCTTGA TTCTCCGCGC AGGAACGAGC GCATGA
|
Protein sequence | MSSRAFITGI SGLGLTDDER GFIRESRPWG FILFKRNVDN PAQVSGLVQQ LRTEVGQPDA PVLIDQEGGR VQRLGPPHWP VYPRGALFSA LYDIDSQLGL AAARLSARLI ASDLFDLGIT VDCLPLADVP VSGADAVIGD RAYGFAAAKV AAIARAVTEG LEQGSVLPVL KHIPGHGRAT ADSHFRLPTV DTSRDELERT DFSAFQPLAD LPMAMTAHVV FSALDPAQPA TTSATIIDQV IRGRIGFQGL LMSDDVSMNA LEGTIAERAR ASIAAGCDIV LHCNGKLDEM RQVASEAPEL TGLALQRATA ALASRKPPQP LDRRAARAEL ETLILRAGTS A
|
| |