Gene RPD_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2793 
Symbol 
ID4023291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3109846 
End bp3110871 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID637962991 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_569922 
Protein GI91977263 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0138787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.035915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCGC GCGCTTTCAT CACCGGCATC TCCGGCCTGG GCCTTACCGA CGACGAGCGC 
GAGTTTTTGC GCGAAACCCG TCCGTGGGGC TTCATCCTGT TCAAACGCAA CGTCGACAAT
CCCGCGCAAG TCGCTGGATT GGTTCGTGAA CTGCGCAAGG AAGTCGGCCA GCCCGACGCC
CCGGTCCTGA TCGATCAGGA AGGCGGGCGG GTGCAGCGTC TCGGCCCGCC GCATTGGCCC
GTCTATCCGC GCGGCGCGGT GTTCTCGGCG CTATACGATA TCGATTCGCA GCTCGGGCTC
ACCGCGGCGC GGCTCAGCGC GCGTCTGATC GCAGCCGATC TGGCCGATCT CGGCATCACC
GTCGACTGCC TGCCGCTCGC CGACGTGCCG ATCGCCGGCG CCGACGCGGT GATCGGCGAT
CGCGCTTACG GTACCGAACC CGCCAAGGTG GCGGCGATCG CCCGCGCGGT GACCGAGGGC
CTGGAGCAGG GCTCGGTGTT GCCGGTTCTC AAACACATTC CCGGGCACGG GCGGGCGACG
GCGGACAGCC ATTTCCGGCT TCCGACCGTG GACACGGCGC GGCAGGAGCT TGAGCGTTCC
GACTTCGCGG CGTTCCAGCC GCTCGCCGAT CTGCCGATGG CGATGACTGC ACATGTTGTG
TTCAGCGATC TCGATCCCGC CCAACCCGCG ACGACATCTG CGACAATCAT CGATCAGGTG
ATTCGCGGAC GAATCGGGTT CCAGGGACTG CTGATGAGCG ACGACGTCTC GATGAATGCG
CTGGAAGGGA CGATCGCCGA TCGCACCCGG GCCATCGTCG CGGCGGGATG CGACGTCGTC
CTGCATTGCA ACGGCAAGCT CGACGAGATG CGGCAGGTCG CCGCCGAGAC GCCGGAACTG
GCAGGCAAGG CGCTGCAACG CGCCGAGGCG GCGCTGGCGT CGCGCAAGCC GCCGCAGATT
TTCGACCGCG CCGCCGCGCG CGCCGAACTC GAAGCCTTGA TCGGGCGCGC AGGGATAAGC
GCATGA
 
Protein sequence
MTSRAFITGI SGLGLTDDER EFLRETRPWG FILFKRNVDN PAQVAGLVRE LRKEVGQPDA 
PVLIDQEGGR VQRLGPPHWP VYPRGAVFSA LYDIDSQLGL TAARLSARLI AADLADLGIT
VDCLPLADVP IAGADAVIGD RAYGTEPAKV AAIARAVTEG LEQGSVLPVL KHIPGHGRAT
ADSHFRLPTV DTARQELERS DFAAFQPLAD LPMAMTAHVV FSDLDPAQPA TTSATIIDQV
IRGRIGFQGL LMSDDVSMNA LEGTIADRTR AIVAAGCDVV LHCNGKLDEM RQVAAETPEL
AGKALQRAEA ALASRKPPQI FDRAAARAEL EALIGRAGIS A