Gene RSp0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSp0833 
SymbolpehC 
ID1223140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp1052058 
End bp1054100 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content69% 
IMG OID637240693 
Productpolygalacturonase transmembrane protein 
Protein accessionNP_522394 
Protein GI17549054 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00798361 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCCAAAC AGAAACACTC CAGCGCGCGC CATGCCGGCC GACAGGCGCC GCACCGCCCG 
CAGTCGCCCG CGCGGCGCGC ATTCGTCATG TGGTCGGGCG CCTCGGCCGG GGCCGCGCTG
CTCGGCACGC TGCCGGGCTG CGGCGGCGAT GGCGGCAGCA GTGCCACGGC CGCCACCGCC
AGCCCCGCCG ACGCAACCGC GACCGTGACC ACGCGGGACG CGATCTGGGG CGAATCCGGT
GCCGCCACGC GCATCGTCGC CTCGCTGCAA GGCGTGGCCC AATCGATGTT TCCCCGCCGC
GACTTCGTGG TGACGGACTA CGGCGCCCAG CCTTGCGCCA TCGTCGACGC GGTCAACCCT
TACACCGACG CCAGCAAATC GCCGCTCAGC GCGGGCGCCG ACAAGACGCC CGCGACCGGC
GCCTTCGATG CGCGCGCGGC TTTCACCGCC GCCATCGCCG CCTGCAATGC GGCCGGGGGC
GGCCGGGTGT TGGTGCCGGC CGGCAACTGG TACTGCGCCG GCCCGATCGT GCTGCTCAGC
AACGTCAACT TCCACCTGAG CGCCGACTGC ACGATCTACT TCAGCCCCAA CCCGGACGAC
TACGCCAAGG ACGGCCCGGT CGATTGCGGC GCCAACGGCA AGCTCTACTA CAGCCGCTGG
CAGTCAAACG ATTGCCTGAA CTACGGCGCC CCCGTCTATG CGCGCAACCA GCGCAACATC
GCGCTGACCG GGGAGGGCGA CAGCTCCACC CTCAACGGGC AGGCCATGAC GCCGTTCGCG
GGCAGCGGCA ACACCAGCAC CTGCTGGTGG ACCTTCAAGG GCACCAAGGG CGAGTACGGC
GCCGTCAACG CTTCCACGCC GTCGCAGGCG TACAGCAATC CGAACAACGT CGACCTGCGC
ACGGCGGCAC CGGGCATCGC CGATGACCTC TACGCCAGGC TGACCGACCC GACCACGCCG
TGGCAGCAGG ACCAGAACTA CCTGTCCGCG CTGTCCGAGG CCGGGGTGGC GGTGGCGCAG
CGCATCTTCG GCAAGGGCCA CTACCTGCGG CCGTGCATGG TCGAATTCAT CGGCTGCACC
AACGTGCTGA TGGAGGCTTA CCGCACCCAC GCCACGCCGT GCTGGCAGCA CCATCCGACC
GACTGCGCCA ACGTGGTCAT CCGCGGCGTG ACGGTCGACA GCATCGGCCC CAACAACGAC
GGCTTCGACC CCGATGCGTG CAGCAACGTG CTGTGCGAGG ACATGACCTT CAACACCGGC
GATGACTGCA TCGCCATCAA GTCCGGCAAG AACCTCGACA CCGGCTACGG CCCGGCGCAG
GACCACGTGA TCCGGAACTG CATCATGAAC AGCGGCCACG GCGGCATCAC GCTCGGCAGC
GAGATGGGCG GCGGCGTGCA GCGGATCTAC GCGCGCAACC TGACGATGCG CAATGCGTTC
TATGCGACCG ATCCGCTGAA CATCGCCATC CGCATCAAGA CCAACATGAA CCGCGGCGGC
TATGTCCGGG ACTTCTACGT CGATGACGTG ACGCTGCCCA ACGGCGTCAG CCTCACCGGC
GGCGGCTACG GTAGCGGCCT GCTGGCGGGC AGCCCCATCA ACAGCAGCGT GCCGCTGGGC
GTGGCGACGG CCGCCAGCGC CAATCCGTCG GCATCGCGGG GTGGCCTCAT CACCTTCGAC
TGCGACTACC AGCCGTCCAA GGACGCCATC CGCACGCGGC CCGCGCAGGT GCAGAACATC
CACATCTCGA ACGTGCGCGC GTCCAACGCG ACGGTGGGCG GGACGACGGG GTCGTGCTTC
CAGGCCATCG TCGCGCAGGG GCCGGTCGCC TTCGACTACA ACGGCCCGAC GCCCGCGCCG
ACGGTCCCGC CCATCGCGGG CGTGACGATC ACCGACTGCG ATTTCGGCAG CCCGGTCGCC
GCCGGCCCGG CCAGCGCGTC GACGCCGGGC CCGATCTACG CCTACAACGT CAGCGACATC
ACGCTCACCA ACGTCCGGAT CGGCGCGCAG ATCTACAACA CTACCGTGAG CGACACGCGC
TGA
 
Protein sequence
MPKQKHSSAR HAGRQAPHRP QSPARRAFVM WSGASAGAAL LGTLPGCGGD GGSSATAATA 
SPADATATVT TRDAIWGESG AATRIVASLQ GVAQSMFPRR DFVVTDYGAQ PCAIVDAVNP
YTDASKSPLS AGADKTPATG AFDARAAFTA AIAACNAAGG GRVLVPAGNW YCAGPIVLLS
NVNFHLSADC TIYFSPNPDD YAKDGPVDCG ANGKLYYSRW QSNDCLNYGA PVYARNQRNI
ALTGEGDSST LNGQAMTPFA GSGNTSTCWW TFKGTKGEYG AVNASTPSQA YSNPNNVDLR
TAAPGIADDL YARLTDPTTP WQQDQNYLSA LSEAGVAVAQ RIFGKGHYLR PCMVEFIGCT
NVLMEAYRTH ATPCWQHHPT DCANVVIRGV TVDSIGPNND GFDPDACSNV LCEDMTFNTG
DDCIAIKSGK NLDTGYGPAQ DHVIRNCIMN SGHGGITLGS EMGGGVQRIY ARNLTMRNAF
YATDPLNIAI RIKTNMNRGG YVRDFYVDDV TLPNGVSLTG GGYGSGLLAG SPINSSVPLG
VATAASANPS ASRGGLITFD CDYQPSKDAI RTRPAQVQNI HISNVRASNA TVGGTTGSCF
QAIVAQGPVA FDYNGPTPAP TVPPIAGVTI TDCDFGSPVA AGPASASTPG PIYAYNVSDI
TLTNVRIGAQ IYNTTVSDTR