Gene RPC_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3676 
Symbol 
ID3969613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4090514 
End bp4092316 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content67% 
IMG OID637926786 
Productglycoside hydrolase 15-related 
Protein accessionYP_533530 
Protein GI90425160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTGCAC GAATCGAAGA TTATGCACTG ATCGGCGATT GCGAAACCGC AGCCTTGGTC 
GGCCGCGATG GCTCGATCGA CTGGCTGTGC TGGCCGTCGT TCGATTCGGA GGCCTGCTTT
GCCGCGTTGC TCGGCAGCAA TCAGCACGGC CGCTGGCAGA TCGCACCGGC CGACGCGATC
ACCGCCAGCT CGCGGCGCTA CCGTGGCGAC ACGCTGATCC TGGAAACCCG GCTGCAGACC
GCGACCGGCG TTGTCACCCT GATCGACTTC ATGCCGCCGC GCGGCAGCGC GTCCGACGTC
GTGCGCTTGG TACGCGGCGA GCAGGGCCGC GTGAAGCTGC GCATGGAGCT GGTGATCCGC
TTCGGCTTCG GCGTCAACAT TCCATGGGTG AAACGCACCG ACGGCGGCGC GCTGCTGGCG
ATCTGCGGTC CCGACATGGC GGTGCTGCGC AGCTCGGTCG AGACCCACGG CGAGGCCATG
ACCACGGTGT CGGAGTTCGA GGTCGCGGCC GGCGAGACCG CATCGTTCGT GCTGGCCTAT
GCGGCGTCGC ATTTGGCGGT GCCGGAACCG ATCGACGCCG AGCAGGCGCT CGCCGACACC
GAACAATTCT GGGCGGAGTG GTCGGGCCGC TGCACCTATC ACGGCGGCGA CCGCGACCTG
GTGATGCGCT CCTTGATCAC GCTGAAGGCG CTGACCTTCG CGCCGAGCGG CGGCATCGTC
GCGGCCCCCA CCACTTCGCT GCCGGAGAAA CTCGGCGGCG CCCGCAATTG GGACTATCGC
TATTGCTGGC TGCGCGACGC CACCTTCACG CTGCTGGCGC TGATCAATTC CGGCTACACC
GAGGAGGCCT CGGCCTGGCA CAATTGGCTG TTGCGCGCGG TGGCCGGCGC GCCGGCCGAC
ATGCAGATCA TGTACGGCAT CATGGGGCAG CGCCGGCTGC TGGAATGGCA GGCCGACTGG
CTGCCGGGCT ATGAGGGCGC TGCGCCGGTG CGGATCGGCA ACGCCGCGCA CGCGCAATTG
CAGCTCGACG TCTATGGCGA GCTGATCGAC GCGTTCCATC AGTGGCGGGT GGCGGACATC
ATGCTCGATG GCGAGTCGTG GTCGCTGGAA TGCGCTGTGC TGGAGCATCT TGCGAAGATC
TGGAACGAGC CCGATAGCGG CATCTGGGAG CTCCGCGGCC CCGGCCGGCA CTACGTCTCC
TCAAAGGTGA TGACCTGGGT GGCGTTCGAC CGCGGCATCA AGAGCGCGGA AATGTTCGGC
CTCGACGGAC CGCTCGCGCA ATGGCGGGCG CTACGCGACG AGATCCATCG CGACATCTGC
GCCAACGGCT TCGACAAAGA GCAGAACTGC TTCGTGGTGT CCTACGGCGC CAAGATGCTG
GACGCGTCGA TCCTGCTGTT GCCCTCGGTC GGTTTCCTGC CGGCGTCGGA TCCGCGGGTG
CAGGGCACGC TCAAAGCGAT CGAGCGGCAT CTGATGCGCG ACGGCTTCGT GCTGCGCCAC
GATCCGCGCG AGGTGACGGA TGAAAAGCAG CCGATCGAGG GTGCGTTCCT GGCCTGCAGC
CTGTGGCTGG CCGACGCCTA TCTGCTGGCC GGCGAAATCG GCAAGGCGAA GGCGCTGTTC
GACCGCGTCG CCGCGGTGGC CAACGATGTC GGCTTGCTCG CCGAAGAATA TGATTCCGAA
GCCAAGCGGC AGACCGGCAA TTTCCCGCAG GCGCTGACCC ACATCGCGCT GATCAACACC
GCGCAGAATC TGTCGGCCGT GCAGCAGCCC GCCGACAAGC CGGTGACGCA GCGCGCGAAG
TAA
 
Protein sequence
MPARIEDYAL IGDCETAALV GRDGSIDWLC WPSFDSEACF AALLGSNQHG RWQIAPADAI 
TASSRRYRGD TLILETRLQT ATGVVTLIDF MPPRGSASDV VRLVRGEQGR VKLRMELVIR
FGFGVNIPWV KRTDGGALLA ICGPDMAVLR SSVETHGEAM TTVSEFEVAA GETASFVLAY
AASHLAVPEP IDAEQALADT EQFWAEWSGR CTYHGGDRDL VMRSLITLKA LTFAPSGGIV
AAPTTSLPEK LGGARNWDYR YCWLRDATFT LLALINSGYT EEASAWHNWL LRAVAGAPAD
MQIMYGIMGQ RRLLEWQADW LPGYEGAAPV RIGNAAHAQL QLDVYGELID AFHQWRVADI
MLDGESWSLE CAVLEHLAKI WNEPDSGIWE LRGPGRHYVS SKVMTWVAFD RGIKSAEMFG
LDGPLAQWRA LRDEIHRDIC ANGFDKEQNC FVVSYGAKML DASILLLPSV GFLPASDPRV
QGTLKAIERH LMRDGFVLRH DPREVTDEKQ PIEGAFLACS LWLADAYLLA GEIGKAKALF
DRVAAVANDV GLLAEEYDSE AKRQTGNFPQ ALTHIALINT AQNLSAVQQP ADKPVTQRAK