Gene RPC_3056 details


Gene Information

Locus tag: RPC_3056
Symbol:
ID: 3973374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3387512
End bp: 3389011
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 65%
IMG OID: 637926166
Product: integrase catalytic subunit
Protein accession: YP_532919
Protein GI: 90424549
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
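
The length fields above agree with each other if the coordinates are read as 1-based and inclusive: End bp minus Start bp plus one gives 1500 bp, and 1500 / 3 codons minus one stop codon gives 499 aa. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon is not translated


length_bp = gene_length(3387512, 3389011)
print(length_bp)                  # 1500
print(protein_length(length_bp))  # 499
```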


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.278904
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACAG TGGAGACGGT GGCTCGGATT CGGCGTGAGC ATTTTCTCAA GGGCAAGACG 
ATCAAGGAGA TCGCCCGGGA TCTGAGGGTG TCGCGGAACA CGGTCCGCAA GGTGCTGCGG
TCCGGTGAGA CGTCATTCGA GTATGGGCGC GAGGTTCAAC CGCGACGGAA GCTTGGCCGA
TGGACGGCTG AGCTCGATGC ACTGCTTGCA AGCAACGCTG CGAAGGCCGC CCGCGAGCAA
TTGACGCTGA TCCGGATCTT CGAAGAGCTG CGTGGGCGAG GCTATGACGG CGGCTACGAT
GCTGTGCGTC GCTACGCCCG CCACTGGGCC AAAGAGCGAG GCCAGGCGAC GGCCGCGGCC
TATGTGCCGC TGAGCTTTGC GCCGGGAGAA GCCTACCAGT TCGACTGGAG CCACGAAGTC
GTGCTGCTCG GCGGGGTGAC GGTGATCGTC AAGGTCGCCC ATGTCCGGCT CTGTCACAGC
CGCATGCTGT TGGTGCGCGC TTATCCGCGC GAGACGCAGG AGATGGTGTT TGACGCCCAT
GACCGGGCGT TCGCACTGTT CAAGGGGACC TGCGGGCGCG GCATCTACGA CAACATGAAG
ACGGCGGTTG AGACGATCTT CGTCGGCAAG GCCCGTCTTT ACAATCGCCG CTTCATGCAG
ATGTGCAGCC ACTATCTGGT TGAGCCGGTC GCCTGCACAC CGGCGTCGGG CTGGGAGAAG
GGCCAGGTCG AGAACCAGGT CGGCCTGGTG CGCGAGCGGT TCTTTACGCC GCGGCTGCGG
TTCAAGAGCT ATGACGAGAT GAACGCCTGG CTCACCGACA AATGCATCGC CTACGCCAAG
GCGCATCGCC ATCCGGAGTT GACCGAGCAA ACGATCTGGG AGGTATTTGA AGCCGAGCGG
CCGAAGCTCG TCCCCTATGC CGGCCGGTTC GATGGATTCC ATGCGGTGCC GGCATCGGTC
TCCAAGACCT GCCTGGTGCG CTTCGACAAC AACAAATACT CTGTGGCGGC GAGCGCCGTC
GGGCGTCCGG TCGAGGTTCA TGCCTATGCC GACCGTATCG TCATCCGACA GGACGGCCGC
ATCGTGGCCG AGCATCCTCG CTGCTTCGGC CGCGGCGAGA CCAGTTACGA TCCCTGGCAT
TACGTGCCGG TGCTGGCGCG CAAGCCCGGG GCCTTGCGCA ACGGTGCGCC GTTCAAGGAC
TGGGTGCTGC CGGCCGCGAT GGAGCGAGTG CGGCGCAAGC TCGCCGGCGT CGTCGATGGC
AATCGGCAGA TGGTCGACAT CCTCAATGCG GTGCTAACCG ACGGGCTGCC GGCGGTTGAT
GCCGCCTGCG CCGAAGCTGT CGATCACGGC GTTCATTCCG CCGACGCCAT CCTCAACATC
CTGGCGCGCC AGCGCGATCC CACGCCGCCG GCCAACATCC TTACGCCCGC CGCGCTGACA
TTGCGCCACG CGCCGCTCGC CGATTGTGCC CGTTACGACA ACCTGAGGAG AACCGTCTGA
 
Protein sequence
MLTVETVARI RREHFLKGKT IKEIARDLRV SRNTVRKVLR SGETSFEYGR EVQPRRKLGR 
WTAELDALLA SNAAKAAREQ LTLIRIFEEL RGRGYDGGYD AVRRYARHWA KERGQATAAA
YVPLSFAPGE AYQFDWSHEV VLLGGVTVIV KVAHVRLCHS RMLLVRAYPR ETQEMVFDAH
DRAFALFKGT CGRGIYDNMK TAVETIFVGK ARLYNRRFMQ MCSHYLVEPV ACTPASGWEK
GQVENQVGLV RERFFTPRLR FKSYDEMNAW LTDKCIAYAK AHRHPELTEQ TIWEVFEAER
PKLVPYAGRF DGFHAVPASV SKTCLVRFDN NKYSVAASAV GRPVEVHAYA DRIVIRQDGR
IVAEHPRCFG RGETSYDPWH YVPVLARKPG ALRNGAPFKD WVLPAAMERV RRKLAGVVDG
NRQMVDILNA VLTDGLPAVD AACAEAVDHG VHSADAILNI LARQRDPTPP ANILTPAALT
LRHAPLADCA RYDNLRRTV
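
One way to check the two sequences against each other is to strip the grouping whitespace, translate the gene sequence, and compare the result with the protein sequence. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it assumes the 10-character, whitespace-separated layout shown above. It uses the standard codon assignments, which translation table 11 shares; table 11 differs only in which codons may act as start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups of the gene sequence and first group of the protein sequence.
first_groups = clean("ATGCTGACAG TGGAGACGGT GGCTCGGATT")
print(translate(first_groups))  # MLTVETVARI
```

Passing the full gene sequence through `clean` and `translate` should allow a direct comparison with the cleaned protein sequence, and `gc_content` can be compared with the GC content field in the same way.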