Gene RPC_4522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4522 
Symbol 
ID3972300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5046242 
End bp5047531 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID637927632 
Productphage integrase 
Protein accessionYP_534363 
Protein GI90425993 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.292224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGGG GATTCGGAAA ACTGACCGTC AAAACGGTCG AGCATTTGTC TCGGCGAGGA 
ATGCACGCTG ACGGCGGCGG GCTTTATCTG CAGATCGCCG AGGGCGGCTC AAAAAGCTGG
CTGTTTCGAT ACAAGCTCCA CGGCCGCACA CATTGGCATG GCTTGGGTTC GCTGCGTGAC
GTGGGCCTTG AAGAGGCGCG AGAGAAGGCG ACCGGAGCGC GCAAGGTCCG CCGCAATGGC
GGCGACCCCA TCCAGGCAAA GCGCCAAGCG GAAGCCGCCG CCCGCATCGA AGCGGCCAAG
GCGATCACGT TCGGAGAGGC GGCCAAGCGG TTCATCAAAG CCAATCGGTC AGGCTGGAAA
AACGCCAAGC ACGCCGACCA ATGGGTGATG ACGTTGCTCG GCATCGACCA GAAGGGCAAG
CCGAGCAAGA ACGACTACTG CAAGACCATC CGTGATCTTC CCATCGGCGC GATTGATACG
ACGCTGGTCC TGCGGATCAT CGAGCCGATC TGGGCGAGCA AGACCGAAAC GGCCACCCGT
ATTCGCAGTC GCATCGAGTT GGTGATCGAC GCCGCCAAGG CGAAGGGCGA GTTCAACGGC
GAAAACCCGG CACGTTGGAA GGGGCATCTC GATAACCTGC TCCCAGCCAC CTCGAAGGTT
CGCAAGGTCC GCAATCATCC GGCGCTGCCT TACAGACAAT TGCCCGACTT CATGCGCAAG
CTCCGCGAGC GTGACGGTGC CGCTGCGGCG GCTCTTGAGT TTCAAATCTT GACGGCCGTC
AGGCCGGGCA ACGCAGTAGC GGCAAAATGG GACCAGATCG ATCGGCGGAC ATCAGTCTGG
ACCATTCCGT CCGCACTCAT GAAGAGCGAT GCCGAACACA GAGTGCCATT GAGCAAGGCA
GCGCTCGCCG TGCTTGATCG CATGGAAGCC ACGAAGGATG ACAGCGAGTA CATCTTCCCC
AACACCAAGG GCAAGCCGCT GAGTGACGCT TCGATGGCCG CGGTGATCGA CAGGATGAAT
GAGCTGAAGC GGCACTGGAT CGATCCCAAG CTTGACCGCG GAATCGTTCC TCACGGATTC
AGGTCGTCGT TCCGCGATTG GGCTGCAGAG CACGGCTATG ACGATCCCGT CGCCGAAGCT
GCGCTCGCGC ACAAGGTCAG CGACGAAGTG GTCGCGGCCT ATCGGCGCAC GACGTTCCTC
GACCTGCGCA ACCGGATGAT GGAGGACTGG GCCGACTATT GCGCCCGACC TTTCAGCAGC
GGTGACATCG TCGTGGCGTT CCGGGCGTAG
 
Protein sequence
MARGFGKLTV KTVEHLSRRG MHADGGGLYL QIAEGGSKSW LFRYKLHGRT HWHGLGSLRD 
VGLEEAREKA TGARKVRRNG GDPIQAKRQA EAAARIEAAK AITFGEAAKR FIKANRSGWK
NAKHADQWVM TLLGIDQKGK PSKNDYCKTI RDLPIGAIDT TLVLRIIEPI WASKTETATR
IRSRIELVID AAKAKGEFNG ENPARWKGHL DNLLPATSKV RKVRNHPALP YRQLPDFMRK
LRERDGAAAA ALEFQILTAV RPGNAVAAKW DQIDRRTSVW TIPSALMKSD AEHRVPLSKA
ALAVLDRMEA TKDDSEYIFP NTKGKPLSDA SMAAVIDRMN ELKRHWIDPK LDRGIVPHGF
RSSFRDWAAE HGYDDPVAEA ALAHKVSDEV VAAYRRTTFL DLRNRMMEDW ADYCARPFSS
GDIVVAFRA