Gene RPC_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1833 
Symbol 
ID3971713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1987592 
End bp1988995 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content68% 
IMG OID637924946 
Producthypothetical protein 
Protein accessionYP_531711 
Protein GI90423341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000567311 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.596658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA CATCGACCGA GCGTCTCAGA GACTATCTCG CGCAGCTGCC GCCGCAGGCC 
CAGGCGCTGC TGATCCGGGA ATTTGAGCGC TCCATCGAAC GCGGCGAAGA CGCCACCGTC
GCCAATTTCG TGCTCGGCCA GTTGCGCCAG ATCGTCCGCG GCACCGACGA AGAGGCGCCG
GCACGCACCG ACGACCCGGC ACGGCTGCTG TTCCGACCGC TCGAGCCGTT CCTGGTCGAA
CATTCCGGCG CGGTTCGGCC CGGTCAGATC CGCCGCGCCT CGCTGCTGCC GGTTTGGCAA
TGGCTCGGGC GCGACGGCGC ACCGCAGGCG GTGGCCGAAT TCGAGGCGAC GCTGGCGCGG
CTGCGTGGCG GCGGGACATC CTCGGAGATC GCTCACGCCG CGCGGAAGCT ACAGATGGCC
GCCGCCGACG CGATCGCCCA GGCGACCTCG GTGGCCCCCG GCGGCGACAA TCAGCGCATG
CTGGCTCGGA TCGGTTCGGC TGCGGTGGTG GAAGACCTGC GCGCGGTCGG CGCGGTGTTG
AAGAATCGAG ACGCGCTCGA CGCATTGTCC GACAAGTTGC CCGGCGGCCT CGGGGCATTC
GGCGATGCCC AGGTCAATTC GGTGATCGCC ACGCTCAACG TTCCCTCGCT GCAGACGCCG
CTGGTGTTGC CGTTCGCGCT GTCGTTGGTG ATGCAGCGGC TGTCGGCGCC CTGGCAGATC
ATCCGGATCG CGGTGCGGAT CGCCAGTTCC GACGACGAGG TCAGGGTCGC GGCAATTCCT
CTTGGCGTCG CCGTCACCAT GGCGCTGCAC GATCTGGCGC ATCTGGTCGC CGACCTGCGC
GCGCAGATCA AGCGCGGCCA CTTCGAGAAC TTCGCCGAAC GCCTGAAGCT GGTGCACGAT
GGCTTGCGCG GCGTGCGCAC CGAACTCGAT CTGCGCAACG ACTCGGTGTG GGGTCGGCAA
ATGGCCGCGA TCAGGGTCGA CATCTCCAAT TCGCTGCAAT CGGAGATCGA GAGCGTGCCG
GGCCGGGTTC GCCGGCTGCT GCGGCAGCGC CCCGACAAGG ACATCGCGGC GACCACCAAG
ATCGATCCCA GCGAAATCGA CGAGGTGCTG GCGCTGATCG CCTTCGTGGC GGTCTGCCGC
ACCTATGCCA GCGAACTGGC GATCAACGAG GTGACGCTGC GCACCTATTC CGACCTGCAG
CAATATGTCG AAAAATCCAC CGAGGCGCTG GTGCAGGCGT TGCGCGGCGC CGATGCGAAA
GTGCGCGGGT TCCGGCAGAT GCAGGTCAAG GCGGCGATCC GGTTCTGCGA GGCGCTGTTC
GGCCAGGACT ACGCCTCGCT GATGAGCCGG GCGGCGGACA ACGCCATGGT GGTGGTGGAG
CGCAAGCCGA GCCGGGCGGG CTGA
 
Protein sequence
MSQTSTERLR DYLAQLPPQA QALLIREFER SIERGEDATV ANFVLGQLRQ IVRGTDEEAP 
ARTDDPARLL FRPLEPFLVE HSGAVRPGQI RRASLLPVWQ WLGRDGAPQA VAEFEATLAR
LRGGGTSSEI AHAARKLQMA AADAIAQATS VAPGGDNQRM LARIGSAAVV EDLRAVGAVL
KNRDALDALS DKLPGGLGAF GDAQVNSVIA TLNVPSLQTP LVLPFALSLV MQRLSAPWQI
IRIAVRIASS DDEVRVAAIP LGVAVTMALH DLAHLVADLR AQIKRGHFEN FAERLKLVHD
GLRGVRTELD LRNDSVWGRQ MAAIRVDISN SLQSEIESVP GRVRRLLRQR PDKDIAATTK
IDPSEIDEVL ALIAFVAVCR TYASELAINE VTLRTYSDLQ QYVEKSTEAL VQALRGADAK
VRGFRQMQVK AAIRFCEALF GQDYASLMSR AADNAMVVVE RKPSRAG