Gene RPC_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4085 
Symbol 
ID3973174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4537117 
End bp4538778 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content72% 
IMG OID637927189 
Producthypothetical protein 
Protein accessionYP_533930 
Protein GI90425560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.414268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0981997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT ACTACCCGCT GATCGCCCGC GCCATTGCCG GACTGGACCC CAGTGCCCCC 
GGCGAGAGCC GCCGTGCGCT CTACGAACGT GCCCGTTCGG CGCTGATCGC GCAGCTGCGC
GGGGTGCAGC CGCCGCTCAG CGAATCCGAG ATCACCCGCG AGCGGCTGGC CTTGGAAGAG
GCGGTGCGTA AGGTCGAATC CGAGGCGGCG CAACGCGCCC GCGATGTCAC CCGGTCGGCC
GATCAGAAAG CCCGCAACCG CTCCGCCGAG GCGTCGCGCG CCGGCGACAG CCTGCGCGCC
GGTAGCCGCC CCGCCGCCAA ACCCGGCGAC GGCAGGCCCG GCGAAATCAG GCCCGGCGAA
ATCAGACCCG GTGAAATCAG GCCCGGCGAG CCCGAGCCCG AGGCCGGCGA CGCCGCGCCG
CAGGACGGCG CGCCGCGGCC CTCGGCGCCG CCGTTGTCCA CCCGCGGCGA GCGGCCGCCG
CTCCGCGAGG AGCCGCCGCT GCCGCGCAAT CTGCGCGTCG AGCTGCCGCG TCCCGGGCAA
CCGTCGCCGT TTGCGGATCT CGATGATCCG CAGGCGTCCC GCGATCGGCC GCCGCAACCG
CCGCGCCGCC AGGAACCGGA GCGTCCGCCG AGCGCCAGCA CCGGGATGCG CGGCTTCCGC
GACGTCACCG CCGACGTCGA CGATCTCGGC CGCGCGGCCG CGCAGGCCAA TCGGCAGGCC
CGCAAGACCT ACGCCAACGT CACCTCGCCG TCGCCGGAGT TCGACCGGCT CGAGCCGAGC
ATGGAGAACC GCGGCGATCC GGAGCCGCCT TATTCCTACG ACGAATCCCC GGACGAGGCG
GCGCGCTATC AATCCTCGGT GCGCTCGCGC CCGGAGCCGA AACAGAAGCC GAACCGGCTC
GCCGGCGGCG GCTTTCCGAT CAAGAGCGCG CTGGCGATCG GCGTGGTGCT GGTTCTGGTC
GGCGCCGCGA TGCTGTGGGG CCCCTCGGCG TGGTCGTCGC TGCGCGCGAT GTTGAGCGCG
GAGCCGGCGG TGGTGGAAGC CCCGAAGGAG AGCGCGCCCA CCAGCCGGCC GAAGATCGCC
GACCGCGTCG GCCAGCCGTC GTCGATGGAC CAGGTGGCGC CGGTGGCGCA GCGCGTGGTG
CTGTATGACG AGGATCCGGC CGATCCGAAG GGCAAGCAAT ATGTCGGCAC GGTGGTGTGG
CGCACCGAGC AGATCAAGGG CAGCGGCGGC AAGCCGGGCG ACATCGCGGT GCGTGCCGAC
GTCGAGATCG CCGAGCGCAA GTTCAAGATG ACGATGTCGT TCCGCCGCAA CACCGACGCC
TCGCTGCCGG CCAGCCACAC CGCGGAACTG ACGTTCGTGC TGCCGGCGGA TTTTTCCGGC
GGCGGCGTCT CCAACGTGCC GGGCATCCTG ATGAAGTCCA ACGAGCAGGC CCGCGGCACG
CCGCTGGCCG GGCTCGCCGT CAAGGTCACC GACGGTTTCT TCCTGGTGGG CTTGAGCAAT
GTCGACGCCG ACCGCGCCCG CAACCTGCAG CTCCTGAAGG AGCGCTCCTG GTTCGACATT
CCGCTGGTCT ATTCCAACCA GCGCCGCGCC ATCATCGCGA TCGAGAAGGG CTCGCCCGGC
GAGCGCGCCT TCTCCGATGC GTTCACGTCC TGGGGCGAGT AG
 
Protein sequence
MADYYPLIAR AIAGLDPSAP GESRRALYER ARSALIAQLR GVQPPLSESE ITRERLALEE 
AVRKVESEAA QRARDVTRSA DQKARNRSAE ASRAGDSLRA GSRPAAKPGD GRPGEIRPGE
IRPGEIRPGE PEPEAGDAAP QDGAPRPSAP PLSTRGERPP LREEPPLPRN LRVELPRPGQ
PSPFADLDDP QASRDRPPQP PRRQEPERPP SASTGMRGFR DVTADVDDLG RAAAQANRQA
RKTYANVTSP SPEFDRLEPS MENRGDPEPP YSYDESPDEA ARYQSSVRSR PEPKQKPNRL
AGGGFPIKSA LAIGVVLVLV GAAMLWGPSA WSSLRAMLSA EPAVVEAPKE SAPTSRPKIA
DRVGQPSSMD QVAPVAQRVV LYDEDPADPK GKQYVGTVVW RTEQIKGSGG KPGDIAVRAD
VEIAERKFKM TMSFRRNTDA SLPASHTAEL TFVLPADFSG GGVSNVPGIL MKSNEQARGT
PLAGLAVKVT DGFFLVGLSN VDADRARNLQ LLKERSWFDI PLVYSNQRRA IIAIEKGSPG
ERAFSDAFTS WGE