Gene RPC_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1004 
Symbol 
ID3969691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1103071 
End bp1104774 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content57% 
IMG OID637924121 
Producthypothetical protein 
Protein accessionYP_530893 
Protein GI90422523 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0319616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTGG TTTTTAGAGG CCGTGCAACT AGACGCACGA AGCCGTCCCC CCTAGACGAG 
GGCGACGTAC TTGAGCTGCT TGCCAACGAT TGGGACGATT TCGGTTTTGG CACTCTCTTT
GGCGTCACTT GCCGCGTAGG CGAGGTCTCA CTTGATCTAG ATTATCTGAG GTTGCTAGTT
GAGGACGAAA ATCCGAGCCG CGTCAAACTA CAACAATTGT TGAGCGACGG GTGGGATGGC
AACTTCCCGA TTCCTGACAC AAACTACATT TCGGTTCCGT CCGACATCTC GTTCTATCAG
CAGCTCGAGG GCGTGATCGG CATCGAACGG ACCATCGACG TCGCGCTCGC GCTTCGCGAT
GCCAGCTATC TCGTGAACGT CGTCAGGGAC GAGATTGCCG TCGCCCTGAC CCAAACCACC
GGCTTTGGAC GCTCTCTCCA ACGCGAACGT GGAAGTATCA AAGCGTATTT CGACGGCTGG
CGTGTCCTCG ATAATCAGCA CATCGCGGTT CTTGATCTCG GATTTGCATT TGAGAACGTG
TTCGGAGAGC GATCAGCGCT CGATCTGAAA TACCAATCAA GCAGCCCGTT TCCTCACGAC
ATAAACGTCC TGATTGGCCC CAATGGGCAC GGAAAATCCC AAGTTCTTCA TCAGATGGTT
CGGGACTGGA TATCGCCTAG CGACGACACC GAGTTCGGGT TCGTCCGTAA GCCAAACCTC
AGCCGGATGG TGGTCGTCTC GTACAGTCCT TTCGAGCATT TCCCAGTCGA CCTCGCGGGG
AAGCGCTTGA AGGACGTTGA TGCCTACAAG TATTTCGGCT TTCGCGGCCG CCGTGCTCCG
TCGGAAGCCA ACAGGCGCGG CCGGATCGCG CTTTCGCACG AATTTCCGAA GAGGGATGCT
GCGCGCTCGC TCCTCGATTG CCTCAGCGAT GATCAGAAAT ATCAGTCCAT TAGGGATTGG
GCGAACAAGG TGCAGACAGT CGAGGGTGTC TTGCGTAGCG CCATCGACTT CGATTTCGCC
ACCGTGGAGA TTCCTTTGGC GCGCCGCGCC CGATCGCTAT TCGCGGCGGA CACGACTGTA
GAGCCACTGA GCATTATGAC CGGCGATGAC GAAGAGCGCC GTCAGTTCCT GCCAATCGCG
TCTGATCGAA TAGGCGTTCT AGACGCCGCG AAAGTGTTGG ATTCGCTTGT CGAAGAATCC
GGTGTTACTT TCTTCAAAGA TGAGGAGCCG ATCGAGCTGA GCTCCGGCCA ACGGCTCTTC
GCGTACATCG TGATCAACAT TCTCGGCGCT ATCAAGAGGA ACAGCCTTAT CCTGGTCGAC
GAGCCGGAGT TGTTCCTGCA CCCGACGCTG GAGGTTCAGT TCGTGGGCAT GCTGAAGCGA
ATCCTGAAAA GCTTCAACTC GAAGGCCCTG TTGGCGACCC ATTCCGTAGT GACGGTACGC
GAGGTGCCAG CGGACTGCGT CCACGTGTTC GACAAGGGCG AGGATTTTCT ACTGGTCCGT
CATCCGCCCT TTCAGACATT CGGCGGTGAC ATACAGCGCA TATCGTCCTA CGTGTTTGGC
GACAGTCATG TCTCCAAACC GTTCGAGGAC TGGATTACGG AGCAACTTGC GGAGTTCGGC
TCGGCGGACG AGTTGCTCGC GGCCCTTGGT GACGGGGTCA ACGAGGAGCT CATCGTGCGG
ATCAGGGCCA CGGGGCGCGA ATAG
 
Protein sequence
MRVVFRGRAT RRTKPSPLDE GDVLELLAND WDDFGFGTLF GVTCRVGEVS LDLDYLRLLV 
EDENPSRVKL QQLLSDGWDG NFPIPDTNYI SVPSDISFYQ QLEGVIGIER TIDVALALRD
ASYLVNVVRD EIAVALTQTT GFGRSLQRER GSIKAYFDGW RVLDNQHIAV LDLGFAFENV
FGERSALDLK YQSSSPFPHD INVLIGPNGH GKSQVLHQMV RDWISPSDDT EFGFVRKPNL
SRMVVVSYSP FEHFPVDLAG KRLKDVDAYK YFGFRGRRAP SEANRRGRIA LSHEFPKRDA
ARSLLDCLSD DQKYQSIRDW ANKVQTVEGV LRSAIDFDFA TVEIPLARRA RSLFAADTTV
EPLSIMTGDD EERRQFLPIA SDRIGVLDAA KVLDSLVEES GVTFFKDEEP IELSSGQRLF
AYIVINILGA IKRNSLILVD EPELFLHPTL EVQFVGMLKR ILKSFNSKAL LATHSVVTVR
EVPADCVHVF DKGEDFLLVR HPPFQTFGGD IQRISSYVFG DSHVSKPFED WITEQLAEFG
SADELLAALG DGVNEELIVR IRATGRE