Gene RPD_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1994 
Symbol 
ID4022476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2230793 
End bp2232427 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content67% 
IMG OID637962187 
Producthypothetical protein 
Protein accessionYP_569130 
Protein GI91976471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA CCGCCAATCT CGGATTGCCC TTTATCGCGG CGAGCCAGGC GCAGAAGCAC 
GTCACCCACA ACGAGGCGCT GTTCAGCCTC GACGGCCAGG TTCAACTCGC CGTGCTGTCG
GCGGCGCTGG CGACGCCGCC GGCCTCGCCG GACGATGGCG AGCGCTGGAT CGTGCCGGCC
GGCGCGAGCG GCGCGTGGGC CGGGAAAGCC GCGCAGATCG CGGCCTGGTA CGACGGCGGC
TGGCGGTTCT TCGCGCCCCG GCCGGGTTGG CTCGCCTACA ATCTCGCGAC GCAGACGCTG
CTGGCGTGGA CCGGCGCAGC CTGGGTGAAC GCGCTGGCGG CGTTTCAGAA CCTGCCGATG
TTCGGCCTCA ATACCACCGC CGATGCGAGC AATCGGCTTG CGGTGAAATC CGACGGCGTG
CTGTTCGGCA ATGACGACGT CACCCCGGGC AGCGGCGACG TCCGCGTCAC ATTGAACAAG
AGCGACGCGG CGAAAGACGC CGGGCTGACG CTGCAGAATA ATTGGAGCAC GCGGGCGCAG
CTCGGGCTGC TCGGCGACGA CAATTTCCAC ATCAAGGTCA GCGCAAACGG CTCTGCGTTC
ACCGATGCGA TCCAGATCGA CAGGACCACC GGCAATGTCG GCATCCGCAC CGCGCCGAGC
AGCGGCGGCA ATGCGTTGCA GGTCGCCGGC TCGAACGCAT TGTTCAGCAA CAGCGCCGGC
GGGTTCTCCT TCACCTTCAG CAAGGCCGCC ACGGCCCACG ACGCCGCTTT GTATCTGCAG
ACCAACTACA GCACCAAGGC GCTGTTCGGC CTGCTCGGGC TCGACGATTT TTCGCTGAAG
GTGACGCCGG ACGCCGCAAA TTACTACGCA GGACTTCGGG CGTGGTCGGC GCTGCACGGC
CGGCTCGACA TCAAGGATGC GCGGCGCAGG CAGCCGATGC ATTGGTCGCC GCGGCCGGGC
AGCACCATGC TCGACAGCAT CGGGCTTGGC GCGTCGATCA CCGGCGCCGC GACCGCGGTG
TCGCCGTCGT CGGGCAATCT GTTTCTGTCG GCCCCGCGGC TCGATTTCAA CTCCGCCGCC
ACGGCCGGCG CCAGCGCTGG CGTCAACGGA TCGGCGCTGA CGTTGTGGCG CGGCAACGGC
GGCGGTCTCG GCGGTTTCTA TCTGCTGATG CGGTTCGGCA TCGAGACGTT CCAGTCGAAT
TGCCGGCTGT TCGCCGGGCT GGTCGGCTCC GCTGGCGCGA TCGGCAACGT CAATCCGAGC
ACGTTGTCGA ACCTGATCGG CGTCGGCTTC GATTCCGGCG ACGCGACGCT GTCGCTGATC
AGCAATGACG GCAGCGGCGC AGCGACCAAG ACGGGCCTCG GCGCCGGCTT TCCGACCACC
GGCGGACAAG ACCTGTACGA ACTGCTGCTG TCGGCCGAGC CGAACGGCAG TGAGGTTCGC
TACCGCGTCG AGCGGCTGAA TTCCGGCGAC GTCGCCAGCG GCGTTGTGAC GACCAATCTG
CCCGTCAACA CGCAGTTTCT GACGCCGCAT CTGTGGATGA ACAACGGCAC CAGCGCCGGC
GCGGTCAGCG TGGCGCTGGT GCAGATGTAT TGCGAGCCGG CGGCGTTGCT CGGCTCGCGC
GGACTGATCG GTTAG
 
Protein sequence
MTVTANLGLP FIAASQAQKH VTHNEALFSL DGQVQLAVLS AALATPPASP DDGERWIVPA 
GASGAWAGKA AQIAAWYDGG WRFFAPRPGW LAYNLATQTL LAWTGAAWVN ALAAFQNLPM
FGLNTTADAS NRLAVKSDGV LFGNDDVTPG SGDVRVTLNK SDAAKDAGLT LQNNWSTRAQ
LGLLGDDNFH IKVSANGSAF TDAIQIDRTT GNVGIRTAPS SGGNALQVAG SNALFSNSAG
GFSFTFSKAA TAHDAALYLQ TNYSTKALFG LLGLDDFSLK VTPDAANYYA GLRAWSALHG
RLDIKDARRR QPMHWSPRPG STMLDSIGLG ASITGAATAV SPSSGNLFLS APRLDFNSAA
TAGASAGVNG SALTLWRGNG GGLGGFYLLM RFGIETFQSN CRLFAGLVGS AGAIGNVNPS
TLSNLIGVGF DSGDATLSLI SNDGSGAATK TGLGAGFPTT GGQDLYELLL SAEPNGSEVR
YRVERLNSGD VASGVVTTNL PVNTQFLTPH LWMNNGTSAG AVSVALVQMY CEPAALLGSR
GLIG