Gene RPD_2989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2989 
Symbol 
ID4023492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3328784 
End bp3329905 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content70% 
IMG OID637963188 
Producthypothetical protein 
Protein accessionYP_570116 
Protein GI91977457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.662915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCC GGTTGCTCAG TTCGGCGGCT GTATTGCTCC TGATCGGCTC TGTCGTTTCG 
GATCCAGTCC ACGCGCAGAA CCTTGAAGCC GGCAAGAGCC CGTCCCAGAT CTTTGCCGGC
ACCTGCGCGG CTTGCCACAA GGGCGCCCGT GGTCTGGTGC GGTCGGTGCC TCCGAGCTCG
CTGGCGTCGT TTCTGCGCCA GCACTACACG ACCAGCAGCG ACATGGCCTC ATTGCTCGCC
TCGTATCTGA TCTCGAACGG CGCTACCGAC ACCCGCTACA AGCAGAACGA CGCGAAATCC
GAACCCGGTC AGCCGGAGGG CCGGCAGGGC CGCAGGCAAC GCCCGGTGGC GGGTGAGGCG
GCCCGGCCCG AAGCTGCCGC GCCCGAGGCG GGCGCTCCTG TGCAGGCCGA AGAGGGCGTC
CGCCGAAGCC GCAACAGCAA ACGCCAGCCC AAGCCCGAGG CGGACAAGCC GGCCGAAGGC
GCTGCCGCGG CAGAGACCGC CGCGCCGGCG CCTGCGGAGC AACCGCCGGC GAAGGAACGT
CGCAAGCACG GTCGGAAGGA CAAGCCCGGA CAGGCCGCGC CTGCTGGCGC CGATGCGGCC
AAGGGCGAAC CTCAGAAGCC GTCGCCGGCC GCGACCGAGC CGGCCGTCGC GAAGCCAGAT
ACCACGAAGC CTGACAGCGC CAAACCGGAT GGCGCCAAGC CGGACGAGTC GAAGGCAGGC
GCTGCGAAAC CAGATGCGCC GAAGACCGAC GCCGCCCCGC CGGACGCTTC CAAGATTGAT
ACGCCCAAGA CTGACGCTCC CAAGACTGAC GCCTCCAAAC CGGATGCTTC GAAGTCCGAA
ACGCCGAAGA CCGAGGCTGC GCAGCCCGAC CGCGCAGCGA CGCCCGCTGA GGTGCCGCTG
CGCCCCGATC CGGTTCCGGC GGTGACGCCG GCCCCGAAGG CCGTCGATAG CTCGAAGACC
CCTGAGGCCG CTCCGCTTGC TGCGAAGCCG GCGGAGCCGC CCGCGTCCGC GACGACCTCT
CCGCCCGCCG CAACGACGCC CGGCGAGCCG TCGATCGCCG TGACGCCGAT TCCGCCGCCG
CCTGCGCAGG GCAGCGCGTC AGATGTTCCG ATCTCCCGTT AA
 
Protein sequence
MVSRLLSSAA VLLLIGSVVS DPVHAQNLEA GKSPSQIFAG TCAACHKGAR GLVRSVPPSS 
LASFLRQHYT TSSDMASLLA SYLISNGATD TRYKQNDAKS EPGQPEGRQG RRQRPVAGEA
ARPEAAAPEA GAPVQAEEGV RRSRNSKRQP KPEADKPAEG AAAAETAAPA PAEQPPAKER
RKHGRKDKPG QAAPAGADAA KGEPQKPSPA ATEPAVAKPD TTKPDSAKPD GAKPDESKAG
AAKPDAPKTD AAPPDASKID TPKTDAPKTD ASKPDASKSE TPKTEAAQPD RAATPAEVPL
RPDPVPAVTP APKAVDSSKT PEAAPLAAKP AEPPASATTS PPAATTPGEP SIAVTPIPPP
PAQGSASDVP ISR