Gene RPD_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3871 
Symbol 
ID4024387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4310167 
End bp4311390 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content67% 
IMG OID637964075 
Productlate embryogenesis abundant protein 
Protein accessionYP_570993 
Protein GI91978334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.396289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCACG ATGCTCGAAC CCTGGATCAG ATCCGGCGCG ATACGGAACG CGCCCGGGCG 
GGATTGACCG AAACCGTCGG CGAGCTGCGG GCGACCGTCG CCGACACCGC GAGCGACCTG
CGAGAGCGCT ATTCGCCGCA GGCGATCAAG GACGATGTCA GCCATTACAT CAAGACGCGT
GGCGAAGAGA TCGCCGACAA GGTCAGCGAC ACCATCCGCA ACAATCCGGT GCAGGCCGTG
GCGGTCGGCG CGACGCTGGC CTATCCGCTG TGGAAGATTG TGCGCGCGAT TCCGACCCCT
GTGTTGATGG TCGGCGCCGG CCTTTATCTC GCCGGCTCGA AGTCCGGCCA GCAATTGACA
CAGCGTGCGT CTGACGCGGC GGTCGATCTC GCCGGGGACG TCGAGCGTCG GGCCCGCGCG
TTCGGCTCCG ACGCCGTCGA CACCGCAGAA GCCGCCAAGG AATATGCGAC CGGCGCGGTG
CAGGCTGTGG GCGAGGCCGC CACCAGCCGC GCCAATGAAT TCCGTCGCGC CGCGATTTCC
ACTGCCGCCG ATTTGAAGAA CAAAGGCGAG CAGTTCGGTC GCAATGTTTC GGCGCAGGTC
GACGACCTTG GTCGCACCGC GGCCGCCGCG GGTGGGGCTT TCGCCGGGGA AGTCGACGAT
GTCGCGGGCC GTGGCGCCGG CATTGCCGGG GCTGTGACAG ATACGCTCCG TGACACTGCG
GCCTCGGTGC GCGACGCCGC AGCGTCGGTC CGTGACAACG CCGCGGATGC AGCGATGCGG
CTGCGTGACA AGGTCGGCGA AACGGCAGAT TCCGGACTCG ATGCGGCTGT GCGGGTTCGC
GAGCGCGCGA CCGATCTAGG CAATCGCGCC GGCAAGAGCT TCACGGAAAC CGTGAGCAAT
CACCCGCTGC TGGTCGCCGG CATCGGCCTC GTGGTCGGCG GTCTGATCGC GAGTGCGATC
CCGCGGCTGC GCGCCGAGAG GCAGGTGTTT GGAAATGCCG GTCGGAGGAT GCGGGACCAG
GCCGAGGACA CGATGGCGCG CGGCGTCGAA ACGGTGAAGC AGAAGGGGCG CGACGTCTAT
GAAAGCGCGG TCAACGCCGC CGAAGACGAG GGGCTGACGC GCGAGAAGAT GGGCGATCAG
GTCCGCGATC TCGGCGACCG AGCCCGCAAG GTTGCCGAAG CCGCGGTCTC GACGTTCGAG
TCGCCGTCGC AGAACAAGCA TTGA
 
Protein sequence
MAHDARTLDQ IRRDTERARA GLTETVGELR ATVADTASDL RERYSPQAIK DDVSHYIKTR 
GEEIADKVSD TIRNNPVQAV AVGATLAYPL WKIVRAIPTP VLMVGAGLYL AGSKSGQQLT
QRASDAAVDL AGDVERRARA FGSDAVDTAE AAKEYATGAV QAVGEAATSR ANEFRRAAIS
TAADLKNKGE QFGRNVSAQV DDLGRTAAAA GGAFAGEVDD VAGRGAGIAG AVTDTLRDTA
ASVRDAAASV RDNAADAAMR LRDKVGETAD SGLDAAVRVR ERATDLGNRA GKSFTETVSN
HPLLVAGIGL VVGGLIASAI PRLRAERQVF GNAGRRMRDQ AEDTMARGVE TVKQKGRDVY
ESAVNAAEDE GLTREKMGDQ VRDLGDRARK VAEAAVSTFE SPSQNKH