Gene RPD_3243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3243 
Symbol 
ID4023752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3598860 
End bp3600041 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content72% 
IMG OID637963447 
Producthypothetical protein 
Protein accessionYP_570369 
Protein GI91977710 
COG category[S] Function unknown 
COG ID[COG5330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.399185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.111483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATGA TTGTTCGGCA GTTCATCAGT TGGATCCGTC ACGCGCCCGC GGGCGAGCGC 
GCTGAAGCCA CCCGGGCGCT GGCGCGGGCC TGGCTGATCT CCGATCTTTC GCCCGAGGAT
CGCTCCGCCG CCGAAGGCGC GCTGCTGATG ATGCTGGATG ACGTTTCGCC GCTGGTGCGC
CAGGCGATGG CGCAGGTGTT TTCGCGCAGC GCGCGCGCCC CCGCCGCGAT CGTTCGTGCG
CTCGCCGCCG ATCAGCCCTC GGTGGCGCTG CCGGTGCTTG AATTCTCGCC GCTGCTGATC
GACGCCGATC TGGTCGACAT CGTCGCCACC GGCGACGACG CGGTGCAATG CGCGATCGCG
CGGCGGGTGC GGCTGCCCGC GGCGGTGTGC GCTGCGATCG CCGAGGTCGG CTCGCCGGCC
GCGGCGCTGG AACTGATCGA AAATCCCCGC GCCGAACTCG CGCCGTTCTC GTGGGACCGT
ATCGTCGAGC GCCACGGCCA TCTCGCCGCG ATCCGCGAGT CGATGTTGGT GCTGGAGGAT
CTGCCGGCGG CGACGCGGCT GGCGCTGATC GCCAAGCTCT CCGAGACGCT GCAGCAATTC
GTGGTGGCGC GGCGCTGGCT GTCGGCGGAT CGCGCCGCGC GCGCGGTGGG CGAGGCGCTG
GATCGCTCGA CCGTGACGGT CGCGGCGCGC TCGCGCGGCG ACGACATGCG CGCGCTGATG
CAGCATCTGC GCGCCACCTC GCAGCTCACC GCCGGGCTGA TCCTGCGCGC GCTGCTGTCC
GGCAATCGCG AATTGTTCGA ATACGCGCTG GTCGAACTGT CCGGCCTGCC GCAGGCGCGG
GTGGCGTCGC TGCTGCACGA ATGCGGCGGC CCGAGCCTGA ATGCGCTGCT GGCGCGCGCC
GGCCTGCCGC AGTCGACCTT CGCCGCGTTC CGCGCCGCGC TGGAGGCGCA GCACGAGATC
GGCTTCGTCG GCAGCGAGGG CGGCGCGGTG CGGCTGCGCC GGCTGATGGT CGAGCGGGTG
CTGACGCGCT GCGAGACCGG CGCCGACGCG GCGGCGCCGC TGCTGATCCT GCTGCGCCGC
TTCGCCACCG AATCGGCCCG CGAGGAAGCC AGCGCGTTCT GCGGCGAACT GATCGCCGAG
CACGACGAGC GCGCCTGGCA GGACGAGCCG ATCGCCGCGT AG
 
Protein sequence
MPMIVRQFIS WIRHAPAGER AEATRALARA WLISDLSPED RSAAEGALLM MLDDVSPLVR 
QAMAQVFSRS ARAPAAIVRA LAADQPSVAL PVLEFSPLLI DADLVDIVAT GDDAVQCAIA
RRVRLPAAVC AAIAEVGSPA AALELIENPR AELAPFSWDR IVERHGHLAA IRESMLVLED
LPAATRLALI AKLSETLQQF VVARRWLSAD RAARAVGEAL DRSTVTVAAR SRGDDMRALM
QHLRATSQLT AGLILRALLS GNRELFEYAL VELSGLPQAR VASLLHECGG PSLNALLARA
GLPQSTFAAF RAALEAQHEI GFVGSEGGAV RLRRLMVERV LTRCETGADA AAPLLILLRR
FATESAREEA SAFCGELIAE HDERAWQDEP IAA