Gene RPB_3544 details

Gene Information

Locus tag: RPB_3544
Symbol:
ID: 3911346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4054687
End bp: 4056042
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 70%
IMG OID: 637885446
Product: parallel beta-helix repeat-containing protein
Protein accession: YP_487150
Protein GI: 86750654
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
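
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4054687, 4056042)
print(length)                  # 1356
print(protein_length(length))  # 451
```

Both results agree with the Gene Length and Protein Length fields listed above.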


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.502732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.506953
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTCG ACCGGCGCCG TTTGATCGGA CTGACAGCCG GCGCGCTGGC GCTGTCGGCG 
GCGCCACTCG CCGCGCAGCC GCTCACCTCG CAACGCGGCC GCGACGCCAC GCAGGCCGGC
CTTCGCCCCG ACAGTTCCGA CGACCAGACC GCGGCCCTGC AGCGCGCGAT CGACAGCGCC
GCCCGCGCCC GCGTCCCGCT GGCGCTGCCG CCGGGCCATT ATCGCACCGG CCCACTTCGG
TTGCCGTCGG GCGCGCAGCT CAGCGGCGTG CGCGGCGCGA CGCGACTCGT CTTCACCGGC
GGCGTCTCGC TGTTCGACAG CGCCGGCGCC GAGACGCCGA CGCTCAGCGG CCTCGTCCTC
GACGGTGGCG CCATCCCGCT GCCGGCGCGG CGCGGCCTGG TGCATTGCGT CGGCGCGCGC
GACCTGCGGA TCACCGATTG CGAGATCACC GCCAGCGGCG GCTGCGGCGT CTGGCTGGAA
ACCACCTCGG GCATGATCAG CGACAACACG CTGACCGCGA TCGCGGTCAC CGGCGTGGTG
TCGTTCGACG CCAAGGGGCT GAGCGTGACG CGCAACACCA TCATCGGCGC CAACTCAAAC
GGCGTCGAGA TCCTGCGCAC CTCGATCGGC GACGACGGCA CGCTGGTCAC CGGCAACCGG
ATCGAGAACA TCAAGGCCGG CCCCGGCGGC TCGGGGCAGT ACGGCAACGC CATCAACGCG
TTTCGCGCCG GCAACGTCAT CGTCAGCGGC AACCGGATCA GGAACTGCGA TTACTCCGCG
GTGCGCGGCA ATTCGGCGTC GAACATCCAC ATCACCGACA ACAGCGTCAG CGACGTGCGC
GAGGTCGCGC TGTATTCGGA ATTCGCGTTC GAAGGTGCGG TGATCTCGGG CAACACCGTC
GACGGCGCAG CGCTCGGCGT CTCGGTGTGC AACTTCAACG AGGGCGGCAG GCTCTCGGTC
GTGCAGGGCA ACATCATCCG CAACCTCAAG CCGAAGCGGC CGATCGGCAC CGCGCCGGAC
GACGACGCCG GCATCGGCAT CTATGTCGAG GCCGACACCG CAGTGACCGG CAATGTGATC
GAGAACGCAC CGTCGTTCGG CATCGTCGCC GGATGGGGCA AGTACCTGCG CGATGTCGCC
ATCACCGGCA ATGTCGTGCG CAGGGCGTTC ATCGGCATCG GCGTCTCGGT GATGGACGGC
GCCGGCACGG CCGCGATCAA CGGCAACGTC ATCGCCGAAG CGCCGCGCGG CGCCGTGGTC
GGGCTCGACC ACGCCCGCCC GGTGACGCCG GACCTCACCG CGCCCGGCGC AGCCAAATTC
GCGCAGATCG CCCTCGGCAG CAACTCGGTG CGGTGA
 
Protein sequence
MTLDRRRLIG LTAGALALSA APLAAQPLTS QRGRDATQAG LRPDSSDDQT AALQRAIDSA 
ARARVPLALP PGHYRTGPLR LPSGAQLSGV RGATRLVFTG GVSLFDSAGA ETPTLSGLVL
DGGAIPLPAR RGLVHCVGAR DLRITDCEIT ASGGCGVWLE TTSGMISDNT LTAIAVTGVV
SFDAKGLSVT RNTIIGANSN GVEILRTSIG DDGTLVTGNR IENIKAGPGG SGQYGNAINA
FRAGNVIVSG NRIRNCDYSA VRGNSASNIH ITDNSVSDVR EVALYSEFAF EGAVISGNTV
DGAALGVSVC NFNEGGRLSV VQGNIIRNLK PKRPIGTAPD DDAGIGIYVE ADTAVTGNVI
ENAPSFGIVA GWGKYLRDVA ITGNVVRRAF IGIGVSVMDG AGTAAINGNV IAEAPRGAVV
GLDHARPVTP DLTAPGAAKF AQIALGSNSV R
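
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a listing. The helper name `gc_fraction` is hypothetical, and the record does not state how its own GC content figure was derived or rounded.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    # Drop spaces and line breaks used for block formatting, normalise case.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three blocks of the gene sequence, copied from the listing above.
fragment = "ATGACTCTCG ACCGGCGCCG TTTGATCGGA"
print(f"{gc_fraction(fragment):.2%}")
```

Passing the full gene sequence as a single string, with its line breaks left in, works the same way because all whitespace is discarded first.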