Gene RPC_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4229 
Symbol 
ID3972952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4705787 
End bp4707955 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content56% 
IMG OID637927332 
Producthypothetical protein 
Protein accessionYP_534072 
Protein GI90425702 
COG category[S] Function unknown 
COG ID[COG4694] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0264095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCGG ATCCCAATAC CGGATACTTG CCTGCAGCGT GCGAGTTCGA GTTCACTACA 
GACAGCGGCG AGACGGTGCG AGCCGGCACT CTAAACGCTT TGAATAGCCG TCTGGCGGTT
TTCAATGAGG ACTTCATTGA TCTAAACCTG CAGTGGTCGG CCGGCAAGGC GCGCCCTGTG
TTCTATATCG GCAAAGAACA AGCCGAAGCA GCTGCGAATC TGAAAAGGAC TGAAGCAGCT
ATACCGGCGG CGACTGAGCG AAAGGCAGGT GCCGAGAAGC TTGTCAGGGC CGGGGAACAG
CAAGTTGCTA CATTCAAGCG TGAAGTGGCT CGGAAGATCT CTCAGGAAAT CAGACCGGCA
AATCGAAAAT ATGAGGCGCC CCAGCTCACC TCCGATTATG CTGCACTTGA TTTGGATGCG
GGAGCAACAC TCAACGATAC ACAACTGGCG GCGCAGCGGG AGTTGTGTCG GCGAGATGAG
GCAATGCCGG AACTCGCGAA TGTAGACTTC GATACCGCAC CGGCAAACGT CGCATTGCAA
ACGGCGATCC GCCTGCTGCA GGAGACTCCG AGCCTCACAA TAATTCCTGA ACTGGAGACG
CACTCGGAAA TGTTGCGGTG GGTTCAAGAG GGGGTAGACT ATCACGAAAA CCACGGACTC
AATGAGTGTC TATTGTGCGG CGGTATTCTA TCGTCCGAGC GCAAAGATCT GTTGGCGAGC
GCGTTAGACG AGGGTTTTGA GAAATTTCAA GAGGCGCTCG ACCTCGCAAA GGAGAAACTA
GAGGCCGAGC GGGAACGCTA TCAGAAAATA GCGTTGTCGA TTTCATCATC CAGGGACGTC
GTGGCCGATC GGCGCGAAGA ATACGACCAA GCCGCCGGTC GGGTGAAGGA GGAGCTCGCC
GATTCCATTG CGAAGGTACT TGAACCGGCG ATTGCGGCGA TCGACGCGAA GTTGGCGAGG
CCGACTCTCA AGCCTGACGC TAAGGCGATG ACGTGGGGTG ATTCGCGGCT GACCGACGAA
GCCTTTGCGC GGATCGAATC CTGGGGAGGG GTATTGAATC TTAATATCGC AAGACACAAC
GAGGCGTGCG CCAGCTTTTC TCAACGTCAG GAGGACGCGC GCCTTGCCCT GCGTAGGCAC
CATTTGGCCG AGAATCACCA GGAGTATGCG AGCGTTGGCG AGGCGGTGGC CAGTGCGGAG
AATGAACTCA AGGCCGCGAC TAGCGAACTC CAGGCGCTTC AGGCCGAAGC GCGCGATCTG
AGTGTGCGTG TGAAGGAACA CGGGCAAGCC GCCGCGAAAA TCAACCGGCT GATCGAAGCG
TATCTCGGCC ACAAGGAACT CTCGATTGTC AGCGTTGATA AGGGCTACGA AATCCACCGG
CGTGGTCGAC CCATCGAGGG TAGCCCCAGC GAGGGGGAAA AAACAGCGAT AGCCATATGT
TACTTCCTGT CGACGCTGGA AGCTGACGGG AAATCTATAA AGGACAGCAT CGTCGTGGTG
GATGATCCAA TATCGAGTCT CGACAGCAAG GCTCTCAACT ACGCCTGCTC CTTGCTCTTA
AGTCGGCTTG CGGATGCATC CCAGGTTTTC GTGCTGACCC ACAATCAGAA CTGTATGAAC
GAGTTTAAGA AGGCATGGAG CCGCTTCCAT CGACCGCGCA AAGAAAATGC CACGCCGACC
GCCAGTCTTC TTTTCCTGGA CGTCAAAATT CCCCACGGAG GAGAGACGCG CACTACCACG
ATCGTGGAAA TGTCGCGATT GCTTCGTGAG TACGACTCCG AATATCACTA TCTCGTCGAC
CACGTTCTCA AGTTCCATGC GAGTAACGAC TCGGAGTATG AGTACGCCTA TATGATTCCG
AACGTCCTTC GGAGGGTGCT CGACGTATTC CTCGCGTTCC GCTGCCCGGG CAGCGCTGGC
TTCGCGGGCA AGATGGAACA GCTTGGGAAG GACTACGAAG CGCTCGACGG CGAGCGGGTG
GCAGCGTTAC AGCGACTTGT TCAGCTAGAA TCCCATTCGG ATAGCGTTGA CGATCTCATT
GGATTTTCGT CGATGACGCT CGAGGAAAGT AAGTCCGCGG CTGCGGCGCT GGTCGGCTTG
ATGGAAATTG TGGATCCGAA TCACCTGGCC GCGTTGCGTC GCCTGTGCCG ACCCGGCAAA
GGTGGGTAG
 
Protein sequence
MQPDPNTGYL PAACEFEFTT DSGETVRAGT LNALNSRLAV FNEDFIDLNL QWSAGKARPV 
FYIGKEQAEA AANLKRTEAA IPAATERKAG AEKLVRAGEQ QVATFKREVA RKISQEIRPA
NRKYEAPQLT SDYAALDLDA GATLNDTQLA AQRELCRRDE AMPELANVDF DTAPANVALQ
TAIRLLQETP SLTIIPELET HSEMLRWVQE GVDYHENHGL NECLLCGGIL SSERKDLLAS
ALDEGFEKFQ EALDLAKEKL EAERERYQKI ALSISSSRDV VADRREEYDQ AAGRVKEELA
DSIAKVLEPA IAAIDAKLAR PTLKPDAKAM TWGDSRLTDE AFARIESWGG VLNLNIARHN
EACASFSQRQ EDARLALRRH HLAENHQEYA SVGEAVASAE NELKAATSEL QALQAEARDL
SVRVKEHGQA AAKINRLIEA YLGHKELSIV SVDKGYEIHR RGRPIEGSPS EGEKTAIAIC
YFLSTLEADG KSIKDSIVVV DDPISSLDSK ALNYACSLLL SRLADASQVF VLTHNQNCMN
EFKKAWSRFH RPRKENATPT ASLLFLDVKI PHGGETRTTT IVEMSRLLRE YDSEYHYLVD
HVLKFHASND SEYEYAYMIP NVLRRVLDVF LAFRCPGSAG FAGKMEQLGK DYEALDGERV
AALQRLVQLE SHSDSVDDLI GFSSMTLEES KSAAAALVGL MEIVDPNHLA ALRRLCRPGK
GG