Gene RPC_4734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4734 
Symbol 
ID3972710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5298595 
End bp5299908 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content65% 
IMG OID637927846 
Productprotein of unknown function DUF224, cysteine-rich region 
Protein accessionYP_534575 
Protein GI90426205 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.960339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.585687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG AGTTCAACCT GGCGCAGCTG GCTGATCCGG ACATCGCGGT GGCCGACGGC 
ATCCTGCGCG CCTGCGTGCA TTGCGGCTTC TGCACCGCGA CCTGTCCGAC CTATGTGCTG
CTCGGCGACG AGCTCGACAG TCCGCGCGGC CGCATCGTGC TGATCAAGGA GATGCTGGAG
AAGAACGCGC CGCCGACCGC CGAGGTGGTG AAACATATCG ATCGTTGCCT GTCCTGCCTG
GCCTGCATGA CCACCTGCCC GTCCGGCGTG AACTACATGC ATCTGGTCGA TCAGGCGCGC
GCCCGGATCG AAAAGGACTA CACCCGGCCA TGGCCGGATC GGCTGGTGCG CGCCGCTTTG
GCCTGGCTGA TGCCGCGGCC GGCGCTGTTT CGGCTCGGCA TGGTGGTGGG GCGTTTGGTG
CGCCCGCTGG TCAATCTGCT GCCCCTGCCG CACGATCTGC GCAAGCCGAC ATTGCTGAAC
CGGGTCAAGG CGATGCTGGC GCTGGCGCCG GCTCATCTGC CGCCCGCAGG GCCGTCCTCG
GGCACTGTGT TTCCGGCGCT CGGACCGCGG CGGGGCCGCG TCGCGCTGCT GCACGGCTGC
GCCCAGCAGG TGCTGGCGCC GCGCATCAAC ACCGCGGCGA TCCGGCTGCT CACCCGCCAC
GGCATCGAGG TGGTGCTGGT GCCCGACGAG CAGTGCTGCG GCGCCTTGAT CCATCATCTC
GGCCGCGACG TGCAGACGCT GGCCTATGCG CGGGCCAACA TCACAGTCTG GATGAAAGAG
GCCGAGCGCG GCGGGCTCGA CGCCATTCTG GTGACGACGT CGGGCTGCGG CACGGTGATC
AAGGACTACG GCTTCATGCT GCGCGAAGAC GCCGAATTCG CCGCATCGGC TGCCAAGGTG
TCGGCACTCG CGCAAGATAT CAGCGAATAT GTCGCCGCGC TCGAATTGCA AAACCCCCTG
CAGCGCAGCG ATCTTGTGAT CGCGTACCAC TCCGCCTGCT CGTTGCAGCA CGGCCAGAAA
ATCCGCGAGC TTCCCAAAGA ATTGCTTTCC AAGTCCGGAT TCGTGGTGAA AGATGTGCCG
GAGAGTCATC TGTGTTGTGG TTCGGCGGGC ACGTACAACA TTCTCCAGCC TGACATTGCG
GGAAAACTTC GCGACCGAAA GGTCGCCAAC ATCGCAACCG TCAAGCCGGA CATGATCGCC
GCGGGCAATA TCGGCTGCAT GGTGCAGATT GCCAGCGGAA CGTCGGTCCC TGTGGTGCAC
ACGATTGAAC TTCTCGATTG GGCGACCGGC GGCCCGCGTC CAGGATTGAA CTGA
 
Protein sequence
MKTEFNLAQL ADPDIAVADG ILRACVHCGF CTATCPTYVL LGDELDSPRG RIVLIKEMLE 
KNAPPTAEVV KHIDRCLSCL ACMTTCPSGV NYMHLVDQAR ARIEKDYTRP WPDRLVRAAL
AWLMPRPALF RLGMVVGRLV RPLVNLLPLP HDLRKPTLLN RVKAMLALAP AHLPPAGPSS
GTVFPALGPR RGRVALLHGC AQQVLAPRIN TAAIRLLTRH GIEVVLVPDE QCCGALIHHL
GRDVQTLAYA RANITVWMKE AERGGLDAIL VTTSGCGTVI KDYGFMLRED AEFAASAAKV
SALAQDISEY VAALELQNPL QRSDLVIAYH SACSLQHGQK IRELPKELLS KSGFVVKDVP
ESHLCCGSAG TYNILQPDIA GKLRDRKVAN IATVKPDMIA AGNIGCMVQI ASGTSVPVVH
TIELLDWATG GPRPGLN