Gene RPC_4271 details

Gene Information

Locus tag: RPC_4271
Symbol:
ID: 3971694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4756236
End bp: 4757282
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 68%
IMG OID: 637927375
Product: hypothetical protein
Protein accession: YP_534114
Protein GI: 90425744
COG category: [S] Function unknown
COG ID: [COG3768] Predicted membrane protein
TIGRFAM ID: [TIGR01620] conserved hypothetical protein, TIGR01620
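
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that check, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed 1047 bp) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4756236
END_BP = 4757282


def gene_length_bp(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Both values agree with the Gene Length and Protein Length fields of the record.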


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.714598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.990265
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGC GGTTTCAAAG TCGCCGGCCG GCCACCTTCA AGCTCGGCGA TCCCAGCGTC 
GTCGTGACCG ATCCGGTCGA GGACGCGAGT CGCCCGCAGC CACCGCGCGG CCGGGTGCAG
ATCACGCCGG AAAACGACCA GGTCGCGGCC GAGACGGTGC CGGCGGAAGC CGCCCCGGTG
CTGGCTGTAC GCAAGGGATT CCGCTGGGCG ACGCTGTTCT GGAGCGCGGT CAGCGGCCTG
GTTTCGCTCG CCGCCTGGCT CGCGGTGACG CGGCTGATCG AAGACCTGTT CGCGCACAAT
AAGACGCTCG GCAGCATCGC GCTGGTGCTG GCGGTGATCG CCGGCGTCGC CTTGGCGGTG
ATCGTCCTTC GTGAAGTCAC GAGCCTGTTG CGGATGGGTG CGATTGAAAA GCTGCACCGC
CGCGCGCTGG TGGTGCTGGA GACCGACAAC CGCGCCGAAG GCCGCGCCAT CGTCAAGGAA
CTGCTGACGC TGGAGAACCA GAACCCGCAG CTGGCCCGCG CCCGCACCAC GCTCAACACG
CATCTCAACG ACATCATCGA CGGCGCCGAT CTGATCCGGC TCGCCGAGCG CGAACTGCTC
GAGCCGCTCG ATCAGCAGGC CCGCAACCTG GTCTCTACCG CCGCGCAGCG CGTCTCGCTG
GTCACCGCGA TCAGCCCGAA GGCGCTGATC GACATCCTGT TCGTGTTCAT CGCCGCGTTG
CGGCTGGTGC GGCAGCTGGC GCGGCTCTAT GGCGGCCGCC CCGGCACCAT CGGCATGATG
CGGCTGATGC GGCAGGTGAT CGCGCATCTC GCCATCACCG GCGGCGTCGC AATCGGCGAC
AGCGTGGTGC AGCAGGTGCT GGGTCACGGC ATCGCCGCCA AACTGTCCTC CAAGCTCGGC
GAAGGCGTGC TCAACGGCAT GCTGACCGCC AGGCTCGGGC TCGCGGCGAT CGACCTGACG
CGGCCGCTGC CGTTCATCGC GGTGCCGCGA CCGGCGCTCG GCGATCTGGT CAAGGACCTG
ATGCGCAAGC GTGAGAAAGA CGAATAG
 
Protein sequence
MTERFQSRRP ATFKLGDPSV VVTDPVEDAS RPQPPRGRVQ ITPENDQVAA ETVPAEAAPV 
LAVRKGFRWA TLFWSAVSGL VSLAAWLAVT RLIEDLFAHN KTLGSIALVL AVIAGVALAV
IVLREVTSLL RMGAIEKLHR RALVVLETDN RAEGRAIVKE LLTLENQNPQ LARARTTLNT
HLNDIIDGAD LIRLAERELL EPLDQQARNL VSTAAQRVSL VTAISPKALI DILFVFIAAL
RLVRQLARLY GGRPGTIGMM RLMRQVIAHL AITGGVAIGD SVVQQVLGHG IAAKLSSKLG
EGVLNGMLTA RLGLAAIDLT RPLPFIAVPR PALGDLVKDL MRKREKDE
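
The GC content and the protein sequence can in principle be recomputed from the gene sequence. The following is a minimal sketch of both calculations, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACGGAGC GGTTTCAAAG TCGCCGGCCG"))  # MTERFQSRRP
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.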