Gene RPB_4222 details


Gene Information

Locus tag: RPB_4222
Symbol:
ID: 3912030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4797019
End bp: 4798125
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 69%
IMG OID: 637886125
Product: secretion protein HlyD
Protein accession: YP_487824
Protein GI: 86751328
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.163749
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGCGC ATCCGATGTT CGATGGATTG TTCCGGCTGT TCGCGGGCTT CATGCTCGCG 
CTTGTGGCGC TCGCTCTGGC CGGATGTGAT GAGCAGAAGG CCACCCGGGA TCGTCCAGCG
CGGCCTGTTC TGGTCGACAC TGTCTACTAT GCCGCCGCTT CGCCGAAGCG CAGCTTTGTC
GGAACGATTC GGCCGCGGAT CGAAACCGAT ATCGGCTTTC GGGTCGCCGG CAAGGTCGCT
CAGCGCCTCG TCGAGGTCGG CCAGACGGTG GAGCCCGGCC AGACGCTGGC GTTGCTCGAC
CAGACCGATC TCAAATTGCA AGCCGAGCAG GCGGACGCCG AGCACAGCGC GGCCAAGGGC
GTGCTGGCGC AGGCGAGGGC CTCGGAAAAT CGCGCCAAGG AGCTCCGCGC CAAAGGCTGG
GCGACCGAGG CTCAGATGGA CCAGGCGCGC GCGGCCGCCG ACGAGGCGCG CGCCCGCTTT
GCCCGCGCCG AGCGCTCGGT CGACCTGACC CGCAACGCGC TGTCCTACGC CAGCCTGGTG
GCCGACACCC GTGGCGTCGT CACCGCGGCG CTGATCGACT CCGGACAGGT GGTCTCGGCG
GGGCAGGCGG CCTTCCGCAT CGCGCGGTTC GGCGAGAAGG AGGCGGTGGT GGCGATCCCG
GAGACGCTGA TCACCCGGGC GACCCAGGGC GTCGCCAGCG TTTCGCTGTG GTCGGAGCCG
GGTCGCAGCT ATGCGGCGAA GCTGCGGGAG GTCGCGCCGA TGGCCGATCC CGCGACCCGC
ACGTTTCTCG CCAAGTTCAC CCTGCCCGAT GCCGACGACC GCGTCGTGCT CGGGATGACC
GCGACGCTGA CGCTGGCCGA TCCGGACACG CTCCGCGTCG CCCGGGTGCC GCTGTCGGCG
CTGTTCAATC AGGGTGGCAC GCCGTCGGTC TATGTCGTGG ACTCCGCCGG ACAGGTGACG
CTGAAGCCGG TGACGGTGAA GGCCTATGAG ACCGAGAAGG TCGTGATCGG CGGCGGCGTC
GAGGACGGCG CCAAGGTGGT CGTGCTCGGG GTGCAGAAGC TCGATCCGGC CGAGAAGGTT
CGGGTCGTGT CGTCGCTGTC GTTCTAG
 
Protein sequence
MFAHPMFDGL FRLFAGFMLA LVALALAGCD EQKATRDRPA RPVLVDTVYY AAASPKRSFV 
GTIRPRIETD IGFRVAGKVA QRLVEVGQTV EPGQTLALLD QTDLKLQAEQ ADAEHSAAKG
VLAQARASEN RAKELRAKGW ATEAQMDQAR AAADEARARF ARAERSVDLT RNALSYASLV
ADTRGVVTAA LIDSGQVVSA GQAAFRIARF GEKEAVVAIP ETLITRATQG VASVSLWSEP
GRSYAAKLRE VAPMADPATR TFLAKFTLPD ADDRVVLGMT ATLTLADPDT LRVARVPLSA
LFNQGGTPSV YVVDSAGQVT LKPVTVKAYE TEKVVIGGGV EDGAKVVVLG VQKLDPAEKV
RVVSSLSF
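
The record's fields can be cross-checked against each other: the coordinates should span the stated gene length, and the gene sequence should translate to the listed protein. The following is a minimal Python sketch of that check, not part of the original record. The names `translate`, `clean`, and `CODON_TABLE` are illustrative, and the sketch assumes the sequence is given 5' to 3' in the coding frame, as printed above. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Amino-acid assignments are those of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Values copied from the record above.
START_BP, END_BP = 4797019, 4798125
GENE_LENGTH = END_BP - START_BP + 1      # inclusive coordinates
PROTEIN_LENGTH = GENE_LENGTH // 3 - 1    # minus one codon for the stop

# First and last rows of the gene sequence as printed in the record.
first_row = "ATGTTTGCGC ATCCGATGTT CGATGGATTG TTCCGGCTGT TCGCGGGCTT CATGCTCGCG"
last_row = "CGGGTCGTGT CGTCGCTGTC GTTCTAG"

print(GENE_LENGTH, PROTEIN_LENGTH)  # 1107 368
print(translate(first_row))         # MFAHPMFDGLFRLFAGFMLA
print(translate(last_row))          # RVVSSLSF
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence extends the same check to the whole record.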