Gene RPB_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3942 
Symbol 
ID3911749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4498749 
End bp4499852 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content69% 
IMG OID637885846 
Producthypothetical protein 
Protein accessionYP_487546 
Protein GI86751050 
COG category[S] Function unknown 
COG ID[COG3768] Predicted membrane protein 
TIGRFAM ID[TIGR01620] conserved hypothetical protein, TIGR01620 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.32994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC GGGTGCCTCC CCGAAGGCCG GCGACCTTCA AGCTCACCGA TCCGAGCGTT 
GTGCTGATCG ATTCCGACGA GAGCGGCAGC AGCCGCCCCG CCGCGCAGGC GAGCGCGACG
GCGTCGTCGA CGGCGTCAGC GGCGTCGTCG GCCTCGGCGC CGCCGCCGCG CGCGCGGATC
CAGCTCGCCA AGGAAGCCGA GCCGCCGATC GTCGCGCCGA AACCGCCGGC CAGCGTCATC
AATCCGAAGA AGGGTTTCCG CTGGGGCACG CTGTTCTGGT CAGCCGCGGG CGGGCTGGTG
ACGCTGGCGT TCTGGCTGTG GGTCAGCAAG CTGATCGAGG ATCTGTTCGC GCAGAGCCAG
ACGCTCGGCA CCGTCGGCAT GGTGCTGGCG CTGCTCGCCG CCGGCTCGCT CGCGATCATC
GTCGGCCGCG AGGCATTCGG GTTGATCCGG CTGGCGCGGA TCGAGCAGTT GCACGCCCGG
GCCGCGAAGG TGCTCGAAAC CGACAACCGC GCCGAGGCGC AGGCGATCAT CCGCGAATTG
CTGAAATTCG AGCACCCCAA TCCGCAACTC GCGCGCGGCC GTGCGACGCT GGAAAAGCAC
ATCGACGACA TCATCGACGG CGCCGATCTG ATCAGGCTCG CCGAGCGCGA ATTGATGACG
CCGCTCGACC TCGAAGCGCG GGTGATGATC TCGAAGGCGG CGCAGCGCGT CTCGCTGGTG
ACTGCGATCA GCCCGAAGGC GCTGATCGAC GTGCTGTTCG TGGCGATCGT CGCCACCCGG
CTGATCGGCC AGCTCGCGCG GCTGTACGGC GGCCGCCCCG GCGCGCTCGG CATGTTCAAG
CTGATGCGGC AGACGATATC GCACCTCGCC GTCACCGGCG GCATCGCGCT CAGCGACAGC
GTGATGCAAT CCGTGCTCGG CCACGGCCTC GCCTCGCGGC TGTCGGCCAA GCTCGGCGAA
GGCGTCGTCA ACGGCATGCT GACCGCCCGC CTCGGCCTCG CCGCGATCGA CCTGACGCGG
CCGCTCCCGT TCGACGCCCT GTCCCGCCCG GTGCTCGGCG ATCTGGTCAA GGATCTGCTG
AAGAAGCGCG AGAAGGACGA GTAG
 
Protein sequence
MTERVPPRRP ATFKLTDPSV VLIDSDESGS SRPAAQASAT ASSTASAASS ASAPPPRARI 
QLAKEAEPPI VAPKPPASVI NPKKGFRWGT LFWSAAGGLV TLAFWLWVSK LIEDLFAQSQ
TLGTVGMVLA LLAAGSLAII VGREAFGLIR LARIEQLHAR AAKVLETDNR AEAQAIIREL
LKFEHPNPQL ARGRATLEKH IDDIIDGADL IRLAERELMT PLDLEARVMI SKAAQRVSLV
TAISPKALID VLFVAIVATR LIGQLARLYG GRPGALGMFK LMRQTISHLA VTGGIALSDS
VMQSVLGHGL ASRLSAKLGE GVVNGMLTAR LGLAAIDLTR PLPFDALSRP VLGDLVKDLL
KKREKDE