Gene RPB_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3472 
Symbol 
ID3911274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3976757 
End bp3978052 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content72% 
IMG OID637885375 
Producthypothetical protein 
Protein accessionYP_487079 
Protein GI86750583 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGG CGTTTCTGCG GCAGGCGGGG ACGCGGACGC TGGCCCGGCT GCAGCATGAT 
TTCGCGACCT TCGCGCATCC GCATCAGGAG CATGGCGAGG CCGGCAACAA TGGCGGGCCG
TGGACGACCT GGCTGCTGCT CGGCGGCCGC GGCGCCGGCA AGACGCGGAC CGGGGCCGAA
TGGGTGCGGG CGCTGGCGCA TGGCACGCCG CCTTATGCCG AGCGGCCGCA TCGCCGGATC
GCGCTGATCG GGGAGAGCTG GCAGGACGCC CGCGAGGTGA TGGTGGAGGG CGAGTCCGGC
CTGCTGCGCT GTTCGCCGCG CGCCGAGCGG CCGGAATGGA TCGCGTCGCG GCGGCGGCTG
GAATGGCGCA ATGGCGCGGT CGGGCAGGTG TTCTCCGCCG ACGATCCGGA AAGCCTGCGC
GGGCCGCAAT TCGACGCCGC GTGGTGCGAC GAGCTGGCGA AATGGCGCTA TGCCGAGGCC
TGCTTCGACA CGCTGCAATT CGGGCTGCGG CTGGGCCTGC AGCCGCGCCA ACTGGTCACC
ACCACGCCGC GGCCGCTGCC GCTGATCAAG CGGCTGCTGG CCGATCCGCG CACGCGGGTG
ACGCGCGCGC CGACGAAGGC GAATGCCGAT CATCTGTCGC CGGCGTTTCT CGATGCCGTG
GTCGGCCGCT ATGCCGGCAC GCGGATGGGG CGGCAGGAAC TCGACGGCGA AATGATCGAG
GACCGCGCCG ATGCGCTGTG GTCGCGGGCG CTGATCGAAT CCTGCCGCGT CGCGCAGCCG
CCGGGGCTGG CGCGCGTGGT GGTGGCGATC GACCCGCCGG GGACCTCGAA GGTCGGCGCG
GATGCCTGCG GCATCGTCGC CGTCGGCCGC AGCGACAGCG GCGCCTACTA CGTGCTGGAA
GACGCCTCCG CCGCCGGGCT GTCGCCGGCC GCCTGGGCGG CCAAGGCGGT GGCGCTGTAT
CACCGGCTCG ACGCCGATAC GCTGATCGCC GAAGTCAACA TGGGCGGCGA GATGGTGCGC
GCCGTGCTGC GCGAGACCGA CGGGGCGGTG CCGCTGAAGG AAGTCCGCGC CAGCCGCGGC
AAATATCTGC GCGCCGAGCC GGTCGCGGCG CTGTACGAGC AAGGCAAGGT CAAGCATGTC
GGCTGCTTCC CGCTGCTCGA AGACGAAATG TGCGACTTCG GCATCGACGG CCTCTCGTCG
GGCCGCTCGC CCGACCGGCT CGACGCCCTG GTGTGGGCGA TCACCGGGCT GATGAACGGC
CGCAATGCCG GCGGGCCGCG GATCAGGCAG TTGTGA
 
Protein sequence
MTEAFLRQAG TRTLARLQHD FATFAHPHQE HGEAGNNGGP WTTWLLLGGR GAGKTRTGAE 
WVRALAHGTP PYAERPHRRI ALIGESWQDA REVMVEGESG LLRCSPRAER PEWIASRRRL
EWRNGAVGQV FSADDPESLR GPQFDAAWCD ELAKWRYAEA CFDTLQFGLR LGLQPRQLVT
TTPRPLPLIK RLLADPRTRV TRAPTKANAD HLSPAFLDAV VGRYAGTRMG RQELDGEMIE
DRADALWSRA LIESCRVAQP PGLARVVVAI DPPGTSKVGA DACGIVAVGR SDSGAYYVLE
DASAAGLSPA AWAAKAVALY HRLDADTLIA EVNMGGEMVR AVLRETDGAV PLKEVRASRG
KYLRAEPVAA LYEQGKVKHV GCFPLLEDEM CDFGIDGLSS GRSPDRLDAL VWAITGLMNG
RNAGGPRIRQ L