Gene RPB_3156 details


Gene Information

Locus tag: RPB_3156
Symbol:
ID: 3910957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3611954
End bp: 3613282
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 64%
IMG OID: 637885058
Product: hypothetical protein
Protein accession: YP_486763
Protein GI: 86750267
COG category:
COG ID:
TIGRFAM ID:
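
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions, although they agree with the values listed here. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3611954, 3613282)
print(length, protein_length_aa(length))  # 1329 442
```

Under these assumptions, 1329 bp corresponds to 443 codons, one of which is the stop codon, which gives the 442 aa reported for the protein.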


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.469309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTTTC GGGCAATCAT CTTCGGCTCG CTGGCGGCGA TCGGCGTCGC CGTGTCGGGC 
GCCGCGGCGC AGCAAAGCGG GCCGGTTCCG GTCTCCGCCA CGCAGCCGGA CTACGACGCG
CTGTTCGCGC AGATGTATCA GCGGCCGGCC GATCTCGACG TCAGCTTCAA ATTCGCGCGT
GAGGCGGTGG AGCGCGGCGA CTACGAGGCG GCGATCGGCG CGCTGGAGCG GATGCTGTTC
TTCAATCCGG ATCTGCCGCG GGTGAAGCTC GAACTCGGCG TGTTGTACTT CAAGCTCGCC
TCCTACGAGA TCGCGCGCGG CTATCTGATC GACGCGATCA AGGGCAGCAA CGCGCCCGAC
GACATCCGCG CGCAGGTGAT GGCCTATCTC GCCGAGATCG ATCGACGGCT GGCGACATAC
GAATACAGCG TGTTCATGCA CGCCGGCATG CGCTATCAGA CCAACGCCAA TATCGGCCCG
AACGGCCAGC AGGTGCGCGC CCTCGGACAG GACGCCATTC TCGATCCGCG CTTCGGCAAA
AAGCCGGACT GGAGCACGTT CCAGACCGTC GCGGCGAGCT ACGCCTACAA GCTCAATCTG
CGCGGCGACG CGATCGAGAC CACGTTCCTC GGCCTGAATT CCCGGCAGAT GACGCTGAGC
CAGTTCAATC TCGGCTTCGT CGAGATCACC GCCGGCCATC GCGTCGGTCT CGGCCAGAAT
TCGTCTTTCA AATATTACGG CATCGGCGAC AAGGTCTGGC TCGGCAATTA CAGCTACTTC
AACGCGCTCG GCGGCGGCCT GTCGGCCCGC ACCCAGCTCG GCAGCCTCGG CCTGGTCGAG
GGCTATGTCG AGACGCGACA CCGCCGCTTC AGCGACTCGA CTTATTTTCC CACCGCCAGC
GACCAGACCG GCGACCTGCT GACCGCGGCG GTGCTGACCG ACTTCCGCTG GGGCGGGGTG
CACTGGACCA CGCGCGCCGG TTTCGACAGC AACAAGGCCA TCGCCGATTA CAACAGCTAC
AAGCGCTATT CGATCGACGT CGCGTTGCCG ATCGAATTCG TCGCGACGGT GTTCGGCGCG
CAGCGCGCCT TCGTGTTCGC GCCGACGGCG GGGTTCAGCC AGGCCAATTA CGAAGCGCCG
AATTTCATCG TCGATCCGGT GATCGTCCGG CGCGATCAGG AATATCGCTA CGGCGCGATC
TTCGATGCGC AACTCGTCGA CAATATCGGG CTGCGCACCC AGGTGAGCTA CACCAAGATC
GACTCCAACC TGCCCAACTA CCGCACCAAC AATCTGTCGG CCTCGATCGG CCCGACGCTG
CGCTTCTGA
 
Protein sequence
MTFRAIIFGS LAAIGVAVSG AAAQQSGPVP VSATQPDYDA LFAQMYQRPA DLDVSFKFAR 
EAVERGDYEA AIGALERMLF FNPDLPRVKL ELGVLYFKLA SYEIARGYLI DAIKGSNAPD
DIRAQVMAYL AEIDRRLATY EYSVFMHAGM RYQTNANIGP NGQQVRALGQ DAILDPRFGK
KPDWSTFQTV AASYAYKLNL RGDAIETTFL GLNSRQMTLS QFNLGFVEIT AGHRVGLGQN
SSFKYYGIGD KVWLGNYSYF NALGGGLSAR TQLGSLGLVE GYVETRHRRF SDSTYFPTAS
DQTGDLLTAA VLTDFRWGGV HWTTRAGFDS NKAIADYNSY KRYSIDVALP IEFVATVFGA
QRAFVFAPTA GFSQANYEAP NFIVDPVIVR RDQEYRYGAI FDAQLVDNIG LRTQVSYTKI
DSNLPNYRTN NLSASIGPTL RF
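
The protein sequence follows from the gene sequence under translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 60 bases only. It is a simplified, hypothetical helper rather than the tool used to produce this record: it assumes the input starts at a valid initiation codon, always emits M for the first codon, and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-residue assignments as the standard code; it differs in the
# set of permitted start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace between the 10-base groups."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Simplification: the first codon is assumed to be an initiator
        # and is therefore translated as methionine.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


first_60 = "GTGACGTTTC GGGCAATCAT CTTCGGCTCG CTGGCGGCGA TCGGCGTCGC CGTGTCGGGC"
print(translate_cds(first_60))  # MTFRAIIFGSLAAIGVAVSG
```

The printed residues match the first 20 residues of the protein sequence above (MTFRAIIFGS LAAIGVAVSG).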