Gene RPB_0502 details


Gene Information

Locus tag: RPB_0502
Symbol:
ID: 3909406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 560323
End bp: 561774
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 64%
IMG OID: 637882390
Product: putative Omp2b porin
Protein accession: YP_484124
Protein GI: 86747628
COG category:
COG ID:
TIGRFAM ID:
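
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 560323
end_bp = 561774
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is complete and ends with one stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1452 0 483
```

Under these assumptions the span matches the reported gene length, is a multiple of three, and gives the reported protein length.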


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0711534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.785623
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGTGA TCAAACGACT GGCGCTGGGC ACGGCGGCGG CATTGTTGAC TGCCGGCGTC 
GCGCAGGCGG CCGATCTTCC GATGAAGGCC AAGGCCGTCG AGTACGTCAA GGTCTGCTCG
ATCTACGGCG CCGGCTTCTA CTACATCCCC GGCACCGACA CCTGCATCAA GCTCGGCGGC
TATCTGCGCG CCGACGCCAC GTTCGGTGGC TCGGGTGCTT ATGACTCCCC GGCCTGGAGC
GGCACCTCCG GCGCCAAGAG CCGCAACGCC AACAGCTATG TGTTCCGTTC GCGTCAGGAC
ATCAACATCG ACACCCGCAC CGCGACCGAA TACGGCGTGG TCCGCACCTA TTTCGACGCG
ACCTTCAACT GGACCACGGG CTCCGATTCC ACGGCCGGCG GCACGCTCGG CGTGTATTTC
GCCTTCATCC AGTTCGCCGG CTTCACCTTC GGCAAGGCGG TGTCGCAGTT CGATACGCCA
TGGTCGGGCT ACCCGGGCAA CAACACCTCG TTCCTGATCG GCGGCTATGA CGACGTCACC
GGCATCAACC AGGTCGCCTA CACCGCCGAA TTCGGCAACG GCGTCTCGGC CTCGATCTCG
CTCGAAGATG CGTCGTCCTA CAATCAATCG GCGCTGTACA ACACATCGAC CATGACGGCG
GCCAATTTCG CCACCGGCGT CTACGGCACC AACTCCTACG CCGGCTCGCA GATCCCCGAC
ATCGTCGGCA AGATCCGCGT CGACCAGGCC TGGGGCCTGT TCCAGGTGTC GGCCGCGGCT
CACCAGGTTC GCGCCAGCTA CTACAACACC GCCCTCGAGA CCTCGGGTCA TCCGAGCGAT
ACCTGGGGCT ACGCGGTGCA GGGCGCAATT TCGCTGAAGA ACCTGCCGAC CGGTCCCGGC
GACAGCATCA ACTTCACCGC GACCTACTCG AACGGCGCCA CCCGCTACGT GATCGGCGGT
TCGTCTCCGA ACTCGTTCGC GATGTACGGC AGCTCCAGCT CGGCCTACCA GAGCCTCGCG
ATCGCGGGCG TGGCTGATGG TGTGTTCTCC GGCACCGACG TCAGCAACGG CAGCGGCATC
TCCAAGACCA CCGCATGGGG CGTCCGTGGC GCGTTCAACC ACAACTGGAA TCCCTACTGG
TCGACCTCGC TGTTCGGTTC GTACACCTCG ATCGACTACA ACGGCACCGC GACGGCCCAG
ATCTGCGCCA CGGCTGTCGG CTTCACCTGC AACCCGGACT TCAAGATCGC TCAGATCGGC
ACCGTCACCC GTTGGACCCC GGTCAAGAAC CTGACGTTCT CGGGCGAAGT GATGTACACC
TATCTCGACC AGAGCTACTC CGGCAACGTC GCCCTTCCGG CGGTGTCGTC GGTGTCCAAG
CCGGCCGCGA CCTACGAGCT CAAGGACCAG GGCGCCTGGA GCGGCAACCT CCGGGTTCAG
CGCACCTTCT GA
 
Protein sequence
MRVIKRLALG TAAALLTAGV AQAADLPMKA KAVEYVKVCS IYGAGFYYIP GTDTCIKLGG 
YLRADATFGG SGAYDSPAWS GTSGAKSRNA NSYVFRSRQD INIDTRTATE YGVVRTYFDA
TFNWTTGSDS TAGGTLGVYF AFIQFAGFTF GKAVSQFDTP WSGYPGNNTS FLIGGYDDVT
GINQVAYTAE FGNGVSASIS LEDASSYNQS ALYNTSTMTA ANFATGVYGT NSYAGSQIPD
IVGKIRVDQA WGLFQVSAAA HQVRASYYNT ALETSGHPSD TWGYAVQGAI SLKNLPTGPG
DSINFTATYS NGATRYVIGG SSPNSFAMYG SSSSAYQSLA IAGVADGVFS GTDVSNGSGI
SKTTAWGVRG AFNHNWNPYW STSLFGSYTS IDYNGTATAQ ICATAVGFTC NPDFKIAQIG
TVTRWTPVKN LTFSGEVMYT YLDQSYSGNV ALPAVSSVSK PAATYELKDQ GAWSGNLRVQ
RTF
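
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The `translate` helper and the compact table encoding are illustrative, not part of the record.

```python
# Codons enumerated in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_block = translate("ATGAGAGTGA TCAAACGACT GGCGCTGGGC")
last_block = translate("CGCACCTTCT GA")
print(first_block)  # prints: MRVIKRLALG
print(last_block)   # prints: RTF
```

The two fragments reproduce the first ten and last three residues of the protein sequence, with the terminal TGA read as the stop codon.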