Gene Rpal_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0473 
Symbol 
ID6408121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp509782 
End bp511851 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content66% 
IMG OID642710385 
ProductCarbohydrate-selective porin OprB 
Protein accessionYP_001989509 
Protein GI192288904 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.693985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGTTC GACGACGCCC CGCCCGCCCG ACTTCGGCCC GCCGCGACTT CGGCTCGCGT 
GAGCTATCCA CGCCGCACAA TCGGATCTCT CTGCGTCCGT CCGCCTCGGC GCTGATCGCC
GGCGCGGCGA CCGCCGCAGC TTTGCTGCTG CCGCAGGGCG CCGCCGCGGG CGGCATAGGC
AAGCAGCGGC CGGACCCGGC GTACGACTGG AGCGGGTTCT ATGTCGGCGC CCACGCGGGT
TACGTCCGCG GCCATGCCGG CGCGACGCTG CAGGACGCGA CGTCGGTCGC GCAGAACAGC
AACGGCGCTG GCGGCATCAC CGCGGGCGCC CAGGCGGGCT ACAATTTCGT TACCGGATCG
AACCTGCTGC TCGGCGTGGA GGCCGACATC TCGTTTCCGA GCTATCTGCC GCAGAATGCG
GTGCTGTCGG AGTTCGATAA CGGTACCGCC AATACGCTGC AGCACCTCGA CTATTACGGC
ACCGTCCGCG CCCGGCTCGG CGTCATTGCC GGCCGTTGGC TCGGCTATGT CACCGGTGGC
CTCGCCTTCG AGCACGAGCG CTACCTCACC GATCCCGATG CCGGCCGCGT GATGAATGCT
CGGATCGGCT GGGCCGCAGG TGCGGGCGTC GAATACGGCT TCGCGCCGAA CTGGAGCGTG
CGGATCGAAT ATCTCTACAG CCAATACCAG AATGCGAGCG TCGATCTGCC GACCGGGGCG
AGCTACGCCA CGTCGCTGAA TCTGCAGACC GTCCGGGTCG GCGTGAACCG CAAGATCGAT
TGGTTCAGCA CCGGCAGCAC AGACGCCTTC CCGTCGCTCG CGGCAGTCGC CGATCCGGAG
TCCGACCGCT GGGAGATCCA CGGCCAGACC ACTTTCATCG GGCAGGGCTA CCCGTCGTTC
CGGGCGCCGT ATAGCGGCAC CAACAGTCTG TTTCCCGGCG CGCAGTTCAA GAACACCTGG
AGCACCAGCC TGTTCGCCAA CGTCCGGCTG TGGGACGGCG GCGAACTCTA TTTCAACCCT
GAATTCCTGC AGGGCTACGG GCTCAGCGAC ACGGTGGGGG CCGGCGGCTT CCCGAATGGC
GAAGCGCAGA AATCCGGCTT CGCCTACCCG CATTTCAGCC CGTCACGGTT CTATCTGCGC
CAGACCTTCG GGCTCGGCGG CGAGCAGGAA CAGCTCGCGT CGAGCGGTTC GCAGCTCTCC
GGCAAGGCCG ACATCTCGCG GCTGACGCTG CAGGTCGGCC GCTTCAGCGT CATCGACGTG
TTCGACGGCA ACTCTTACGC GCACGATCCG CGCCGCGATT TCATGAACTG GTCGATCTGG
GCGTCGGGCG CGTTCGACTA CGCGGCCGAC AAGCTCGGCC TCGGCTATGG CGCCACCGCC
GAACTCAACC AGAAACAATG GGCGATCCGC GCCGGCTACT TCCTGGTCGA TGCTGAATCC
AACTCGAACA ATTACGACAT GAAGCTGCTC AGGCGCGGTG AATACGTCGC CGAGCTCGAG
ACCCGCTATT CGCTGTTCGG CCTTCCCGGC AAGCTGCGCA CGCTGGGCTT CATCAACAGC
ACCTATTCGG GGAGCTATCG CGAGACGCTC AACGATCCGT CGCTGAACCT CGACATCACC
CAGACCCGCC GCGGCCGGAT CAAATACGGC TATGCGTTGA ATCTGGAGCA GGCGGTCACC
GACGACATCG GCGTGTTCGG ACGCTGGAGC TGGAACGACG GCAAGAACGA GATCATGGCG
TTCACCGATA TCGACAGCAG CCTGTCGGGC GGCGTGTCGA TCCGCGGCCA GCGCTGGGGC
AGGCCGGACG ACGTGATCGG CATTGCCGGC GCGCTCAACG GCTTGTCGCG CGATCATCGC
GATTTCCTCG CCGCCGGCGG CCTCGGCCCG CTGATCGGCG ACGGCGCTCT CAACTATCGC
CGCGAGCGTG TGTTCGAGAG TTACTACGCG CTGGCGCTCA ATTCGTCGTG GACCGCGACC
GCCGACTACC AGCTGATTGC CAACCCCGCC TACAACGCCG ACCGCGGCCC GGTCTCGGTG
TTCTCTGGCC GGGTGCACGG GGAGTTCTGA
 
Protein sequence
MIVRRRPARP TSARRDFGSR ELSTPHNRIS LRPSASALIA GAATAAALLL PQGAAAGGIG 
KQRPDPAYDW SGFYVGAHAG YVRGHAGATL QDATSVAQNS NGAGGITAGA QAGYNFVTGS
NLLLGVEADI SFPSYLPQNA VLSEFDNGTA NTLQHLDYYG TVRARLGVIA GRWLGYVTGG
LAFEHERYLT DPDAGRVMNA RIGWAAGAGV EYGFAPNWSV RIEYLYSQYQ NASVDLPTGA
SYATSLNLQT VRVGVNRKID WFSTGSTDAF PSLAAVADPE SDRWEIHGQT TFIGQGYPSF
RAPYSGTNSL FPGAQFKNTW STSLFANVRL WDGGELYFNP EFLQGYGLSD TVGAGGFPNG
EAQKSGFAYP HFSPSRFYLR QTFGLGGEQE QLASSGSQLS GKADISRLTL QVGRFSVIDV
FDGNSYAHDP RRDFMNWSIW ASGAFDYAAD KLGLGYGATA ELNQKQWAIR AGYFLVDAES
NSNNYDMKLL RRGEYVAELE TRYSLFGLPG KLRTLGFINS TYSGSYRETL NDPSLNLDIT
QTRRGRIKYG YALNLEQAVT DDIGVFGRWS WNDGKNEIMA FTDIDSSLSG GVSIRGQRWG
RPDDVIGIAG ALNGLSRDHR DFLAAGGLGP LIGDGALNYR RERVFESYYA LALNSSWTAT
ADYQLIANPA YNADRGPVSV FSGRVHGEF