Gene RPB_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2241 
Symbol 
ID3909024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2568705 
End bp2570279 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content65% 
IMG OID637884136 
ProductType I secretion outer membrane protein, TolC 
Protein accessionYP_485857 
Protein GI86749361 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.417493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAA GCGCTCAGAT TGCCATTTTT GCAACCGTCG TCGGGCTGGT TTCGAGCCCC 
GCCGCTCTTG CGGCGGAGCC GTTCACGATT TTGGACGCCA TCAACCAGGC GGTGAAAACC
AACCCCGGCG TCGGCGAGGC GGCAGCCAAC CGTCGCGCGA CGGAGGCCGA GTTGCGGCAG
AGTCAGGGGA CCTTGCTACC GCAGGTTCGG CTCGAAGCCA GCGCTGGCCC GGAGATGCTG
AAGCAATACG TGTCTCCAGC GCCGCTCAAC AACGACGTCT ATCTGCGCGG ACGTCAGGCC
GGTGTCGTGG TTCGGCAGCT GCTGTTCGAT GGGTTTGCCT CGATCAACGA AGTCTGGCGG
CAGGCGGCTC GCGTCGACGC CGCGGCGTTC CGGGTGTTGG AGCGCACCGA ACTGATCGGG
CTCGATGCCG CCGAAGCTTA TATCGATGTG GCGCGCTACA CGCGGCTGGT TGGACTCGCC
GAACAGAATC TCAAGGTCCA TCTCGAATTG CGCAAGAACG TGTTGGCGCG CTTCGAGGGT
GGTCGGGCCG GCGAGGGCGA CACGCAGCAA GCGGAAGAAC GCGTCGCTGC AGCGCAGGCC
GTGGCGGCGG AGTTCCATCT CAGCCTCGAG ACGGCGCGCG CCAAGTTCCG CAAGGTCGTC
GGCCTCGAGC CGTACAATCT GCGCTTCCCC GGCCGTCTCG CGGATCTGCC TAAGAACAAG
GCCGCATCGC TCGACATCGC CTACAAGTTC AATCCGACGC TGCGCGCCGC GGGCGCCGAC
GTGGTCGCGG CCAAGCGCGG CTTCGATGCC ACAACCGGCG CGTTCCTTCC GACGCTGTCG
CTGGAAGGCC GCGCCTCGCG CGGCAAGGAG TCGATCCTCT ACAACAATCA GTACGACCAG
GTCAGCGGCA AGCTGGTGGC GTCCTGGGAT ATCTTCAACG GCGGCCAGAG CAGCTGGAAG
CGTGAAGAAG CTGCACAGCG GATGATCGAG GAGCAGCAGC GTCAGGCTCG GCTGCAGCGC
GATGCGCTGG AATCGATCGA CAAGGCCTGG GCCGCGCGGA CCATCACCAA CGACCGCGTC
GCAGCGCTCG TTCGCGACGT CGAGGCCGCG CGCCGGACCT TCATCGCCTA CAATAAGGAA
TACGAGCTCG GCCAGCGCAC GCTGATCGAT CTGCTCAACT CGCAGAACCA GTATTTCAAC
GCCAATGTGT CGCTGGTTTC GGCGCGCGGA GTTGCGGTGT TCGCCGACTA TCAGCTTCTC
GCCACGATGG GGCAGATGCT GAACTATCTG AAGACCGGTC ATCCGCCGGA GACCGAGTTG
GTCGACGTGC ATCCGAGCGG GTTCATCGGC TACAAGCTCG CGCCCATCCG GCTTGCGCCC
CCGTCGCCCG GTCCGGAACC GCTCAGCACT GTGCCGCCGG TGCCGCTGTT CGGGTTCTTT
TTCAACGGCC CGCCGAAATT GCCGACCGTG ATCAATTTCG ACGATCGCTG GGCGTCTCAT
GAGGTCGCCG AGAGTAGCGC GCTGTTCGTT GCAGCCGGAA CCTACGGTCG CAGCGCGCAG
GCCGGCGCCA AGTAA
 
Protein sequence
MLKSAQIAIF ATVVGLVSSP AALAAEPFTI LDAINQAVKT NPGVGEAAAN RRATEAELRQ 
SQGTLLPQVR LEASAGPEML KQYVSPAPLN NDVYLRGRQA GVVVRQLLFD GFASINEVWR
QAARVDAAAF RVLERTELIG LDAAEAYIDV ARYTRLVGLA EQNLKVHLEL RKNVLARFEG
GRAGEGDTQQ AEERVAAAQA VAAEFHLSLE TARAKFRKVV GLEPYNLRFP GRLADLPKNK
AASLDIAYKF NPTLRAAGAD VVAAKRGFDA TTGAFLPTLS LEGRASRGKE SILYNNQYDQ
VSGKLVASWD IFNGGQSSWK REEAAQRMIE EQQRQARLQR DALESIDKAW AARTITNDRV
AALVRDVEAA RRTFIAYNKE YELGQRTLID LLNSQNQYFN ANVSLVSARG VAVFADYQLL
ATMGQMLNYL KTGHPPETEL VDVHPSGFIG YKLAPIRLAP PSPGPEPLST VPPVPLFGFF
FNGPPKLPTV INFDDRWASH EVAESSALFV AAGTYGRSAQ AGAK