Gene RPB_1975 details


Gene Information

Locus tag: RPB_1975
Symbol:
ID: 3909480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2241246
End bp: 2244581
Gene Length: 3336 bp
Protein Length: 1111 aa
Translation table: 11
GC content: 67%
IMG OID: 637883869
Product: Outer membrane autotransporter
Protein accession: YP_485594
Protein GI: 86749098
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR02601] autotransporter-associated beta strand repeat
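
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; both are assumptions about this record's conventions. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2241246, 2244581)
print(length)                  # 3336
print(protein_length(length))  # 1111
```

Both values agree with the Gene Length and Protein Length fields listed in the record.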


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTGC CGACGGTGAT GGGTGGCAGG ATTGCGAACG GCAGGGTGCG GATCGCGTCC 
GTCGCCGCTG CACGGATCAA GCAATTGCTG CTGTCGACCG CGCTGGTCGG CATCGTGGTC
TATCCGCAGG CGCTGCAGGC GCAATCCTGG AACGGCGCCG TCGACAGTGA CTTCTCCAAC
GGCGCCAATT GGAGCACCGG CGTCGCACCG ACCTCGACCG ACGTCTCGAC GATCGCATCC
ACGCCGACCA CCCCGGTGGT GGCCGCGACC ACGGCGAATG CCGGGCAGAT CACGATGACC
GGCGCCACCC TCGGCATCAA TGCCGGCGCG ACGCTGGTCT TGAACACCCT GACTGTCGGG
AGCGGGTCCA CCGTGACCGG GGCGGGCGCC ATCGCGACGA ATTCGTTGTT TCTCGACTCC
GGCTCGGTCG CGCCCGATGT CGCGCTCGGG ATCCTGGCGA CCATGGGGGG CGGCACCATC
AGCGGCGCGA TCAGCGGCGC CGGCCAGATG TTCGTCTCTG GCGGCTTCAA CTACCTCACC
GGCGCCAACA CCTACAGCGG CGGCACCGAG ATCAACACCA ACGCGTTCCT GACGGTCGGC
GACAGTGGCA CCCTCGGCTC GATCACCGGC AACGTCCTGA ACAACGGCAC GCTGACCTTC
ACCCGCTCCG ACACGACGAG CTATGGCGGC GTGATTTCCG GCGCCGGCGT TGTCATCAAG
AACGGCATCG GCACGCTGGT CCTGTCCGGC GCCAACACCT ACAGCGGCGC GACGACCATC
CTCAATGGCA CGCTGCAGAC CGCCGTCGAT TACGTGCTCA GCAGCGCCTC GGCGCTGACG
ATTGCCGCCG GTGCGACGCT CGATCTCACC AATACACAAC AGACCGTGGC GTCGCTGGCG
GGCGCCGGCA ATGTGAATCT TGGAGCTGGC CACACCTTCA GTTTCGGTGG CGACAATGCG
TCCACCAGCT TCACCGGAAA TTTCATTGGC ACGCTAAGCG CCATCAAGGA AGGCACCGGC
ACCTTCAGCT TCAGCGGTAC CGGCCCGAGC GCTGGCGCCA TCCAGATCAA TGCGGGTACG
CTGGCATTGA CCGGCGCGGC CGACTTCAGC GGTGCAGGCG TCACCTTGCA GGCCAATACG
ACGCTGGACA TTTCCGGGGC CAACGCCGGC ATCTCGATCA AGAGCTTGCA GGGCCAGAAC
GGCACGGCGG TCCATCTCGG CAGCCGCACG CTGACGGTCC AGCGCACCGA CTTTGTTTTC
GGCTCCAACT TCAGCGGTGT CGTCGACGGC AGCGGTGGCC TGACCGTCGG GCAAAATTCG
TCGCTCAGGT TGTTCGCCGC CAACACCTAT AGCGGCCCGA CCACGCTCAA CTCGGCGTCG
GCTACGCTGG ACCTCCGACA TGTCGGCGCC ATCGCCACCT CCAGCGAGGT CAATCTTGCC
GCCAGCGGCG CTACGCTGGA TATCTCCAAC GCCGGTGGAG ACGCCACGAT CCAGGCGTTG
CGCGGCGTCG CTGGCAGCAT TGTCACCCTG CGCAACAGCA ATCTGGTGAT CACCAACGCG
GCGTCGGAAT TCGCCGGAAT CATCAAGCCA GCGTTTGGAA CGCCCACCGG CGGCGTCATC
CTCAATGGCG GCCACCAGAC CCTGTCCGGG ATCAACACCT ATCTGGGCGC CACCACGGTC
AATGCCGGGA CGCTCTCGGT CAACGGCTCG ATCGCAACCT CGTCGCTGAC CACCGTGAAT
GCCGGCGGCA CGCTCGGCGG CACCGGCACG GTGGGTACCA CGCTGATCGA CGGCGGCGCG
CTGGCGCCGG GCAATTCGAT CGGCACGCTG AATGTGAACG GCAACCTCGT CTTCACCGCG
GCCTCGAGCT ACATGGTCGA GGTGTCGCCG ACCGCCGCCG ACCGCACCGA CGTCACCGGC
ACCGCGACGC TCGGCGGCGC GACGGTCAAC GCGATCTTCG CCAGCGGCAG CTATGTCGAG
AAGCAGTACA CCATCCTCAA TGCCGCCGGC GGCATCGTCG GCACCTTCGG TTCGGTGGTG
AACAGCAATC TGCCGTCGGG CTTCAAGTCG AGTCTGGGCT ACGATGCTAA CAACGCCTAT
CTCAACCTGG TGCTGGACTT TACGCCGACA CCGACTCCAA CACCCACGCC GTCGCCGCTG
CCGATCAACA GCGGCCTCAA CGGCAACCAG ACCGCGGTCG CCAACGCGCT GAGCGGGTAC
TTCGCGCGGA CCGGCAGCAT TCCGATCGTG TTCGGCGCGC TCGGGCCGGC CGGGCTGAGT
GCTGCGGCGG GCGAGACGCC GACCGGCGTT CAGCAGACCA CGTTCAACGC CATGAACATG
TTCATGGGCG TGCTGACCGA TCCGTTCAGC AATGGCCGCG GGCGCGGGGC TCAGACGCCG
GTCGCGATGT CCTACGCGGG CGGCGGCTCC GTGCGCGATG CCCACGCGAT GATCACCAAA
GCGGTGGTGA AGCCGCCGTT CGAGTCGCGT TGGGTCAGTT GGGCGGCGGG CTTCGGCGGT
TCGCAGACCA CCGACGGCAA CGCGACGCAG GGCAGCGCCA CGTCGACCAG CCGGCTCTAC
GGCATCGCGG CCGGCGCGGA CTACTGGCTG TCGCCGCAGA CCGTCGCCGG CTTCGCGATG
GCCGGCGGAG CGACCAGTTT CGGTCTGTCC GGCGGTCTGG GATCGGGACG CTCCGATCTG
GTGCAGGTCG GCGGCTTCGT CCGCCATAGC GTCGGCGCCG GCTATCTCAC TGCCGCGGCC
GCTTATGGCT GGCAGGAGAT CACCACCGAG CGGACCGTCG GCATCGGCGG GGTCAATCAG
TATCGCGCGA CCTTCAATGC CAATGCCTAT TCGGCCCGTG TCGAAGCCGG TCACCGCTGG
ATCGCCCCGG CGCTCGGCGG CGTCGGGCTC ACGCCCTATG CGGCGGCGCA GATCACTGCG
TTCGATCTAC CGGCTTATGC CGAGCAGACG GTGAGCGGCA CCGGCGTGTT TGCGCTCGAC
TACGCGTCGA AGACCGCGAC GGCGACGCGC AGTGAACTCG GCCTGCGCGG CGATCGGTCG
TTCGCGCTCG ATGGCGCGCT GCTGACGCTG CGCGGCCGGG CGGCCTGGGC GCACGACTTC
GACACCGAGC GCTCGATCGC GGCGACGTTC CAGGCGCTGC CCGGGGCGAG TTTCGTCGTG
AACGGCGCGC GGCCGGCGCG GGATGCGGCG CTGACCACGG TATCGGCCGA GGTGAACTGG
CTGAACGGAT TCTCGGTCGC GGCCAGCTTC GAGGGCGAAT TCTCCGACGT CACCCGCAGC
TACGCCGGCA AGGGCGTGGT GCGCTACGCC TGGTGA
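
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a listing. The helper names are hypothetical, and only the first line of the sequence is used here for brevity, so the printed percentage is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGGACGTGC CGACGGTGAT GGGTGGCAGG ATTGCGAACG GCAGGGTGCG GATCGCGTCC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on this line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Passing the full listing through `clean_sequence` and then `gc_percent` would give the whole-gene figure to compare against the GC content field.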
 
Protein sequence
MDVPTVMGGR IANGRVRIAS VAAARIKQLL LSTALVGIVV YPQALQAQSW NGAVDSDFSN 
GANWSTGVAP TSTDVSTIAS TPTTPVVAAT TANAGQITMT GATLGINAGA TLVLNTLTVG
SGSTVTGAGA IATNSLFLDS GSVAPDVALG ILATMGGGTI SGAISGAGQM FVSGGFNYLT
GANTYSGGTE INTNAFLTVG DSGTLGSITG NVLNNGTLTF TRSDTTSYGG VISGAGVVIK
NGIGTLVLSG ANTYSGATTI LNGTLQTAVD YVLSSASALT IAAGATLDLT NTQQTVASLA
GAGNVNLGAG HTFSFGGDNA STSFTGNFIG TLSAIKEGTG TFSFSGTGPS AGAIQINAGT
LALTGAADFS GAGVTLQANT TLDISGANAG ISIKSLQGQN GTAVHLGSRT LTVQRTDFVF
GSNFSGVVDG SGGLTVGQNS SLRLFAANTY SGPTTLNSAS ATLDLRHVGA IATSSEVNLA
ASGATLDISN AGGDATIQAL RGVAGSIVTL RNSNLVITNA ASEFAGIIKP AFGTPTGGVI
LNGGHQTLSG INTYLGATTV NAGTLSVNGS IATSSLTTVN AGGTLGGTGT VGTTLIDGGA
LAPGNSIGTL NVNGNLVFTA ASSYMVEVSP TAADRTDVTG TATLGGATVN AIFASGSYVE
KQYTILNAAG GIVGTFGSVV NSNLPSGFKS SLGYDANNAY LNLVLDFTPT PTPTPTPSPL
PINSGLNGNQ TAVANALSGY FARTGSIPIV FGALGPAGLS AAAGETPTGV QQTTFNAMNM
FMGVLTDPFS NGRGRGAQTP VAMSYAGGGS VRDAHAMITK AVVKPPFESR WVSWAAGFGG
SQTTDGNATQ GSATSTSRLY GIAAGADYWL SPQTVAGFAM AGGATSFGLS GGLGSGRSDL
VQVGGFVRHS VGAGYLTAAA AYGWQEITTE RTVGIGGVNQ YRATFNANAY SARVEAGHRW
IAPALGGVGL TPYAAAQITA FDLPAYAEQT VSGTGVFALD YASKTATATR SELGLRGDRS
FALDGALLTL RGRAAWAHDF DTERSIAATF QALPGASFVV NGARPARDAA LTTVSAEVNW
LNGFSVAASF EGEFSDVTRS YAGKGVVRYA W
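
As a quick consistency check, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below uses a deliberately partial codon table covering only the eight codons that occur in the first 30 bases; it is an illustration rather than an implementation of translation table 11, and the names are hypothetical. In practice a complete table from an established library (for example Biopython's `Seq.translate` with `table=11`) would be the safer choice.

```python
# Partial codon table: only the codons present in the first 30 bases.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GAC": "D", "GTG": "V", "CCG": "P",
    "ACG": "T", "GGT": "G", "GGC": "G", "AGG": "R",
}


def translate_prefix(dna_text: str, table=PARTIAL_CODON_TABLE) -> str:
    """Translate a block-formatted DNA prefix codon by codon.

    Raises KeyError for any codon missing from the partial table.
    """
    seq = "".join(dna_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return "".join(table[codon] for codon in codons)


gene_prefix = "ATGGACGTGC CGACGGTGAT GGGTGGCAGG"  # first 30 bases of the gene
protein_prefix = "MDVPTVMGGR"                     # first 10 residues of the protein
print(translate_prefix(gene_prefix) == protein_prefix)  # True
```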