Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1975 |
Symbol | |
ID | 3909480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2241246 |
End bp | 2244581 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883869 |
Product | Outer membrane autotransporter |
Protein accession | YP_485594 |
Protein GI | 86749098 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR02601] autotransporter-associated beta strand repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTGC CGACGGTGAT GGGTGGCAGG ATTGCGAACG GCAGGGTGCG GATCGCGTCC GTCGCCGCTG CACGGATCAA GCAATTGCTG CTGTCGACCG CGCTGGTCGG CATCGTGGTC TATCCGCAGG CGCTGCAGGC GCAATCCTGG AACGGCGCCG TCGACAGTGA CTTCTCCAAC GGCGCCAATT GGAGCACCGG CGTCGCACCG ACCTCGACCG ACGTCTCGAC GATCGCATCC ACGCCGACCA CCCCGGTGGT GGCCGCGACC ACGGCGAATG CCGGGCAGAT CACGATGACC GGCGCCACCC TCGGCATCAA TGCCGGCGCG ACGCTGGTCT TGAACACCCT GACTGTCGGG AGCGGGTCCA CCGTGACCGG GGCGGGCGCC ATCGCGACGA ATTCGTTGTT TCTCGACTCC GGCTCGGTCG CGCCCGATGT CGCGCTCGGG ATCCTGGCGA CCATGGGGGG CGGCACCATC AGCGGCGCGA TCAGCGGCGC CGGCCAGATG TTCGTCTCTG GCGGCTTCAA CTACCTCACC GGCGCCAACA CCTACAGCGG CGGCACCGAG ATCAACACCA ACGCGTTCCT GACGGTCGGC GACAGTGGCA CCCTCGGCTC GATCACCGGC AACGTCCTGA ACAACGGCAC GCTGACCTTC ACCCGCTCCG ACACGACGAG CTATGGCGGC GTGATTTCCG GCGCCGGCGT TGTCATCAAG AACGGCATCG GCACGCTGGT CCTGTCCGGC GCCAACACCT ACAGCGGCGC GACGACCATC CTCAATGGCA CGCTGCAGAC CGCCGTCGAT TACGTGCTCA GCAGCGCCTC GGCGCTGACG ATTGCCGCCG GTGCGACGCT CGATCTCACC AATACACAAC AGACCGTGGC GTCGCTGGCG GGCGCCGGCA ATGTGAATCT TGGAGCTGGC CACACCTTCA GTTTCGGTGG CGACAATGCG TCCACCAGCT TCACCGGAAA TTTCATTGGC ACGCTAAGCG CCATCAAGGA AGGCACCGGC ACCTTCAGCT TCAGCGGTAC CGGCCCGAGC GCTGGCGCCA TCCAGATCAA TGCGGGTACG CTGGCATTGA CCGGCGCGGC CGACTTCAGC GGTGCAGGCG TCACCTTGCA GGCCAATACG ACGCTGGACA TTTCCGGGGC CAACGCCGGC ATCTCGATCA AGAGCTTGCA GGGCCAGAAC GGCACGGCGG TCCATCTCGG CAGCCGCACG CTGACGGTCC AGCGCACCGA CTTTGTTTTC GGCTCCAACT TCAGCGGTGT CGTCGACGGC AGCGGTGGCC TGACCGTCGG GCAAAATTCG TCGCTCAGGT TGTTCGCCGC CAACACCTAT AGCGGCCCGA CCACGCTCAA CTCGGCGTCG GCTACGCTGG ACCTCCGACA TGTCGGCGCC ATCGCCACCT CCAGCGAGGT CAATCTTGCC GCCAGCGGCG CTACGCTGGA TATCTCCAAC GCCGGTGGAG ACGCCACGAT CCAGGCGTTG CGCGGCGTCG CTGGCAGCAT TGTCACCCTG CGCAACAGCA ATCTGGTGAT CACCAACGCG GCGTCGGAAT TCGCCGGAAT CATCAAGCCA GCGTTTGGAA CGCCCACCGG CGGCGTCATC CTCAATGGCG GCCACCAGAC CCTGTCCGGG ATCAACACCT ATCTGGGCGC CACCACGGTC AATGCCGGGA CGCTCTCGGT CAACGGCTCG ATCGCAACCT CGTCGCTGAC CACCGTGAAT GCCGGCGGCA CGCTCGGCGG CACCGGCACG GTGGGTACCA CGCTGATCGA CGGCGGCGCG CTGGCGCCGG GCAATTCGAT CGGCACGCTG AATGTGAACG GCAACCTCGT CTTCACCGCG GCCTCGAGCT ACATGGTCGA GGTGTCGCCG ACCGCCGCCG ACCGCACCGA CGTCACCGGC ACCGCGACGC TCGGCGGCGC GACGGTCAAC GCGATCTTCG CCAGCGGCAG CTATGTCGAG AAGCAGTACA CCATCCTCAA TGCCGCCGGC GGCATCGTCG GCACCTTCGG TTCGGTGGTG AACAGCAATC TGCCGTCGGG CTTCAAGTCG AGTCTGGGCT ACGATGCTAA CAACGCCTAT CTCAACCTGG TGCTGGACTT TACGCCGACA CCGACTCCAA CACCCACGCC GTCGCCGCTG CCGATCAACA GCGGCCTCAA CGGCAACCAG ACCGCGGTCG CCAACGCGCT GAGCGGGTAC TTCGCGCGGA CCGGCAGCAT TCCGATCGTG TTCGGCGCGC TCGGGCCGGC CGGGCTGAGT GCTGCGGCGG GCGAGACGCC GACCGGCGTT CAGCAGACCA CGTTCAACGC CATGAACATG TTCATGGGCG TGCTGACCGA TCCGTTCAGC AATGGCCGCG GGCGCGGGGC TCAGACGCCG GTCGCGATGT CCTACGCGGG CGGCGGCTCC GTGCGCGATG CCCACGCGAT GATCACCAAA GCGGTGGTGA AGCCGCCGTT CGAGTCGCGT TGGGTCAGTT GGGCGGCGGG CTTCGGCGGT TCGCAGACCA CCGACGGCAA CGCGACGCAG GGCAGCGCCA CGTCGACCAG CCGGCTCTAC GGCATCGCGG CCGGCGCGGA CTACTGGCTG TCGCCGCAGA CCGTCGCCGG CTTCGCGATG GCCGGCGGAG CGACCAGTTT CGGTCTGTCC GGCGGTCTGG GATCGGGACG CTCCGATCTG GTGCAGGTCG GCGGCTTCGT CCGCCATAGC GTCGGCGCCG GCTATCTCAC TGCCGCGGCC GCTTATGGCT GGCAGGAGAT CACCACCGAG CGGACCGTCG GCATCGGCGG GGTCAATCAG TATCGCGCGA CCTTCAATGC CAATGCCTAT TCGGCCCGTG TCGAAGCCGG TCACCGCTGG ATCGCCCCGG CGCTCGGCGG CGTCGGGCTC ACGCCCTATG CGGCGGCGCA GATCACTGCG TTCGATCTAC CGGCTTATGC CGAGCAGACG GTGAGCGGCA CCGGCGTGTT TGCGCTCGAC TACGCGTCGA AGACCGCGAC GGCGACGCGC AGTGAACTCG GCCTGCGCGG CGATCGGTCG TTCGCGCTCG ATGGCGCGCT GCTGACGCTG CGCGGCCGGG CGGCCTGGGC GCACGACTTC GACACCGAGC GCTCGATCGC GGCGACGTTC CAGGCGCTGC CCGGGGCGAG TTTCGTCGTG AACGGCGCGC GGCCGGCGCG GGATGCGGCG CTGACCACGG TATCGGCCGA GGTGAACTGG CTGAACGGAT TCTCGGTCGC GGCCAGCTTC GAGGGCGAAT TCTCCGACGT CACCCGCAGC TACGCCGGCA AGGGCGTGGT GCGCTACGCC TGGTGA
|
Protein sequence | MDVPTVMGGR IANGRVRIAS VAAARIKQLL LSTALVGIVV YPQALQAQSW NGAVDSDFSN GANWSTGVAP TSTDVSTIAS TPTTPVVAAT TANAGQITMT GATLGINAGA TLVLNTLTVG SGSTVTGAGA IATNSLFLDS GSVAPDVALG ILATMGGGTI SGAISGAGQM FVSGGFNYLT GANTYSGGTE INTNAFLTVG DSGTLGSITG NVLNNGTLTF TRSDTTSYGG VISGAGVVIK NGIGTLVLSG ANTYSGATTI LNGTLQTAVD YVLSSASALT IAAGATLDLT NTQQTVASLA GAGNVNLGAG HTFSFGGDNA STSFTGNFIG TLSAIKEGTG TFSFSGTGPS AGAIQINAGT LALTGAADFS GAGVTLQANT TLDISGANAG ISIKSLQGQN GTAVHLGSRT LTVQRTDFVF GSNFSGVVDG SGGLTVGQNS SLRLFAANTY SGPTTLNSAS ATLDLRHVGA IATSSEVNLA ASGATLDISN AGGDATIQAL RGVAGSIVTL RNSNLVITNA ASEFAGIIKP AFGTPTGGVI LNGGHQTLSG INTYLGATTV NAGTLSVNGS IATSSLTTVN AGGTLGGTGT VGTTLIDGGA LAPGNSIGTL NVNGNLVFTA ASSYMVEVSP TAADRTDVTG TATLGGATVN AIFASGSYVE KQYTILNAAG GIVGTFGSVV NSNLPSGFKS SLGYDANNAY LNLVLDFTPT PTPTPTPSPL PINSGLNGNQ TAVANALSGY FARTGSIPIV FGALGPAGLS AAAGETPTGV QQTTFNAMNM FMGVLTDPFS NGRGRGAQTP VAMSYAGGGS VRDAHAMITK AVVKPPFESR WVSWAAGFGG SQTTDGNATQ GSATSTSRLY GIAAGADYWL SPQTVAGFAM AGGATSFGLS GGLGSGRSDL VQVGGFVRHS VGAGYLTAAA AYGWQEITTE RTVGIGGVNQ YRATFNANAY SARVEAGHRW IAPALGGVGL TPYAAAQITA FDLPAYAEQT VSGTGVFALD YASKTATATR SELGLRGDRS FALDGALLTL RGRAAWAHDF DTERSIAATF QALPGASFVV NGARPARDAA LTTVSAEVNW LNGFSVAASF EGEFSDVTRS YAGKGVVRYA W
|
| |