Gene Information |
Locus tag | Rpal_4066 |
Symbol | |
ID | 6411750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4365448 |
End bp | 4368693 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642713948 |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | YP_001993037 |
Protein GI | 192292432 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain; [TIGR02601] autotransporter-associated beta strand repeat |
|
|
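The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that the reported gene length supports) and that the last codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4365448
END_BP = 4368693
GENE_LENGTH_BP = 3246
PROTEIN_LENGTH_AA = 1081


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Both reported lengths follow from the coordinates.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect these lengths; it only determines which strand of the replicon carries the coding sequence.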
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.392825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGGTTA GGGGCATTGG TGAGCTGAGG TCGAGGGCAA CGGATCGGGC GGGGAGAGCG CGAAGACGGA TCTTGCTTTC ATCGCTGTTG GCTTCGACGG CGCTGGTCGC AATCACGATG CCCGCTTCTG CGCAGCAGGT CTGGGTCGGC ACCGGGCAGG ATTACAATAC GGCATCCAAT TGGAGCGGTC CCGCCGCGGT GCCCGACACC GGTTCGACTG CAGTGTTCAC CAACAACGGT GCGTCGACAT CGGTGGTGCT CTCGGTCACG CGCAGCCCGG ATGGCTTCAC GTTTGATACA GGCGCGCCGA GCTACACGAT CGGAGTCGCA TCCGGCGGGC AGCTCAACAT GAGCGGCGCC GGCATCGTCA ACAACTCCAG CAATGCCCAG AACTTCCTCA TCGGTCCCGG CAGCCAGATT GATTTTCTGG GCTCCAGCAC CGCCGGCAAT GCGACCATCA CAACTCTGAG CGGGGCCACG TTGCTGTTCA GCCGATCCTC GTCGGGAGGA ACGGCTTCAA TTGCCAACGA TGGGACGATG GTCCTGCGGA CAGATTCGGG CGCGATCTCG ATCGGGTCGC TGTCGGGCAC CGGGACCGTC GCGGCGACGA CGGTTAGCGG GGTTCCGGTC CAGACTCTGA CCGTCGGCAG CCTGAACACC TCCACGGAAT TCTCAGGCAC GTTCGTCGAC AACGGAGCAC AATTCGCGCT CGGCAAGACC GGCACCGGCA CGCTGACGCT CACCGGCGAC AATTTCTACA CCGGCGGCAC CACGATCTCG AGCGGAACGC TGCAGCTCGG CAATGGCGGG ATCAGCGGCT CGATCACCGG CGACATCACC AACAATGCGA CGCTGACGGT CAACCGCAGC AACGGCACCA GCCTCGGCGG CGTGATCTCC GGCAGCGGCC AATTAGTCAA GCTTGGCGGC GGCATTCTAG CGCTGCTCGG CAACAACACT TACACCGGCG GCACGACGAT CTCGGCGGGC ACGCTGCGGG TCGGCAACGG CGCCACCAGC GGTTCCATCG TGGGCGACGT CGTCAACAAC GGCGTACTGC AGTTCAATCG GTTCGACTCG ATCGGCTTCA ATGGCGTGAT CACCGGCACC GGCAGCGTCA CCAAGCTCGG CAATAACGCG ATGATCCTGG GCGGGGACAA CACCTATACG GGCGGCACGA CAATCAGCGG CGGTTACTTG CAGGTCGGCA ATGGGGGCAC CGGCGGCTCG ATCGTCGGCG ACGTCCTCAA CAACGGGACG CTGGAATTCG CGCGTTCCGA CGCCCATACG TTCAGTGGCG CTATTTCCGG CACCGGCAAT CTGATCAGCT TCGGCGGCAG CGCCGGCAGT GGCGTTTTCA CGATGACCGG AACCAATACT TACACCGGGG GCACCACCGT CTCCAGAGGC ACATTGCAGA TCGGCGACGG CGGCACCTCG GGCTCGATCG TCGGCGACGT CACCAACAAC GCCACGCTCG CCTTCAATCG CTCCGACGCG ACCAGCTTCG GCGGCGCGAT CTCAGGCGGC GGCAATCTGA TCAAGCGCGG CGCCGGCAAC CTGTCGCTGA CCGGCGTCAG CAGCTACACC GGCGCCACCA CGGTTGAAGC CGGCACGCTC AGCGTCAACG GCTCGATCGC GTCCTCGTCG CTGACGACGG TGAACGCCGG CGCAGCGCTC GGCGGCAACG GCACGGTTGG CACCACGCTG ATCAACGGCG GCGCGCTGGC ACCCGGCAAT TCGATCGGCA CGCTGAATGT GAGCGGCAAC CTGACCCTCA CGGCTGCGTC GAGCTACATG 
CTCGAGCTGT CGCCGAGCAG CGCCGACCGC GTCAACGTCA GCGGCACCGC CACGCTCGGC GGCGCCACGG TGAAAGCGTC GTTCGCCAGC GGCGGCTATG TCGAGCGGCA ATACACTCTC GTCAATGCGA CCGGCGGCGT GGTCGGCACC TTCGGCACGC TGGTGAATAC CAATCTGCCG TCCGGCTTCA GATCGAACCT CGGCTACGAT TCCAACAATG CCTATCTCAA TCTGGTGCTC GACTACACGC CCGGTCCGTC GCCGGACATC AACAGCGGCC TGAACCGCAA CCAGACGGAG GTCGCCAATG CGCTGAGCGG TTACTTCGCG CGCACCGGCA GCATTCCGAT CGTGTTCGGC GCGCTGAACC CGAGTGGGCT CAGCGCCGTG TCGGGCGAGA CCGCGACCGG CGCGCAGCAG TCGACGTTCA GTGCCATGAC CCAGTTCCTG GGCGTGCTGA CCGATCCGTC GAGCAACGGC CGCGGTGCGC GGGATGCTGC GCCGGGGCCG TTGGGGTTCG CGGATCGCAC GCCCCGCGGC TCGGCGTCCG ACGCCTATGC GATGATCACC AAGAGCGCTG CCGAGCGGTT CGTTCCGCAT TGGAATGTGT GGGGCGCGGG CTTCGGCGGC TCACAGACCA CCGATGGCAA TGCTTCGCTC GGCTCCGCCA CCGCAACCAG CCGGCTCGCC GGCATTGCTG CAGGTGCCGA CTACTGGCTG TCGCCGCAGA CTGTCGCGGG TTTCGCGATG GCCGGCGGCG CCACGCAATT CGGACTAGCG GGCGGCCTCG GCTCGGGCAC GTCGGATCTG CTCCAGGTTG GCGGCTTCAT CCGCCACAGT TTTGGTGCGA GCTACCTGAC CGCAGCGGCG GCCTATGGCT GGCAGGACAT CACCACCGAA CGCACCGTCG CGATCGGCGG CCTCAATCAG CTCCGCGCCA ACTTCAACGC CAACGCTTAC TCCGCGCGGG TCGAGGCCGG GCATCGCTGG ATCGCCCCGG CGATCGGCGG TGTTGGTCTG TCACCGTACG CTGCCGCGCA AGTGACGGCC TTTGATCTGC CGGCCTATGC CGAGCAGGCT GTGGGCGGAA CCGGCGTGTT CGCGCTCGGC TATGCGGCCA AGACCGTGAC CGCGACGCGC AGCGAGCTCG GCGTGCGGAC CGACAAGTCG TTCGCGCTGG ATGGCGCGCT GCTGACGCTG CGCGGCCGCG CCGCCTGGGC GCACGACTTC GATGTCGACC GGTCGGTGGC GGCGACCTTC CAGGCGCTGC CCGGCGCCAG CTTCGTTGTG AACGGCGCGC GACCGGCGCG CGATGCGGCG CTGACCACGG TGTCGGCGGA AGTGAGCTGG CTGAACGGCT TCTCGGTCGC CGCCAGCTTC GAAGGCGAGT TCTCCGACGT GACCCGCAGC TATGCCGGCA AGGGACTGCT GCGCTACGCG TGGTGA
|
Protein sequence | MQVRGIGELR SRATDRAGRA RRRILLSSLL ASTALVAITM PASAQQVWVG TGQDYNTASN WSGPAAVPDT GSTAVFTNNG ASTSVVLSVT RSPDGFTFDT GAPSYTIGVA SGGQLNMSGA GIVNNSSNAQ NFLIGPGSQI DFLGSSTAGN ATITTLSGAT LLFSRSSSGG TASIANDGTM VLRTDSGAIS IGSLSGTGTV AATTVSGVPV QTLTVGSLNT STEFSGTFVD NGAQFALGKT GTGTLTLTGD NFYTGGTTIS SGTLQLGNGG ISGSITGDIT NNATLTVNRS NGTSLGGVIS GSGQLVKLGG GILALLGNNT YTGGTTISAG TLRVGNGATS GSIVGDVVNN GVLQFNRFDS IGFNGVITGT GSVTKLGNNA MILGGDNTYT GGTTISGGYL QVGNGGTGGS IVGDVLNNGT LEFARSDAHT FSGAISGTGN LISFGGSAGS GVFTMTGTNT YTGGTTVSRG TLQIGDGGTS GSIVGDVTNN ATLAFNRSDA TSFGGAISGG GNLIKRGAGN LSLTGVSSYT GATTVEAGTL SVNGSIASSS LTTVNAGAAL GGNGTVGTTL INGGALAPGN SIGTLNVSGN LTLTAASSYM LELSPSSADR VNVSGTATLG GATVKASFAS GGYVERQYTL VNATGGVVGT FGTLVNTNLP SGFRSNLGYD SNNAYLNLVL DYTPGPSPDI NSGLNRNQTE VANALSGYFA RTGSIPIVFG ALNPSGLSAV SGETATGAQQ STFSAMTQFL GVLTDPSSNG RGARDAAPGP LGFADRTPRG SASDAYAMIT KSAAERFVPH WNVWGAGFGG SQTTDGNASL GSATATSRLA GIAAGADYWL SPQTVAGFAM AGGATQFGLA GGLGSGTSDL LQVGGFIRHS FGASYLTAAA AYGWQDITTE RTVAIGGLNQ LRANFNANAY SARVEAGHRW IAPAIGGVGL SPYAAAQVTA FDLPAYAEQA VGGTGVFALG YAAKTVTATR SELGVRTDKS FALDGALLTL RGRAAWAHDF DVDRSVAATF QALPGASFVV NGARPARDAA LTTVSAEVSW LNGFSVAASF EGEFSDVTRS YAGKGLLRYA W
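The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 implies when GTG acts as the initiation codon. The following is a minimal sketch of how the two sequence fields relate, not the database's own pipeline. It assumes the codon assignments of the standard genetic code (which table 11 shares for elongation) and uses only a commonly cited subset of table 11 start codons. The helper names `translate` and `gc_content` are hypothetical, and only the first and last few codons of the record are used as input.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Assumption: a subset of the table 11 initiation codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(sequence: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case it."""
    return "".join(sequence.split()).upper()


def translate(cds: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(cds)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiation codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(sequence)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 18 bases of the gene sequence in the record.
first_codons = "GTGCAGGTTA GGGGCATTGG TGAGCTGAGG"
last_codons = "CTGCGCTACGCGTGGTGA"

assert translate(first_codons) == "MQVRGIGELR"
assert translate(last_codons) == "LRYAW"
```

Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field, although the record does not state whether that figure was computed this way.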