Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0882 |
Symbol | |
ID | 6408536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 938359 |
End bp | 941691 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 642710796 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_001989915 |
Protein GI | 192289310 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.574847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCG TGTCGTGGAG TGTCGAGGAG ATCGAGCCGT CAGTGCGCGA GAAGGCCGAG GCCGCCGCAA AGCGCGCCGG GCTGTCGCTC ACCGACTGGA TCAATGCTCA GCTCGGCGAG GCTGCCCCGC AGCAGCCGGC CACCGAACAA CTGCGCATGC CCGGCCGTCC GGCGATGCCC GAGCGCAGCG CCACCGAAGT CGCTGAAATC CATCAGCGGC TCGACGCCAT CGCCCGCCAG ATCGATCATA TTTCCCGCGC CCCGGCGCGC AGCGAACCGC CGGTCGCCCG GCAGCTGAAT GACGCGATTT CCCGGCTCGA CGCCCGCTTG GCGCGGATCA CCGAGCCGAA GCCCGCGGCG GCTCGGCCTG CTGAGGCCGC AGCGGTGCCG CAGACGCCCA CCGACCGGGT CGAACGCGCC GCTGCGCAGG TTTACCCGTC CCCGACGCTC GACCCGAACG CGCTCGACAA GGCGATTGCC GAGATCGCCG CGCGGCAGAG CGAACTCGAT GCCAGCATCG GCCGGATGCC GCGCCAGCCG GCGTCGTTTG CTCCGCCGAT CGCGCACGCG ATGGCTCCGC CGCCCCCGCA GGCCGGACCG GACTTCACCA GCCTGGAGAA GCAGCTCCAC AAGATCACCA GCCAGATCGA CGCGCTGCAG CGCTCCGACA AGGTCGAGCA CTCGATCGCG GCGTTCCGCG CCGACCTGGC CGAGATCCGC CAGACCATCA CCGAAGCGAT GCCGCGCAAG GCGATCGAAA CGCTGGAAGG CGAGATCCGC TCGCTGGCGC AGCGTCTCGA CGAAAGCCGC GCCAATGGCA GCAACAGCGA AGTCATCGTC GGCATCGAAC GCGCACTTGG CGAGATCCAC GCCGCGTTGC GCTCGCTGAC GCCGGCCGAG CAGCTCGCCG GCTTCGACGA AGCGATCCGC AATCTCGGCG GCAAGATCGA CATGATCGTT CGCAACAGCG ACGATCCCGG CACGGTGCAG CAGCTCGAAA ACGCCATCGG CGCGCTGCGC GGCATCGTCT CCAATGTCGC CTCGAATGAA GCGCTGGGGC AGCTCAGCGC CAACGTCCAC GCGCTCGGCG AGAAGATCGA GCAGCTGGCG CAGGCCGACA ACCATAGCAT TTCGTTCGCC GCCCTGGAGC AGCGCATCTC GGCGCTGACC GCGGCGTTGG AAAGCCGCGA GCGCCCTGCG CCGAGCGAAT CCACCGAGCA GCTCGAAAGC GCCGTGCGGA CACTGTCCGA GCGGATCGAC CATCTGCCGA TCGGCAACGA TAATCAGTCC GCCTTCGCGC ATCTCGAACA GCGCGTCGCG CATCTGCTGG AGCGGATGGA AGCCGCAACC GAGCAGCGCG GCGGCAGTGC CAATCTCGGC CGCGTCGAGG AAGGCCTGCA CGACATCCTG CGGATGCTCG AGCGGCAGCA GTCGCAATTT GACGTGCTGG CTGACATCGA CCGTCGGCCG GCACCGGCGC TGGATCCGAG CTTCGTCGAC ACCATCAAGC GCGAACTTTC CGACATGCGC TTCAGCCAGT CGGAAACCGA TCGTCACACC CAGGATTCGC TCGAGGCGGT GCACAACACC CTCGGCCACG TCGTCGACCG GCTGGCGATG ATCGAAGGCG ATCTGCGCAC CGCGCGCGCA ACGCCTCCGC TGGCCCCGCC GCCGGCCCCC GCTCCGGCGC CCGAACAGGC GAAGCCCGCC TTCCTGGCCG CGGCGCCGAT CGCGCCGGCA CCGGCTGCTG CCGCGGCCGC GCCTGTCCCG CCGCAGCCTC AGCCGGAGAT GGCCAATCCG GCCGCCGAGC CGTTCGCAGC CGCGCCGCGC GAATTTGCTG CCGCCAAACC CGCCGTCGAG CCGCCGGCGC CGGCGCCCGA ACCGCGTGGG CCGCGGGCTT TCCACGACAT CCACGAGTCC GCCGCGGGCC CGCGTCAGCC GCAGAAGATC GAGCCGGTCG TTGCTGTGCC GCAGCCGGCG CGCCGGGAAG CCACGCTGCC GCCGGACCAT CCGCTCGAGC CCGGCACCAA GCCGCCGGCC CGGGTCGCTT CGCCGTCGGA ACGGATCGCC GCGTCCGAAA ACGCGCTCGG CGAAATCGCT CCGGCGCAGC CGGAGCCGGC CAACGCCACC AGCTTCATCG CGGCCGCGCG CCGCGCCGCA CAGGCGGCGG CGTCCGCCAG CGCCGGCAAG GCCAAGCCCG GCAAGCCGAA GACCGATGGC GACAAGCCCG ATCCGGACGG CGGCACCCCG GGCTCGCCGC TCGGCTCCAA GATCAAGTCG CTGCTGGTCG GCGCCAGCGT GGTGGTGATC GTGCTGTCGA GCTTCCAGAT GGCGATGAAA CTGTTCGACA GCGGCGAAGC GCCGCCGGTC GCCAGCGTCA CCGCGCCGAA GCTGAGCCCG GCGCCGGACA AACAGCGCCT ACCTGCGGAC GAGCGGCAAT CTGATCCGAC CGAGCCCACC GCGCCGCCGT CTGTGGCGCC GGTGGCTCCG CCGCCGTCGA TGATTTCGCC GACTCCGGTC GAGCGTCAGT CGCTGTACAC CCCGCCAGCC CCGCCGCAGA CCGAGCCGGC CGCGCCCAGC GACATCACCG GCACGATCCC GTCGCAGCCG AGCGCGGCGC CGGAGAAGTT CGGCACCGTC GCGATTCCGT CCGCCGAACG GCTGCCCGAC ACCATCGGCG GCGCGACGCT ACGTACCCTC GCGCTCAAGG GCGATGCCGC CGCCGCTTAC GAAGTCGCCA CCCGCTACGT CGAAGGCAAG GGCGTGCCGG TGAACTACGA CGAAGCCGCC AAATGGTATC AGCGCGCAGC GGATGCCGGC GTGACACCGG CGATCTTCCG GATCGGCACG CTGTACGAGA AGGGTCTCGG CGTGAAGCGC GACCTCGACG TCGCGCGGAC GCTGTATTCG ACCGCGGCCG ATCGCGGCAA CGCCAAGGCG ATGCACAATC TGGCCGTGCT GTACGCCGAC GGCGGCAGCA AGGGCGCGAA CTACAAGACC GCCGCGGCCT GGTTCCGCAA GGCCGCCGAG CGCGGCGTTG CCGACAGCCA GTTCAACCTC GGCATCCTGT ATGCCCGCGG CATCGGCGTC GATCAGAACC TCGCCGAGTC GTACAAGTGG TTCTCTCTCG CCGCCGCTCA GGGCGATGAG GATGCGGGCC GCAAGCGCGA CGACGTCGCC AAGCGCCTCG ACCCGCAGTC GCTGGCAGCC GCCAAGCTCG CGATCCAGAC CTTCACGGCC GAGCCGCAGC CCGACGCCGC CGTCAAGGTC GCGGCGCCCG CCGGAGGCTG GGACGGGCAG CCGGCCGCTG CGGCCAAGCG CGCCGCCCGC TAA
|
Protein sequence | MNRVSWSVEE IEPSVREKAE AAAKRAGLSL TDWINAQLGE AAPQQPATEQ LRMPGRPAMP ERSATEVAEI HQRLDAIARQ IDHISRAPAR SEPPVARQLN DAISRLDARL ARITEPKPAA ARPAEAAAVP QTPTDRVERA AAQVYPSPTL DPNALDKAIA EIAARQSELD ASIGRMPRQP ASFAPPIAHA MAPPPPQAGP DFTSLEKQLH KITSQIDALQ RSDKVEHSIA AFRADLAEIR QTITEAMPRK AIETLEGEIR SLAQRLDESR ANGSNSEVIV GIERALGEIH AALRSLTPAE QLAGFDEAIR NLGGKIDMIV RNSDDPGTVQ QLENAIGALR GIVSNVASNE ALGQLSANVH ALGEKIEQLA QADNHSISFA ALEQRISALT AALESRERPA PSESTEQLES AVRTLSERID HLPIGNDNQS AFAHLEQRVA HLLERMEAAT EQRGGSANLG RVEEGLHDIL RMLERQQSQF DVLADIDRRP APALDPSFVD TIKRELSDMR FSQSETDRHT QDSLEAVHNT LGHVVDRLAM IEGDLRTARA TPPLAPPPAP APAPEQAKPA FLAAAPIAPA PAAAAAAPVP PQPQPEMANP AAEPFAAAPR EFAAAKPAVE PPAPAPEPRG PRAFHDIHES AAGPRQPQKI EPVVAVPQPA RREATLPPDH PLEPGTKPPA RVASPSERIA ASENALGEIA PAQPEPANAT SFIAAARRAA QAAASASAGK AKPGKPKTDG DKPDPDGGTP GSPLGSKIKS LLVGASVVVI VLSSFQMAMK LFDSGEAPPV ASVTAPKLSP APDKQRLPAD ERQSDPTEPT APPSVAPVAP PPSMISPTPV ERQSLYTPPA PPQTEPAAPS DITGTIPSQP SAAPEKFGTV AIPSAERLPD TIGGATLRTL ALKGDAAAAY EVATRYVEGK GVPVNYDEAA KWYQRAADAG VTPAIFRIGT LYEKGLGVKR DLDVARTLYS TAADRGNAKA MHNLAVLYAD GGSKGANYKT AAAWFRKAAE RGVADSQFNL GILYARGIGV DQNLAESYKW FSLAAAQGDE DAGRKRDDVA KRLDPQSLAA AKLAIQTFTA EPQPDAAVKV AAPAGGWDGQ PAAAAKRAAR
|
| |