Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0794 |
Symbol | |
ID | 3909608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 889904 |
End bp | 891883 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882686 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_484416 |
Protein GI | 86747920 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.410839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCATC ACCTTCCGTC ACCTGCGCCG TCGAGCGGGC TGCTGTCCAC TCGCCGGGCG TTTCTGCAAT CATCCGCGGC CTTCATCGGC GCGCTGTCCC TCGGATCGTC GCTCGGCGCA CCGGCATGGG GCCGCGACCT TCGACGCTAC CCGATCGCGA CGCCGGCCGA AACGACCGTG ACGCAGATGC TGGCCTTCCC GGCGACGATC GAGCCCGGCC TGGCGAAGAC CGCGCTGCAT CAGGTCGCGC GCTACAAGGA TTTCGGCTAC GGCGAATGGA CGCTCGGCTC CGGCCTGCCG ATCGTGACGC GCACCGACCT GATGCGAGCC GGCTACGAAA AGCCGGTGGG CGGCGAGAGC AAGCGACTGA TCCGGTTCTT CGCGTTCACC GACGTGCACA TCACCGACAA GGAAGCGCCC AACCAGCTGA TCGGATTCCA GCAGACCGAG CCCGCGGCGG TGAACAACAC CTCGATCTAT TCGCCGGTGA TGCCGTACAC GACGCAGGTG CTGGACGCAG CGGTGCAGAC GGTCAACGAC CTGCACAGCC GCGACCCGTT CGATTTCGGC ATCGCGCTCG GCGACGCCTG CAACAGCACG TCCTACAATG AAGTGCGCTG GTACATCGAC GTGCTCGACG GCCAGCCGAT CACGCCGAGC TCCGGCGACC ATCGCGGCCG CGACAGCGTC GACTTCCAGA TGCCGTTTCA GGCCGCCGGT CTCGCCGCGG ATCTGCCCTG GTATCAGGTC CTCGGCAACC ACGATCATTT CATGATCGGC TCGTTTCCGG TCGATGCCGA TCCGACGATC GGGCTACGGC AGTCCTACAC GGCCGACAGG ATCTGGGCGG TCGGCGACGT GCTGAAGCCC AATCGCGAAG GCTTCCCGGC GCTGTTCGAC TATCGCGGCC TCAAGGCCAC GCCGGCGTAC TATCCGGGAG TGATCGACGG CGCGAGCCCG TATGGCGCGA TCATTCATAC CGGCCGCGCC GACGATCCGG CCTTCGCCGG CAAGCCGCCG CAGATCGCCG CCGATCCCGG CCGCCGTCCG CTGGCGCGCG CCGAATGGCT CGCCGAATTC CGCAACACCA CGACGCGTCC GAAGGGGCAC GGCTTCGATC TGATCGACGG CGCCGGCGAC GGCTTCGCCT GCTACAGCTT CGTACCGAAA TCCAACCTGC CGCTGAAGGT GATCGTGCTC GACGTCACCC AGTCCGAACA GGACGGCTCG CGCGACATCC ACGGCCACGG CTTTCTCGAT GCCAGGCGCT GGGACTGGCT GAAGGCCGAG CTGGCGCGCG GCCAGGCCGA CGATCAGCTG ATGATCATCG CCAACCACAT TCCGATCGGG GTGTCGCCGA TCGGCTCCGA AATGGAATGG TGGCTGGGCG ACGCCAATGC GGCGCCTGAT TTTGCCAACG CCGTCGACCT CGCCGGCCTG GTGACGACGC TGCAGGCTGC GCCGAACCTG CTGATGTGGA TCGCCGGGCA TCGCCATTTG AATGTGGTGA AGGCGTTCCC CTCCGCCGAC CCGGACCGGC CGGAGCAGGG CTTCTGGCAG GTCGAGACCT GCTCGCTGCG CGACTTTCCG CAGCAATTCC GGACTTTCGA GATCAGGCTC AACGCGGACG ACACGGTGTC GATCGAGGCG ATGAATGTCG ATATCGCCGT CGCCGACGGC ACGCCGGCGG CGCAATCGCG CAAATACGCC ATCGCCACCC AGCAGATCAT CCAGAACGAC CTGCGGCCCA ACAGCCCGAA CTACGCGACC GCCGGCGGCA AGATTCCGGT GCCGAGCATG GACCCGACGC GGCCGCAGAG CGACGATCCC AAAGCCACCG ATCCGTCGAT CCGGTTCGTC GATCTGAGAA GCGCGGACAA GCCGGTCCAG TATCATGCGT CGAGCAATGT CGCACTGCTG AAGCAGCTCA GCCCGCGGAT GGTCGAGGTG CTGGAGCGGA GGGTGGCGAT GCGGAAGTAG
|
Protein sequence | MTHHLPSPAP SSGLLSTRRA FLQSSAAFIG ALSLGSSLGA PAWGRDLRRY PIATPAETTV TQMLAFPATI EPGLAKTALH QVARYKDFGY GEWTLGSGLP IVTRTDLMRA GYEKPVGGES KRLIRFFAFT DVHITDKEAP NQLIGFQQTE PAAVNNTSIY SPVMPYTTQV LDAAVQTVND LHSRDPFDFG IALGDACNST SYNEVRWYID VLDGQPITPS SGDHRGRDSV DFQMPFQAAG LAADLPWYQV LGNHDHFMIG SFPVDADPTI GLRQSYTADR IWAVGDVLKP NREGFPALFD YRGLKATPAY YPGVIDGASP YGAIIHTGRA DDPAFAGKPP QIAADPGRRP LARAEWLAEF RNTTTRPKGH GFDLIDGAGD GFACYSFVPK SNLPLKVIVL DVTQSEQDGS RDIHGHGFLD ARRWDWLKAE LARGQADDQL MIIANHIPIG VSPIGSEMEW WLGDANAAPD FANAVDLAGL VTTLQAAPNL LMWIAGHRHL NVVKAFPSAD PDRPEQGFWQ VETCSLRDFP QQFRTFEIRL NADDTVSIEA MNVDIAVADG TPAAQSRKYA IATQQIIQND LRPNSPNYAT AGGKIPVPSM DPTRPQSDDP KATDPSIRFV DLRSADKPVQ YHASSNVALL KQLSPRMVEV LERRVAMRK
|
| |