Gene Information |
Locus tag | RPD_0905 |
Symbol | |
ID | 4021379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1018962 |
End bp | 1020950 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961095 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_568044 |
Protein GI | 91975385 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.486466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCATC GCGATCTGTC AGTTGGCCCG TCAGGCGGCG TGCTTGCAAC CCGGCGCTCG TTTCTTCAAT CCTCGACGGC GTTCATCGGC GCCGTGTCCC TCGGCGTGCC GTTCGGCGGG CCCACATGGG CAAGCGAGCC GCGGCGTTAC CCGATCGAGA CGCCGGCGGT GACCACGAAA GAGCGGATGA TCGCCTTTCC GGCGACGCGG ACGCCCGGCC TGAAGAAGAC GGAGCTGAAC CAGGTCGCGC GCTACAAGGA GCTCGGCTAT GGCGAATGGT CGTTCGGCTC CGGCCTGCCG GTGGTCCAGC GCACCGACCT GATGCCCGCC GGCTATCAGA AGCCGGCGAA CGCGGCCAGG ACTCGGCTGC TCAAATTCTT CTCGTTCTCC GATGTCCACA TCACCGACAA GGAAGCCCCG AACCAGCTGA TCGCGTTCCA GCAGACCGAG CCCGCCGCCG CGAGCAACAC CTCGATCTAT TCGCCGGTGA TGCTGTATTC GACGCAGGTG CTGGATGCGG CCGTGCAGAC GGTCAACGAT CTGCACCAAC GCGACCCGTT CGATTTCGGC ATCGCGCTGG GCGACGCCTG CAACAGCACC TCGTACAACG AGCTGCGCTG GTACATCGAC GTGCTCGACG GCAAGCCGAT CACGCCGAGC TCCGGCAAGC ATCGCGGCAA GGACAGCATC GACTTTCAGA TGCCATTCCA GGCCGCAGGC CTCGCCAAGG ATCTTCCCTG GTATCAGGTT CTCGGCAACC ACGATCATTT CATGATCGGA TCGTTTCCGG TCGACGCCGA TCCGTCGATG GGCCTGCGGC AGACCTACAC CGCCGACAAG GTCTGGGCGG TCGGCGATCT GCTGAGACCG AACCTCGCCG GCTTCCCGGC GCTGTTCGAC TATCGCAAGC TGAAGGCCGA GCCGGCGTTC TATCCGGGCG TCATCGATGG CGCGAGCCCT TATGGGGCGA TCATTCATGC CGGCCGCGCC GACGACCCGG CGTTCAACGG CCAGCCGCCG CAGATCGACG CAGATCCCTT GCGCCGTCCT TTGCTCCGCG CCGAATGGCT CGCCGAATTC CGCGACACCA CGACCAGCCC GAAGGGTCAC GGCTTCGATC TGGTCGACCA CGCCAGCGCC GTCGACGGCT TTGCCTGCTA CAGTTTCCTG CCGAAAGCCG GCCTGCCGCT GAAGGTGATC GTGCTCGACG TCACCCAGTC CGAACAGGAC GGCTCACGCG ACATTCACGG CCACGGCTTT CTCGACGCGC GGCGCTGGGA ATGGCTCAAG GCCGAACTGG CGCGCGGACA GGCCGACAAT CAGCTGATGA TCATCGCCAA CCACATTCCG ATCGGCGTCT CGCCGATCGG CTCCGAGATG GAATGGTGGA TGGGCGACGC CAACGCAGCG CCCGGCTTCG CCAACGCCGT CGACCTCGCC GGCCTGGTGC AGACGCTGCA GGCGACGCCG AATCTGTTGA TGTGGATCGC CGGGCATCGC CACATGAACG TCGTCAAGGC GTTTCCCTCC GCCGATCCGA ACCGGCCGGA GCAAGGGTTC TGGCAGGTCG AGACCTGTTC GCTGCGCGAC TTCCCGCAGC AGTTCCGAAC CTTCGAGATC ACGCTCAACG CCGACTACAC GGTGTCGATC GAGGCCGTGA ACGTCGACGT CGCTGCTGCC GAGGGAACGC CGGCGGCGCA GTCGCGCAAA TACGCCATCG CGACCCAGCA GATCATTCAG AACGACATCA CCTTCAACAG CCCGAACTAC GCGACGGCCG GCGGCCGCGG CACGCTGCAG 
GTCCCGAGCA TGGACCCGAC GAGGCCGCAA AGCGACGATC CCAAGGCGAC CGACCCCTCG ATCCAGTTCG TCGACCTGAG CACGGCCGCA AAGCCGGTCC CGTATCACGC GTCGAGCAAT GTCGAGCTGC TGAAGCAGCT CAGCCCGGAA ATGGTCAGCG TGCTGAAGCG GGCGGTGCCG CTGCGGTAG
|
Protein sequence | MNHRDLSVGP SGGVLATRRS FLQSSTAFIG AVSLGVPFGG PTWASEPRRY PIETPAVTTK ERMIAFPATR TPGLKKTELN QVARYKELGY GEWSFGSGLP VVQRTDLMPA GYQKPANAAR TRLLKFFSFS DVHITDKEAP NQLIAFQQTE PAAASNTSIY SPVMLYSTQV LDAAVQTVND LHQRDPFDFG IALGDACNST SYNELRWYID VLDGKPITPS SGKHRGKDSI DFQMPFQAAG LAKDLPWYQV LGNHDHFMIG SFPVDADPSM GLRQTYTADK VWAVGDLLRP NLAGFPALFD YRKLKAEPAF YPGVIDGASP YGAIIHAGRA DDPAFNGQPP QIDADPLRRP LLRAEWLAEF RDTTTSPKGH GFDLVDHASA VDGFACYSFL PKAGLPLKVI VLDVTQSEQD GSRDIHGHGF LDARRWEWLK AELARGQADN QLMIIANHIP IGVSPIGSEM EWWMGDANAA PGFANAVDLA GLVQTLQATP NLLMWIAGHR HMNVVKAFPS ADPNRPEQGF WQVETCSLRD FPQQFRTFEI TLNADYTVSI EAVNVDVAAA EGTPAAQSRK YAIATQQIIQ NDITFNSPNY ATAGGRGTLQ VPSMDPTRPQ SDDPKATDPS IQFVDLSTAA KPVPYHASSN VELLKQLSPE MVSVLKRAVP LR