Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4785 |
Symbol | |
ID | 6412471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5149275 |
End bp | 5150630 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642714663 |
Product | conserved hypothetical protein; putative signal peptide |
Protein accession | YP_001993750 |
Protein GI | 192293145 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.432202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAAAC TACTGCTCGG TTCGGTCGCG GTCTTTGCGC TGGCGACGTC CGCCGCGCTG GCGCAAACCG AAGGCGAATT TCCCGCGACC CTCGCTGGCC ATGCGGTGCT GCCGGCGACC TCGTTCGTCG ATGCGCCGGC CGATGCGCCG GCCGATCTGA AGACTTCGGG CAAATACACC ACCGGACAGC GCGTCGAGGC CCAGGGCAGC GTGATGGGCA AGTCCAACGG CCGTCCGACC GGCGTGTCGG TGCCGTTCAA GGGCCAGCCG CTGCAGGGCC ATTCCGGCAT CAAGGCGATG CCGGACGGCA CCTTCTGGGT GCTGACCGAC AACGGTTTCG GCTCGCGCTA CAACTCGGCC GACTCGATGC TGTATCTGGA CAACTACAAG ATCGACTGGG CGACCGGCGC GGTCGACCGC AAGCAGACCG TGTTCCTGCA CGATCCGGAC AAGAAGGTGC CGTTCCGCAT CGTTCACGAG GACACCGACA AGCGCTATCT GACCGGCGCC GACTTCGACA CCGAAGGCTT CCAGATCATC GGCGACCGGT TCTGGATCGG CGACGAGTTC GGCCCCTACA TCATCGAAGC CGACCTGACC GGCAAGATCG TCGGCGTCTA TGACAGCATG GCCGACGGCA AGCCGATCAA GTCGCCCGAT CATTGGTCGG TGCAGTCGCC CGGCGCGCCG GGCGCGACCT ACACCGGCGT CAACCTGAAG CGCTCCAAGG GCTACGAAGG CTTTGCCGCC TCCAAGGACG GCAAGTTCCT GTACGGCCTG CTCGAAGGCC CGCTGTGGGA CGCCGACAAG AAGGATTGGG AAAAGGTCGA CGGCAAGGAA GCCTCGCGCA TCCTCGAATT CGATGTTGCG CAGAAGAAGT TCACCGGCCG CTCCTGGCAC TACGTGTTCG AGCAGAACGG CAACGCGATC GGCGATTTCA ACATGATCGA CGCCACCCAC GGCCTGGTGA TCGAGCGTGA CAACGGCGAA GGCACCAAGG ACAAGGCCTG CCCCGAAGGC AAGACCGGCA CCGACTGCTT CAATGATCTC GCCAAGTTCA AGCGCGTCTA CAAGATCGAG CTGACCGATG CGAACGCCGG CAAGCCGGTG AACAAGATCG GCTTCATCGA TCTGATGAAG ATCCGCGATC CGGACAAGAA GGCGAAGAAG CCGCTGACCG ACGGCGTGCT GGCGTTCCCG TTCTTCACCA TCGAGAACGT CGACAAGGTG GACGACCGCC ACATCATCGT CGGCAACGAC AACAACCTGC CGTTCTCGTC GAGCCGCGAT CCGAACAAGG CCGACGACAA CGAGTTCGTC CTGCTCGAAG TCGCCGACTT CCTGAAGGCG AAGTAA
|
Protein sequence | MRKLLLGSVA VFALATSAAL AQTEGEFPAT LAGHAVLPAT SFVDAPADAP ADLKTSGKYT TGQRVEAQGS VMGKSNGRPT GVSVPFKGQP LQGHSGIKAM PDGTFWVLTD NGFGSRYNSA DSMLYLDNYK IDWATGAVDR KQTVFLHDPD KKVPFRIVHE DTDKRYLTGA DFDTEGFQII GDRFWIGDEF GPYIIEADLT GKIVGVYDSM ADGKPIKSPD HWSVQSPGAP GATYTGVNLK RSKGYEGFAA SKDGKFLYGL LEGPLWDADK KDWEKVDGKE ASRILEFDVA QKKFTGRSWH YVFEQNGNAI GDFNMIDATH GLVIERDNGE GTKDKACPEG KTGTDCFNDL AKFKRVYKIE LTDANAGKPV NKIGFIDLMK IRDPDKKAKK PLTDGVLAFP FFTIENVDKV DDRHIIVGND NNLPFSSSRD PNKADDNEFV LLEVADFLKA K
|
| |