Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5221 |
Symbol | |
ID | 6412921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5632259 |
End bp | 5634811 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642715111 |
Product | hypothetical protein |
Protein accession | YP_001994184 |
Protein GI | 192293579 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.954731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGGCA GCATTCCCGA TCCGTCGCAG GCCCCGCACG ATGAAGAGGC GGTCGCGCGG CTGCACCTGG ACAAGGCGAT CGGGCGCGCG ACGCTGGCGA TTGCCTGGGA GCGCGCGTGG CCGCATCTGG CGCGGGTAAT GAGCGTCGTC GGCCTGTTCC TGGCGCTCTC CTGGGCTGGG CTGTGGCTGG CCTTGCCGTT TTCGGCGCGG ATCGCCGGCT TGGTACTACT GTCGGCCCTG GTGCTTGCCG CGCTGATTCC CGCCATCAGG TTCCGCTGGC CGAGCCGCGA CGAGGCGCTG GCGCGGCTCG ACCGCAACAC CGGCCTCAAG CACCGGCCGG CGACGGCGCT CGGCGACACG CTGGCCTCCG GCGATCCGGT GGCGCGGGCG CTGTGGCAGG CGCAGCGCGA CCGCACCTTG GCGACGATCC GCGGCCTCAA GCCCGGCCTG CCGTCGCCGC GGCTGCCGAT CCACGATCCC TGGGGGCTCC GCGCCCTGGT GGTGATCCTG CTGGTCGCCA CGTTCTTCGC CGCTGGCGAG GAGCGTACCC CGCGCGTGAT GGCGGCGTTC GACTGGAAGG GCGCGCTGTC GCCCTCGACC GTCCGGGTCG ATGCCTGGGT GACGCCGCCG GTCTATACCA ACAAGCCGCC GATCATTCTC ACCGCCGCCT CCAACAAGGA CCTGGGCACC CCAGGCAGCG GTCCGCTGCC GGTGCCGGTC GGCTCGACCC TGCTGGTACG TTCCAGCGGC GGTGATCTCG ACATCGCGGT CGGCGGCGGC GTGGTCGAGG TCAAGCCGGA CAGCGACGCG CCGAAGGGCA CCAGCGAACG GCATTTCCGC ATCACCGGCG ACGGCACCGC GCGGGTCCGG GCGCCGTCGA GCGAAGCGCC GTGGAGCTTC ACCGCCACCC CGGACAAGCC GCCGGCGATT GCGCTCGCCA AGGAGCCGCA GCGCCAGGCG CGCGGTTCGC TGCAACTGTC CTACAAGCTC GAAGACGATT ACGGGGTCAC CGAGGCCGAG GCGCAATTCG TCGCCGCGCC GCCCGCCAAG GCCCCGGGCG CCAAGCCGGG GGATGCGCCG CGGCCGCTGT TCGAAGCGCC GCAGTTCAAG CTGGTGCTGC CGAATGCCCG CACCCGCGCC GGCGTCGGTC AGACCGTCAA GGACGTCAGC GAAGATCCCT ATGCGGGCGC CGAAGTGACG TTGACGCTCA CCGCCAAGGA CGAGGCCGGC AACCAGGGCC ACAGCGAGCC GTTCACGATG CGGCTGCCGG AGCGGCTGTT CACCAAGCCG CTGGCGCGCG CGCTGATCGA GCAGCGCCGC ATCCTGGCGC TCGATGCCAA TGCGAATAGC CAGGTGTACG CCGCACTCGA TGCGCTGCTG ATCGCGCCGG AAGTGTTCAC GCCTGATGCC GGCCAGTATC TCGGGCTCTA TACCATCGCC GATCAGCTCG AACGCGCCCG TACCGACGAT GCGCTGCGCG AAGTCGTTGG CAATTTGTGG TCGCTCGCGG TCTCGATCGA AGACGGCGAT GCGTCGGATG CCGAGAAGGC GCTGCGTGCC GCGCAGGACG CGCTCAAAGA CGCGCTGGAG CGCGGTGCGT CCGACGACGA GATCAAGCAG CTCACCGACA AGCTGCGCGC TGCGCTCGAT ACCTATATGC GCCAGCTCGC GCAGCAGCTC CGCAACAATC CGCAGCAGCT CGCCCGTCCG CTCGATCCGA ACACCAAGGT GATGCGGCAG CAGGATCTGG AAAACATGAT CCAGCGGATG GAGCGGCTGT CGCGCTCCGG CGACAAGGAA GCCGCCAAGC AGCTGCTCGA TCAGCTCGCG CAGATGCTGG AGAACCTGCA GATGGCGCAG CCCGGCCAGG GCGGCGACAG CGGCGACATG GAGCAGGCGC TCAACGAGCT CGGCGACATG ATCCGCAAGC AGCAACAGCT CCGTGACAAG ACCTACAAGC AGGGCCAGGA TCAGCGCCGC GACCGGATGC GCGGCCAGGA CGGCGAGCAG AACCTCGGCG ACCTGCAGCA GGATCAGCAG AACCTGCACG ACCGGCTGCG CAAGCTGCAG CAGGAGCTCG CCAAGCGCGG CCTCGGCCAG AGCCCCGGCG GCGAAAAGGG GCAGCAAGGG CAGCAGGGCC AGCAAGGCGA GGGCGGTCTC GACCAGGCCG ACTCGGCGAT GGGCGATGCC GAAGGCCGGC TCGGCGACGG CAATGCCGAC GGTGCGGTGG ATTCGCAGGG CCGCGCGCTG GAAGCGCTAC GCCAGGGCGC CCAGAAACTC GCCGAAGCGA TGCAGCAGGG CGACGGTGAT GGTCAGGGCG ATGGCCAGGG CAATCGCCCC GGCCGCCAGC AGAGCGGCGC CAATCAGACC GATCCGCTCG GCCGTCCGCT GCGCGGCCGC GACCTCGGCG ACGATCTCAC CGTGAAGATC CCCGGCGAAA TCGACGTGCA GCGCGTCCGC CGCATCCTCG AAGAACTCCG CCGCCGCCTC GGCGACTCCG GCCGCCCGCA GATCGAACTC GACTACATCG AGCGGCTGCT GAAGGATTAC TAA
|
Protein sequence | MSGSIPDPSQ APHDEEAVAR LHLDKAIGRA TLAIAWERAW PHLARVMSVV GLFLALSWAG LWLALPFSAR IAGLVLLSAL VLAALIPAIR FRWPSRDEAL ARLDRNTGLK HRPATALGDT LASGDPVARA LWQAQRDRTL ATIRGLKPGL PSPRLPIHDP WGLRALVVIL LVATFFAAGE ERTPRVMAAF DWKGALSPST VRVDAWVTPP VYTNKPPIIL TAASNKDLGT PGSGPLPVPV GSTLLVRSSG GDLDIAVGGG VVEVKPDSDA PKGTSERHFR ITGDGTARVR APSSEAPWSF TATPDKPPAI ALAKEPQRQA RGSLQLSYKL EDDYGVTEAE AQFVAAPPAK APGAKPGDAP RPLFEAPQFK LVLPNARTRA GVGQTVKDVS EDPYAGAEVT LTLTAKDEAG NQGHSEPFTM RLPERLFTKP LARALIEQRR ILALDANANS QVYAALDALL IAPEVFTPDA GQYLGLYTIA DQLERARTDD ALREVVGNLW SLAVSIEDGD ASDAEKALRA AQDALKDALE RGASDDEIKQ LTDKLRAALD TYMRQLAQQL RNNPQQLARP LDPNTKVMRQ QDLENMIQRM ERLSRSGDKE AAKQLLDQLA QMLENLQMAQ PGQGGDSGDM EQALNELGDM IRKQQQLRDK TYKQGQDQRR DRMRGQDGEQ NLGDLQQDQQ NLHDRLRKLQ QELAKRGLGQ SPGGEKGQQG QQGQQGEGGL DQADSAMGDA EGRLGDGNAD GAVDSQGRAL EALRQGAQKL AEAMQQGDGD GQGDGQGNRP GRQQSGANQT DPLGRPLRGR DLGDDLTVKI PGEIDVQRVR RILEELRRRL GDSGRPQIEL DYIERLLKDY
|
| |