Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2123 |
Symbol | |
ID | 6409783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2291671 |
End bp | 2293038 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642712008 |
Product | hypothetical protein |
Protein accession | YP_001991120 |
Protein GI | 192290515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.140853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGG CCAATCTCGG CCTGCCTCTG ATCGAGGCCA GCCAGGCGCA GAAGCACGTC ACCCACAACG AGGCGTTGTT CAGCCTCGAC GGCCTGGTCC AGCTCGCGGT GCTGTCGTCG GCGCTGAGCG CGCCGCCGTC GGCGCCGGGC GAGGGGCAGC GCTGGATCGT CGCGGCCAGC GCCAGCGGCG CCTGGGCCGG CAAGAGCGGA CAGGTCGCCG CTTACTACGA CGGCGACTGG CGGTTCTTTG CGCCGAACCC GGGTTGGCTC GCCTATGATC TGTCGACGCA GACGCTGCTG GCCTGGAGCG GCGCGGCGTG GGTCAACGCA CTGGCGGCGT TTCAGAACCT GCCGCTGCTC GGCCTCAACA CCACGGCGGA TTCCAGTCGT CGCCTGGCGG TGAAGTCCGA CAACGTGCTG CTGTCGAATG ACGACGTCAC GCCCGGCAGT GGCCATATGC GCCTGACCAT CAACAAGAGC GCCGCCGCGA AAGACGCCGG ACTGATCCTG CAGGACAATT GGAGCGCGCG GGCGCTGCTC GGTCTGCTCG GCAATGACGA CTTCGTCGTC AAGCTGACGC CGGACGCGGC CAATTACTAC GCTGGCTTAC GCGGATGGTC GGCGCTGCAC GGCCGGCTCG ATCTGAAGGA CGCGCGGCGC CGCCAGCCGC TGCAGTGGTC GTCGCGGCCG GGCAGCACCG CGCTCGATGT CGCCGGCCTT GGTACGATGA TCACCGGTAC CGCGACCGCG GTCAGTCCCT CGGCGGGCAA TCTGTTTCTG TCCTCGCCGC GGCTCGATCT CGTCTCCGCC GCCAGCGCCG GCGCGTCGGC CGGGGTGAGC GGATCGGCGC TGACGCTGTG GCGCGGCAAT GCCGCGAGCC TCGGCGGGTT CTATCTGCTG ATGCGGTTCG GCATCGAGTG GTTTCAGACC AACTGCCGGC TGTTCGCCGG GCTGTATGGC TCGGCGAGCC CGATCGGCAA CGTCGCTCCG AGCGCGCTGC TCAATTTGAT CGGTGTCGGC TTCGATTCCG GCGATCCGAC TTTGTCGCTG CTGAGCAACG ACGGCAGCGG CGCGGCGACC AAGACCAGCC TCGGCGCCGG CTTCCCGACC ACCGGCGGCC AGGATCTGTA TGAGTTGTTA GTTTCGGCCG AGCCGAACGG CAGCGAGATC CGCTACCGCG TCGAACGGCT GAATTCCGGC GACGTCGCGA CCGGCGTCGT TACCAGCGAC CTGCCGGTTG CCACGCAATT CCTCACCCCG CATCTGTGGA TGAATAACGG CAGCACCGCC GCATCGGTCA ACGTCGCGCT GTTCCAGATG TACGCCGAGC CCGCGGCCCT GCTCGGCTCG CGCGGGGCGA TTGGTTAG
|
Protein sequence | MTTANLGLPL IEASQAQKHV THNEALFSLD GLVQLAVLSS ALSAPPSAPG EGQRWIVAAS ASGAWAGKSG QVAAYYDGDW RFFAPNPGWL AYDLSTQTLL AWSGAAWVNA LAAFQNLPLL GLNTTADSSR RLAVKSDNVL LSNDDVTPGS GHMRLTINKS AAAKDAGLIL QDNWSARALL GLLGNDDFVV KLTPDAANYY AGLRGWSALH GRLDLKDARR RQPLQWSSRP GSTALDVAGL GTMITGTATA VSPSAGNLFL SSPRLDLVSA ASAGASAGVS GSALTLWRGN AASLGGFYLL MRFGIEWFQT NCRLFAGLYG SASPIGNVAP SALLNLIGVG FDSGDPTLSL LSNDGSGAAT KTSLGAGFPT TGGQDLYELL VSAEPNGSEI RYRVERLNSG DVATGVVTSD LPVATQFLTP HLWMNNGSTA ASVNVALFQM YAEPAALLGS RGAIG
|
| |