Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2099 |
Symbol | |
ID | 6409759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2272138 |
End bp | 2273175 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642711984 |
Product | Appr-1-p processing domain protein |
Protein accession | YP_001991096 |
Protein GI | 192290491 |
COG category | [R] General function prediction only |
COG ID | [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000167345 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATCA GGCTAGTGAC GGGCAACCTA CTGGAGCGGC GCGTCGACGC GATCGTCAAC ACGGTCAACA CTGTCGGCGT GATGGGCAAA GGAATTGCCC TGCAGTTCAA GCGCAAATGG CCGGCGAATG CCAAAGCCTA CGAAGCGGCC TGCAAGCGCG GCGAAGTCGT TCCCGGTAAG ATGTTCGTGT TCGACAATGG CGGCCTCATC GCACCAAAGT TCATCATCAA TTTCCCAACC AAACGGCACT GGCGACAACC GTCGCGAATG GCAGACATCG AAGCTGGACT GGTCGATTTA ATCGTCCAGA TCCGGCACCT GCAGATTCGA TCAATTGCGC TCCCCCCGCT CGGCTGTGGA AATGGTGGTC TCGACTGGGA CCAAGTGCGC CCAAAAATCG AGGCAGCCTT CCGAGAGTTG CCGGACGTAG ACGTCGAGCT TTTCGCACCC GCAGAGGCAG CTGGCGTTCG CCAACTCGAG CCGGAAGCAC AGAAGCCGCG GATGACTCCC GGCCGTGCTG CGATTCTGAA ACTACTCTCA ATATACCGCG AAATGCGCTA TCCGCTAAGT CAGATCGAGA TTCAAAAGCT CGTCTATTTC CTGGCAAGCG CCGGCCAGGC AATGGGTAGT CTGACGTTCA AGAAACACAT TTATGGCCCC TACGCACCCG AGCTGCGCCA CGTCCTGACG AAGATGGATG GTGCATATCT ACATGGGGTC GGCGATGGCT CTAAGCCTTC CGAGATAACG GTGGTCGGTG AGGCACTTCG AGAGGCGGAA GCTTTCCTCA ACTCCCACCA GGACTTTGAA ACCGTGCGAC GCGTCGAGCA GATTGCGCGG CTGATCGACG GGTTCGAAAC ACCTTACGGC ATGGAGTTGC TCGCCACGGT GCATTGGACC GCGAACGAGA CAAAAACGAC AGATCTAAAC CGGATCGTCG AAGCTGTTCA TAGCTGGAAC GAGCGTAAAC GCCAGATTAT GCTGCCCGCG AACATCAAGT CGGCTCGTGA CCGGTTGGTG ACTGAACAGT GGCTGTAA
|
Protein sequence | MAIRLVTGNL LERRVDAIVN TVNTVGVMGK GIALQFKRKW PANAKAYEAA CKRGEVVPGK MFVFDNGGLI APKFIINFPT KRHWRQPSRM ADIEAGLVDL IVQIRHLQIR SIALPPLGCG NGGLDWDQVR PKIEAAFREL PDVDVELFAP AEAAGVRQLE PEAQKPRMTP GRAAILKLLS IYREMRYPLS QIEIQKLVYF LASAGQAMGS LTFKKHIYGP YAPELRHVLT KMDGAYLHGV GDGSKPSEIT VVGEALREAE AFLNSHQDFE TVRRVEQIAR LIDGFETPYG MELLATVHWT ANETKTTDLN RIVEAVHSWN ERKRQIMLPA NIKSARDRLV TEQWL
|
| |