Gene Information |
Locus tag | RPB_3055 |
Symbol | |
ID | 3910856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3484682 |
End bp | 3485917 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637884962 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_486667 |
Protein GI | 86750171 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.363566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG CAGGGTTTCT GGAGGAGCTG CTGGATGGGG CTGATGTGGA GTGGGAACCA TTGGGGGAGG TCACTCAACC AACAGCGAAC ATCAAATGGT CACAAGCCGA CGGCGTTTAC CAATATATTG ATCTCACCTC CGTCGACATC AAAACCAAAC GCGTTACCGA GGCAAGCGAG ATTACAGCCG AGACCGCGCC AAGCAGAGCG CAGAAGCTCG TTAAAGAAAA TGACGTCATT TTCGCTACGA CGCGCCCCGC TCAACAACGA TACTGCCTAA TCGACTCCGA ACTGGCCGGA AACGTCGCCA GCACGGGTTA CTGCGTGCTC AGAGCAAAGA AGGATCAGGT ACTACCTAAG TGGATTTTGC ACTGGCTTGG CACAACAGAA TTCAAGAATT ACGTCGAGGA GAATCAGAGT GGGGCTGCAT ACCCAGCGAT ATCAGACGGC AAGGTGAAAG CGTTCAAAAT TCCCATTCCA TGCCCGGATG ATCCGGAGAA GTCGCTGGCG ATACAGGGGG AGATCGTCCG AATACTGGAC ACATTCACCG AGCTTACCGC TGAGCTTACC GCGGGGCTTG CCGCCGAGCT TGCCCAGCGC AAAAAACAAT ACAGCCACTA CCGCGACCAG CTCTTGACCT TCAATGAAGA TGAGGTGGAG TGGAAGACGC TGGGGGATAT CGCGACTCTA CGCCGAGGGC GAGTTATGTC GAAGGGCTAC CTGCGAGATA ACGCCGGTGT GTACCCGGTC TACAGCTCCC AAACTGCAAA CAACGGCATG ATTGGCCAGA TCGACACGTT TGACTTTGAC GGTGAGTACG TCAGTTGGAC CACAGACGGA GCAAACGCCG GAACTGTATT CTATAGAAAC GAAAAATTCT CGATTACTAA CGTTTGCGGC GTAATAAAAG AAAATGGAAC GTGCCCGCTG GACCTAAAAT TTTTATCTTT TTGGCTTTCG ACGGAGGCCA AGAAGCATGT TTACAGTGGA ATGGGCAACC CGAAGCTGAT GAGTCATCAA GTCGAGAAAA TACCAATCCC GATTCCCTTT CCAGATGACC CTAAAATATC GCTAGAAGCC CAAAAGCGCG TCGCCGCCAT CCTCGACAAG TTGGATGCGC TGACGACTTC CCTCACTGAG ATCCTGCCGC GTGAAATCGA GCTGCGTGAA AAGCAGTATG CCTATTACCG CGATCAGCTG CTGAGCTTCC CCAAGCCGGA CGCGGAGGCT TTCTAA
|
Protein sequence | MSAAGFLEEL LDGADVEWEP LGEVTQPTAN IKWSQADGVY QYIDLTSVDI KTKRVTEASE ITAETAPSRA QKLVKENDVI FATTRPAQQR YCLIDSELAG NVASTGYCVL RAKKDQVLPK WILHWLGTTE FKNYVEENQS GAAYPAISDG KVKAFKIPIP CPDDPEKSLA IQGEIVRILD TFTELTAELT AGLAAELAQR KKQYSHYRDQ LLTFNEDEVE WKTLGDIATL RRGRVMSKGY LRDNAGVYPV YSSQTANNGM IGQIDTFDFD GEYVSWTTDG ANAGTVFYRN EKFSITNVCG VIKENGTCPL DLKFLSFWLS TEAKKHVYSG MGNPKLMSHQ VEKIPIPIPF PDDPKISLEA QKRVAAILDK LDALTTSLTE ILPREIELRE KQYAYYRDQL LSFPKPDAEA F
|
| |
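The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and do not come from the record.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3484682
END_BP = 3485917


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1236, matches "Gene Length"
print(protein_length(length))  # 411, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.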