Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3056 |
Symbol | |
ID | 3910857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3485914 |
End bp | 3487461 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637884963 |
Product | type I restriction-modification system, M subunit |
Protein accession | YP_486668 |
Protein GI | 86750172 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.994596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGC AAGAACAACG CGCGGCGCTA CAACGGAAAA TCTGGGATAT CGCAAACGAT GTTCGCGGCT CCGTTGATGG CTGGGACTTC AAGCAATATG TCCTAGGTAC ACTCTTCTAT CGGTTCATCA GCGAGAACTT CGCCGCTTAT ATCGAGGCGG ATGATGAGAG CATCGATTAT GCGGCTCTAT CCGATGACGT CATCACCGAT GACATCAAGG ATGATGCCAT CAAGACCAAG GGCTATTTCA TCTATCCCAG CCAGCTGTTC GTGAATGTCG CGAAGAACGC CAACATCAAT CACAGCCTGA ACACCGACCT AGCGCACATC TTCGCGGCGA TTGAATCATC GGCCAATGGC TACCCTTCCG AGCAGGACAT CAGAGGTCTT TTCGCCGATT TCGACACGAC CAGCACGCGT CTTGGCCATA CCGTCTCGGA AAAGAACAGC CGATTAGCCA AGGTCCTGAA ACGCGTTGCA GAGCTCGATT TCGGCGACTT CCACAACAGC CAGATTGATT TGTTCGGCGA TGCCTACGAA TTCCTGATCT CGAACTACGC CGCCAATGCG GGCAAGTCCG GCGGGGAGTT CTTCACGCCT CAACATGTCT CAAAGCTCAT CGCGCAGCTC GCCATGCACG GGCAAACACA AGTTAACAAG ATTTACGACC CGGCCTGCGG CTCCGGCTCG CTACTGTTGC AGGCGAAGAA GCATTTTGAC GAACATATCA TCGAAGAGGG TTTCTTTGGG CAGGAGATCA ACCACACGAC CTACAACCTC GCCCGCATGA ACATGTTCCT GCACAACATC AACTACGACA AGTTCAATAT CCAGCGCGGC GATACGCTGA CCCAGCCGCA TTTCCAGGAC GACAAACCCT TTGACGCCAT CGTCTCCAAC CCGCCTTATT CGGTAAAGTG GATCGGCTCA GACGATCCGA CCCTAATCAA CGATGACCGC TTTGCTCCGG CGGGGGTGTT GGCACCCAAA TCAAAGGCCG ATTTCGCCTT TGTGCTCCAT GCGCTTAGCT ATCTCTCGGC CAAGGGCCGC GCGGCGATCG TCTGCTTTCC GGGCATTTTC TATCGCGACG GGGCGGAGAA GAAGATCCGA CAATATCTGG TCGATAACAA TTACGTTGAG ACAGTGATAG CGCTCGCCTC CAACCTCTTC TATGGCACGA CCATCGCCGT GACGATCCTT GTCCTCGCCA AGAACAAGAC GGACACGGCC ATCCAGTTCA TCGACGCCAG CGGCGAAGAG TTTTTCAAGA AGGCCACCAA CACCAATCTG ATGACGGACG ATCACATCGC GCGGGTGATG GAGATTTTTG ATCGTAAGGA AGACGTCGAT CATGTCGCCG CCTCGGTGCA GTATGAGATC ATCGTGGAGC GCGGCTATAA CCTCTCGGTC AGCTCCTATG TCGAGCCGCG CGACACGCGC GAGATTGTCA GTATTGGTGA GCTGAATGCG AAGATCAGAA CCACGGTTCT GCGGATCGAC CAGCTGCGCG CCGACATCGA CGCCATCATT GCGGAGATTG ACGCATGA
|
Protein sequence | MTGQEQRAAL QRKIWDIAND VRGSVDGWDF KQYVLGTLFY RFISENFAAY IEADDESIDY AALSDDVITD DIKDDAIKTK GYFIYPSQLF VNVAKNANIN HSLNTDLAHI FAAIESSANG YPSEQDIRGL FADFDTTSTR LGHTVSEKNS RLAKVLKRVA ELDFGDFHNS QIDLFGDAYE FLISNYAANA GKSGGEFFTP QHVSKLIAQL AMHGQTQVNK IYDPACGSGS LLLQAKKHFD EHIIEEGFFG QEINHTTYNL ARMNMFLHNI NYDKFNIQRG DTLTQPHFQD DKPFDAIVSN PPYSVKWIGS DDPTLINDDR FAPAGVLAPK SKADFAFVLH ALSYLSAKGR AAIVCFPGIF YRDGAEKKIR QYLVDNNYVE TVIALASNLF YGTTIAVTIL VLAKNKTDTA IQFIDASGEE FFKKATNTNL MTDDHIARVM EIFDRKEDVD HVAASVQYEI IVERGYNLSV SSYVEPRDTR EIVSIGELNA KIRTTVLRID QLRADIDAII AEIDA
|
| |