Gene RPB_3056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3056 
Symbol 
ID3910857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3485914 
End bp3487461 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content55% 
IMG OID637884963 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_486668 
Protein GI86750172 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.994596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGC AAGAACAACG CGCGGCGCTA CAACGGAAAA TCTGGGATAT CGCAAACGAT 
GTTCGCGGCT CCGTTGATGG CTGGGACTTC AAGCAATATG TCCTAGGTAC ACTCTTCTAT
CGGTTCATCA GCGAGAACTT CGCCGCTTAT ATCGAGGCGG ATGATGAGAG CATCGATTAT
GCGGCTCTAT CCGATGACGT CATCACCGAT GACATCAAGG ATGATGCCAT CAAGACCAAG
GGCTATTTCA TCTATCCCAG CCAGCTGTTC GTGAATGTCG CGAAGAACGC CAACATCAAT
CACAGCCTGA ACACCGACCT AGCGCACATC TTCGCGGCGA TTGAATCATC GGCCAATGGC
TACCCTTCCG AGCAGGACAT CAGAGGTCTT TTCGCCGATT TCGACACGAC CAGCACGCGT
CTTGGCCATA CCGTCTCGGA AAAGAACAGC CGATTAGCCA AGGTCCTGAA ACGCGTTGCA
GAGCTCGATT TCGGCGACTT CCACAACAGC CAGATTGATT TGTTCGGCGA TGCCTACGAA
TTCCTGATCT CGAACTACGC CGCCAATGCG GGCAAGTCCG GCGGGGAGTT CTTCACGCCT
CAACATGTCT CAAAGCTCAT CGCGCAGCTC GCCATGCACG GGCAAACACA AGTTAACAAG
ATTTACGACC CGGCCTGCGG CTCCGGCTCG CTACTGTTGC AGGCGAAGAA GCATTTTGAC
GAACATATCA TCGAAGAGGG TTTCTTTGGG CAGGAGATCA ACCACACGAC CTACAACCTC
GCCCGCATGA ACATGTTCCT GCACAACATC AACTACGACA AGTTCAATAT CCAGCGCGGC
GATACGCTGA CCCAGCCGCA TTTCCAGGAC GACAAACCCT TTGACGCCAT CGTCTCCAAC
CCGCCTTATT CGGTAAAGTG GATCGGCTCA GACGATCCGA CCCTAATCAA CGATGACCGC
TTTGCTCCGG CGGGGGTGTT GGCACCCAAA TCAAAGGCCG ATTTCGCCTT TGTGCTCCAT
GCGCTTAGCT ATCTCTCGGC CAAGGGCCGC GCGGCGATCG TCTGCTTTCC GGGCATTTTC
TATCGCGACG GGGCGGAGAA GAAGATCCGA CAATATCTGG TCGATAACAA TTACGTTGAG
ACAGTGATAG CGCTCGCCTC CAACCTCTTC TATGGCACGA CCATCGCCGT GACGATCCTT
GTCCTCGCCA AGAACAAGAC GGACACGGCC ATCCAGTTCA TCGACGCCAG CGGCGAAGAG
TTTTTCAAGA AGGCCACCAA CACCAATCTG ATGACGGACG ATCACATCGC GCGGGTGATG
GAGATTTTTG ATCGTAAGGA AGACGTCGAT CATGTCGCCG CCTCGGTGCA GTATGAGATC
ATCGTGGAGC GCGGCTATAA CCTCTCGGTC AGCTCCTATG TCGAGCCGCG CGACACGCGC
GAGATTGTCA GTATTGGTGA GCTGAATGCG AAGATCAGAA CCACGGTTCT GCGGATCGAC
CAGCTGCGCG CCGACATCGA CGCCATCATT GCGGAGATTG ACGCATGA
 
Protein sequence
MTGQEQRAAL QRKIWDIAND VRGSVDGWDF KQYVLGTLFY RFISENFAAY IEADDESIDY 
AALSDDVITD DIKDDAIKTK GYFIYPSQLF VNVAKNANIN HSLNTDLAHI FAAIESSANG
YPSEQDIRGL FADFDTTSTR LGHTVSEKNS RLAKVLKRVA ELDFGDFHNS QIDLFGDAYE
FLISNYAANA GKSGGEFFTP QHVSKLIAQL AMHGQTQVNK IYDPACGSGS LLLQAKKHFD
EHIIEEGFFG QEINHTTYNL ARMNMFLHNI NYDKFNIQRG DTLTQPHFQD DKPFDAIVSN
PPYSVKWIGS DDPTLINDDR FAPAGVLAPK SKADFAFVLH ALSYLSAKGR AAIVCFPGIF
YRDGAEKKIR QYLVDNNYVE TVIALASNLF YGTTIAVTIL VLAKNKTDTA IQFIDASGEE
FFKKATNTNL MTDDHIARVM EIFDRKEDVD HVAASVQYEI IVERGYNLSV SSYVEPRDTR
EIVSIGELNA KIRTTVLRID QLRADIDAII AEIDA