Gene Rpal_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3803 
Symbol 
ID6411481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4082080 
End bp4083780 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content51% 
IMG OID642713684 
ProductSite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_001992777 
Protein GI192292172 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAA AAACGCGCCT AGAACTAACC TGGATCGGTA AGGATGAGCG GCCGCGGCTG 
GAGCCGAGAA TATTGATGGA AGATATTGCA CTATCACATC ATGCGTCGGT GCGGCATTCT
GAGGCGGACA TCTTCGATAA TCTGCTGATC CAAGGCGACA ATCTATTGGC CCTCAAGGCG
CTCGAAGCAA GCTACACGGG CAGGATCAAG TGCGTCATTA TCGACCCGCC ATATAATACG
GGTAGCGCAT TCAAGCACTA CGATGACGGT CTTGAGCACT CGCTATGGCT TTCGTTGATG
CGTGACAGGC TCGACCTGAT TAGGCGCTTG ATGTCAGAAG ATGGATCGCT TTGGATCACA
ATAGATGACA ATGAAGCTCA CTATCTGAAG ATTCTCTGTG ATGAAGTTTT CGGAAGGTCA
AATTTTGTAG CGAACGTCGT TTGGCAGAAG AAGTATTCGA AGCAGAATGA TGCGAAGCAT
TTCAGTACGA GCCACGACCA CATTCTTGTC TTTGCTAAGA ACAAGAATGA GTGGGCGCCA
AACAAGGTTG GAAGAAACCA AAGTCAACTG AAAGGTTATA GCAATCCGGA CGATGATCCA
CGCGGACTAT GGACGTCAGT CGTTTATACC TGTTCCAAGA CTCGCGCAGA ACGGCCCAAT
CTGTTCTATC CAATAAAGCA CCCGGTTACG GACGTTGATG TTTGGCCAAG TGAAACCAGG
GTCTGGGGCT ACGACGAGGC GCGCCATAAG AAGCACGTCG AAGAGAACAT GCTTTGGTGG
GGCAAGAACG GCGAACAAGA AAAGCCGAGA ATAAAGGTGT TCTTATCCAA AGTAGGCGAA
GGAGTTGTTC CTAGCACTAT CTGGCTTCGC GATGAAGTGG GAGACAATCA AGATGCGCGC
CGTGAAGCGA TGGCGCTGAA TTCGGAAGGA TCTTTCTCTA CTCCGAAACC TGAGAGCCTC
ATAAGACAAA TGGTCTCCAT CGCTACAGCT CCCGGCGATC TAGTTCTCGA TTCATTTGCA
GGCTCCGGCA CCACCGGCGC CGTTGCACAC AAGATGGGGC GGCGCTGGAT TATGGTTGAG
CTCGGGGACC ATGCAGTCAC GCACATTGTT CCGCGCCTCA AGTTGGTAAT CAACGGCGCG
GACCGGGGTG GAGTCACCGA TGCGGTGGGC TGGAATGGCG GCGGTGGGTA TCGATTCTGT
CGGCTCGCTC CTTCGCTTCT CGAAAAAGAT CGCTTTGACA ATTGGGTGAT AGCCAAAGAA
TACAACGCCG CTATGCTTGC TGAAGCTCTA TGCAAGCATC TGGGTTTTAC CTATGCGCCT
AGCCAGGACG CGGCCGAATA TTGGCGGCAC GGAAATTCGA CCGAGACCGA CTTCATCTAT
GTCACTACTC AGTCACTGAC TTACGATGCT TTGAAGAAGT TGTCTGAAGA AGTTGGCCCA
AAGCGGACGT TGCTGGTTTG CTGTAAAGCC TTCAATGCGA AGGAGGATAG CTTTCCGAAT
CTTACGGTGA AAAAGATACC CCATGCAATT CTCGCGAAGT GCGAATGGGG TCGAGACGAT
TATTCCCTTC AGATAGCCAG TCTTACAGAA GAGGTAAAGT CCAAGGATTC CAACGGCTCG
GCTGACGATG GAGAAGAAAG GCCCCAACGC CGGAAGTCAA AAACACAATT GCCGCTCTTC
GATTCGACGG AGGGTGAGTG A
 
Protein sequence
MNRKTRLELT WIGKDERPRL EPRILMEDIA LSHHASVRHS EADIFDNLLI QGDNLLALKA 
LEASYTGRIK CVIIDPPYNT GSAFKHYDDG LEHSLWLSLM RDRLDLIRRL MSEDGSLWIT
IDDNEAHYLK ILCDEVFGRS NFVANVVWQK KYSKQNDAKH FSTSHDHILV FAKNKNEWAP
NKVGRNQSQL KGYSNPDDDP RGLWTSVVYT CSKTRAERPN LFYPIKHPVT DVDVWPSETR
VWGYDEARHK KHVEENMLWW GKNGEQEKPR IKVFLSKVGE GVVPSTIWLR DEVGDNQDAR
REAMALNSEG SFSTPKPESL IRQMVSIATA PGDLVLDSFA GSGTTGAVAH KMGRRWIMVE
LGDHAVTHIV PRLKLVINGA DRGGVTDAVG WNGGGGYRFC RLAPSLLEKD RFDNWVIAKE
YNAAMLAEAL CKHLGFTYAP SQDAAEYWRH GNSTETDFIY VTTQSLTYDA LKKLSEEVGP
KRTLLVCCKA FNAKEDSFPN LTVKKIPHAI LAKCEWGRDD YSLQIASLTE EVKSKDSNGS
ADDGEERPQR RKSKTQLPLF DSTEGE