Gene Bpro_1940 details


Gene Information

Locus tag: Bpro_1940
Symbol:
ID: 4015421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 2000515
End bp: 2002122
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 61%
IMG OID: 637941607
Product: N-6 DNA methylase
Protein accession: YP_548769
Protein GI: 91787817
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
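
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2000515
end_bp = 2002122
gene_length_bp = 1608
protein_length_aa = 535

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons = span // 3

print(span == gene_length_bp)           # coordinate span matches Gene Length
print(span % 3 == 0)                    # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks print `True` for this record: 2002122 - 2000515 + 1 = 1608 bp, which is 536 codons, or 535 amino acids plus one stop codon.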


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0625168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0625802
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGACG ACATCAAAAA AACACTCTGG GCGACCGCCG ACAAGTTGCG CGCCAATATG 
GACGCCGCCG AATACAAGCA CCTGGTGCTG GGCCTGATCT TCGTCAAGTA CATCTCCGAC
ACCTTCGCCG CCCGCCGGTC AGAAGTCGCC ACCCGGCTGG CAGATCCCAA AGACGAATAC
TTTTTTGAAG GCGCCACGCC CAAAACCTTG GCTGTTGAGC TAGAAGACCG CGATTACTAC
AAGTCGGTCA ACGTCTTCTG GGTACCCGAA GCTGCCCGCT GGGAAAGCCT GCGCGCCGCG
GCCAAACAGC CTGACATCGG CAAGCGCATT GACGAAGCCC TGACGCTGGT CGAAGTGGAA
AACCCAAAGC TCAAGGGCAT CCTCGACAAG CGTTACGCCC GCGCCCAGTT GCCCGATGGC
AAGCTCGGCG AGCTGGTGGA CTTGATTTCG ACTGTCGGGT TTGGCGACAA CCCTTCGGTA
GCGCGCGACA TCCTGGGCCA AGTGTACGAA TACTTCCTCG GCATGTTCGC CAGTGCGGAA
GGCAAACGCG GCGGGCAGTT TTACACGCCG GCCAGCATCG TCAAAACCCT GGTCGCCGTG
CTCAACCCGC ACAGCGGCAA GGTGTACGAC CCGTGCTGCG GCTCGGGTGG CATGTTTGTC
CAGAGTGAAA AGTTCATTGA AGCGCACGGC GGCAAGCTGG GAGATGCATC CATCTACGGA
CAAGAGGCCA ACCCCACCAC CTGGCGCCTG GCCGCTATGA ATCTGGCAAT TCGCGGCATT
GACTTCAACC TGGGACGCGA ACCCGCCGAC ACCTTTGTGC GCAACCAGCA CCCCGACCTG
CGCGCCGACT TTATCCTGGC CAACCCGCCT TTCAACATCA GCGACTGGTG GCACGCCAGC
CTCACGGGTG ACGCCCGCTG GCAGTACGGC GACCCGCCGA CCGGCAATGC CAACTACGCT
TGGCTGCAGC ACATGCTGCA CCACCTCAAA CCCACGGGTC GTGCCGGCAT TGTGCTGGCC
AATGGCAGCA TGAGCTCAAG TCAGAACAGC GAGGGCCAGA TCCGCGCCGC GATGGTTGAG
GCCGACGTGG TTGAGGTCAT GATTGCACTG CCGGGCCAAC TGTTCTTCAA CACGCAAATT
CCTGCCTGCC TGTGGTTTCT GGTCAAGAAG AAAACCCGCA GACAAGGCGA AGTGCTGTTC
ATTGATGCCC GCAAGCTCGC CACCATGATC AGTCGGGTGC AGAGCGAGTT CACCGACGAG
GTCATAGCCC GAATTGCCAA CACAGTGGCA GCCTGGCGCG GCGAATCACT CCCTCTCCCC
CTGGGACTGG GTGAGGGTGA GGGTGAGGGT GAGGGCTCCC CGGCCCCAGC TCCCTACGCC
GACATCCCGG GCTTCTGCCG CAGCGTCAAG CTCAAAGAAA TCGCCCAGCA CGGCCATGTC
CTCACCCCTG GCCGGTACGT AGGTGCAGAA GAGGTGGAAG ACAACGATGA AGACTTCGCC
ACCAAGATGC AGCAGCTCAC TGAAAAGCTG GGTGAGCAGA TGGCGAAGGG GGCGGAACTG
GATCAGTTGA TTCGGCAAAA GCTGGGGGGG CTGGGGTATG AGTTCTGA
 
Protein sequence
MLDDIKKTLW ATADKLRANM DAAEYKHLVL GLIFVKYISD TFAARRSEVA TRLADPKDEY 
FFEGATPKTL AVELEDRDYY KSVNVFWVPE AARWESLRAA AKQPDIGKRI DEALTLVEVE
NPKLKGILDK RYARAQLPDG KLGELVDLIS TVGFGDNPSV ARDILGQVYE YFLGMFASAE
GKRGGQFYTP ASIVKTLVAV LNPHSGKVYD PCCGSGGMFV QSEKFIEAHG GKLGDASIYG
QEANPTTWRL AAMNLAIRGI DFNLGREPAD TFVRNQHPDL RADFILANPP FNISDWWHAS
LTGDARWQYG DPPTGNANYA WLQHMLHHLK PTGRAGIVLA NGSMSSSQNS EGQIRAAMVE
ADVVEVMIAL PGQLFFNTQI PACLWFLVKK KTRRQGEVLF IDARKLATMI SRVQSEFTDE
VIARIANTVA AWRGESLPLP LGLGEGEGEG EGSPAPAPYA DIPGFCRSVK LKEIAQHGHV
LTPGRYVGAE EVEDNDEDFA TKMQQLTEKL GEQMAKGAEL DQLIRQKLGG LGYEF
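
The sequences above are printed in space-separated blocks of ten characters, and the record reports a GC content of 61%. The following is a rough sketch of how such a block-formatted sequence could be joined and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself rounds or computes the reported value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the Gene sequence block, copied from the record.
first_line = "ATGCTTGACG ACATCAAAAA AACACTCTGG GCGACCGCCG ACAAGTTGCG CGCCAATATG "
seq = clean_sequence(first_line)
print(len(seq))         # 60 bases: six blocks of ten
print(gc_percent(seq))  # GC percentage of this first line only
```

To check the reported figure, the full Gene sequence block would be passed through `clean_sequence` before calling `gc_percent`; the joined length should then equal the Gene Length field.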