Gene TM1040_3541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3541 
Symbol 
ID4075219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp580185 
End bp581759 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content59% 
IMG OID638005055 
ProductN-6 DNA methylase 
Protein accessionYP_611774 
Protein GI99078516 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCCCA GCCACATCCG CGCTCAAATA GACCAAATCT GGAATGCCTT CTGGTCAGGC 
GGTGTCTCCA ATCCGCTCTC CGTGATCGAA CAGATCACCT ACCTGCTCTT TATCAAGCGG
CTCGACGAAA TCCATACGCG GGAAGAGGCC AAGGCCAACA TGCTCGGCTC AGAGATGGAG
CGGCGCATCT TCCCCGAGGG CACCTTCACC TACAAGGTCA GCGACGATCC AAAAGATGAC
GAAACCATCG AACGTCCTTA CGACGATCTG CGTTGGCAAC GGCTGATCAA CTTCGAAAAC
CGCGAGAAGA TGAAGCTGAT GGATCAGCAC GTCTTTCCCT TCATGCGGAC CATGGCGGAG
GAAGGAACAG CCTTCGCAAC CCACATGAAA GACGCGCGTC TGGGTTTTTC CAGCCCGGCT
CTTCTCGACA AGGTTATGCG ACTTCTCGAC GTCATCCAGA TGGATGACCG GGACACCAAG
GGCGATGTCT ACGAATACAT GCTCGGCAAG ATCGCCAGCG CCGGGCAAAA CGGCCAGTTC
CGCACGCCGC GTCACATCAT CGAGCTGATG GTGCGCCTGA TGGCTCCTAC GCCCAAGGAC
ACGATCTGTG ATCCTGCAGC GGGCACCTGC GGTTTCCTTG TCACGGCAGG GGAGTTTCTG
CGCGAAACCC ACCCGGAAAT GCTGCGCAAC CCCGAACAGC GCCAGCACTT CCATAACAGC
ATGTTCCACG GGTTCGATTT TGACCCGACC ATGCTGCGCA TCGGGTCGAT GAACATGGTT
CTGCACGGGG TCGAAAATGC CGACGTAGCT TACCGCGATA GTCTGGCCGA AGAACATGGG
GCCGACACGG GCACTTATTC CTTAATCCTT GCCAACCCGC CATTCGCGGG GTCGCTGGAC
TACGACGCGA CGGCCAAGGA CCTCCAGAAA GTTGTGAAGA CCAAGAAAAC GGAACTGCTG
TTTCTGGCCC TCTTCCTGCG CCTAATGCGC ACCGGTGGCC GTGCCGCTGT AGTTGTGCCC
GAAGGAGTTC TGTTTGGCTC CTCCAAGGCG CACAAGGACA TCCGCCGGAT CATCGTCGAA
GATCAAAAGC TCGATGCGAT CATCAAGCTG CCCTCGGGCG TGTTCCGCCC CTATGCCGGG
GTGTCCACTG CCATCATGAT CTTTACCAAG ACCGAAAGCG GCGGCACGGA TAACGTCTGG
TTTTATGACA TGGAGGCGGA CGGGCTGAGC CTGGATGACA AGCGCACCGA TCTGTTGCCG
TCTGAAAAGT TGGGGCCAGT GCCCGCCGAG GCGCTGACCG AGGAAGAGCA CGCCAAGAAC
AATCTGCCTG ACATCCTGGC CCGTTGGGGC GCGCTGGAGG TCGAGGGGGA ACGCCCCCGC
ACCGCGCAAA GTTTCATGGT GCCGCTGGCC GTTATTCAGG CCACGGGCAC ATGGGACCTG
TCGCTCAACC GCTACAAGGA AGTGGCGCAT AAGGAGGTCG ACCACCAGCC CCCGAAAGAG
ATCATCGCCG AGCTGCGCGC GATTGAGGCC GAGATTGCCG ATGGGCTGGA TCGGTTGGAG
GAGATGCTGG GATGA
 
Protein sequence
MLPSHIRAQI DQIWNAFWSG GVSNPLSVIE QITYLLFIKR LDEIHTREEA KANMLGSEME 
RRIFPEGTFT YKVSDDPKDD ETIERPYDDL RWQRLINFEN REKMKLMDQH VFPFMRTMAE
EGTAFATHMK DARLGFSSPA LLDKVMRLLD VIQMDDRDTK GDVYEYMLGK IASAGQNGQF
RTPRHIIELM VRLMAPTPKD TICDPAAGTC GFLVTAGEFL RETHPEMLRN PEQRQHFHNS
MFHGFDFDPT MLRIGSMNMV LHGVENADVA YRDSLAEEHG ADTGTYSLIL ANPPFAGSLD
YDATAKDLQK VVKTKKTELL FLALFLRLMR TGGRAAVVVP EGVLFGSSKA HKDIRRIIVE
DQKLDAIIKL PSGVFRPYAG VSTAIMIFTK TESGGTDNVW FYDMEADGLS LDDKRTDLLP
SEKLGPVPAE ALTEEEHAKN NLPDILARWG ALEVEGERPR TAQSFMVPLA VIQATGTWDL
SLNRYKEVAH KEVDHQPPKE IIAELRAIEA EIADGLDRLE EMLG