Gene TM1040_1691 details


Gene Information

Locus tag: TM1040_1691
Symbol:
ID: 4078267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1784602
End bp: 1786695
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 61%
IMG OID: 638007004
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_613686
Protein GI: 99081532
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
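
The coordinate and length fields in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the final codon of the gene is a stop codon. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic follows; the function names are illustrative only and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1784602, 1786695)
print(length)                     # 2094
print(protein_length_aa(length))  # 697
```

Both values match the Gene Length and Protein Length fields of the record.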


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC AGCTTCCCCT TGAATTCCTT GACGCAGCGC GGGTTTACGA GCGCCCCTTG 
ATTGTCGACA GCTTTGCGGG CGGCGGCGGG GCCAGCACTG GCATTGAGCT CGCTCTCGAC
CGTAGTCCGG ATATCGCGAT CAATCATGAT CCGGCAGCGC TTGCGCTACA TGAAGCGAAC
CACCCCGAGG CGTTACACCT TTCGGAGAAC GTATATCGCA TTGACCCGCT TGAGCATTTG
AGTGGAAAGC ACATCGGCCT GATGTGGTTC AGCCCCGATT GCAAGCACTT CTCCAAGGCC
AAGGGCGGTA AGCCGGTAGC TCGCAACATC CGCGATCTCG CTTGGATTAT TCCGGGCTGG
ATCGAAAGGA TTCAGAAGAG CGGCGGCAAG GTCGACGTAG TTCTTATGGA GAATGTCGAG
GAGTTCGCTG GCTGGGGGCC ACTGATCGAG ACCGACAAAG GCCTGATGCC GTGCCCAGAG
CGCAAGGGGG AAACGTTCGA AGCGTGGTGC AAAAAGATCC GCAGCCTTGG TGGCAAGCTA
GAGCGAAGAG AACTCCGAGG CTGTGACTAC GGAGCGCCAA CAATTCGCAA GCGCCTATTT
GTGGCGATCC GCTTTGATGG GGAGCCTGTC AGTTGGCCCA CGCCCACACA CGGCGATCCC
AACAGCGCCG AGGTGAAGTC GGGCAAGCTC AAACCGTGGC GAACCGCAGC AGAATGTATC
GACTGGAGTC ACCCTTGCCC CTCGATTTTC GACAGCAAGG CCGAGATCAT GGAAAAATAC
GGACTGCGCT CGGTGCGCCC TCTGGCCCAC AACACGCTTG CGCGCGTGGC TCGTGGCCTG
CATCGCTATG TTCTGCAGGC TGAGCGTCCG TTTCTGGTGA ACCTGACCCA CGGCGGGCGC
GTCGAGGACG TCGCAGAACC CTTCAAGACA ATCACGGGGG CCAACCGGGG CGAAAAGGCG
ATTGTTGCGC CCTCCCTTGT CAGCGTCGCG CATGGCGATA GCGGCGGTCG TCGAAAATAC
CCACTGACCG ATCCATATGG CGTTGTAACG GCCGGAGGCG TTTCAAACGC ACTCATTGCC
CCCAGCATTG CACGTTTCAC CACTGGCGGA ACCGGACACC AAATCAAATC TCCCCTCGCA
ACGGTCCCGG CAAACAGCTT CATCAAGCGC CCAGGTGGTG CCGCCCCGCT TGGAGTGCTG
GCACCGTACC TTGCCACCAT GCGCAACAGC CTGAAGCCTT GGCAGGAGGT AACGAAGCCC
ACCCACACCG TCACGGCGGG CGGCGCAGGC CTTACGCTTT GCGCGCCCCA CCTGATGAGC
CTGAAAGGCA CTGCGCGCCG GGATCGTGCT GCAGACGCAC CTCATCCGAC CGTTCTGGCC
GGAGGCGGCC ACTCCGCTCT CATGGCCCCC GTCTTGACCT ACGCACAACA GGGCGGAGCC
AATCGCTCTA TGCTGGACCC GCATCATACG ATCTGCGCGA GCAAAAAGGA CCAGAACAGC
CTGTTCTCGG CCTTTCTCGC CCAGCAAAAC GGCGGACCGC GCATGGCGGG GCACAGTGGC
CATGATCCCC GCGAGCCGAT TTCCACCGTC ACCGGCAGCG GGAGCCAGCA GACGCCGGTG
GCCGCGTTCT TCGCCAAATA TTACGGCACC GGTGATGGAG CGCGCGCGGA CGCTCCGCTG
CACACTATCA CAGTCAAGGA TCGCATGGCG CATTGCCAAG CGGATGTGAT CCCAGCTCCA
CCGTTCACTG ACGCCCATAC GGAGCGCGCC CGCCAAGTCG CAAGCCTGAT GCGCGACCAT
GGCCTTTGGG ATGAACGCGA GTTCGTCACA CTCGAGATCG AGGGGCAAGC TTTCATCATC
GTAGATGTCG GTATGCGCAT GCTCACCCCG CGAGAGCTCT TCAACGCCCA AGGCTTCCCT
GCAGACTACG TGATCGAGGG AATTTGGAAG CAGGAAAGCG ATGACTGGAC CTTTTCCACG
TTCCCCAAGG ACGTGCAGGT CCGGTGCGTG GGCAATAGCG TCTGCCCACC AGTGGCCGAG
GCGCTGGTGC GCGCCAACTG TTCGCACCTT ATCGAGATGG AGGACTCGCA GTGA
 
Protein sequence
MTAQLPLEFL DAARVYERPL IVDSFAGGGG ASTGIELALD RSPDIAINHD PAALALHEAN 
HPEALHLSEN VYRIDPLEHL SGKHIGLMWF SPDCKHFSKA KGGKPVARNI RDLAWIIPGW
IERIQKSGGK VDVVLMENVE EFAGWGPLIE TDKGLMPCPE RKGETFEAWC KKIRSLGGKL
ERRELRGCDY GAPTIRKRLF VAIRFDGEPV SWPTPTHGDP NSAEVKSGKL KPWRTAAECI
DWSHPCPSIF DSKAEIMEKY GLRSVRPLAH NTLARVARGL HRYVLQAERP FLVNLTHGGR
VEDVAEPFKT ITGANRGEKA IVAPSLVSVA HGDSGGRRKY PLTDPYGVVT AGGVSNALIA
PSIARFTTGG TGHQIKSPLA TVPANSFIKR PGGAAPLGVL APYLATMRNS LKPWQEVTKP
THTVTAGGAG LTLCAPHLMS LKGTARRDRA ADAPHPTVLA GGGHSALMAP VLTYAQQGGA
NRSMLDPHHT ICASKKDQNS LFSAFLAQQN GGPRMAGHSG HDPREPISTV TGSGSQQTPV
AAFFAKYYGT GDGARADAPL HTITVKDRMA HCQADVIPAP PFTDAHTERA RQVASLMRDH
GLWDEREFVT LEIEGQAFII VDVGMRMLTP RELFNAQGFP ADYVIEGIWK QESDDWTFST
FPKDVQVRCV GNSVCPPVAE ALVRANCSHL IEMEDSQ
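
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that, to compute a GC fraction, and to translate the gene sequence into protein. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). All function and variable names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence from this record.
first_line = "ATGACCGCAC AGCTTCCCCT TGAATTCCTT GACGCAGCGC GGGTTTACGA GCGCCCCTTG"
print(translate(first_line))  # MTAQLPLEFLDAARVYERPL
```

The printed residues match the first twenty residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.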