Gene TM1040_2682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2682 
Symbol 
ID4077593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2819870 
End bp2820934 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content65% 
IMG OID638008006 
ProductDNA-O6-methylguanine--protein-cysteine S-methyltransferase / transcriptional regulator Ada 
Protein accessionYP_614676 
Protein GI99082522 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG2169] Adenosine deaminase 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.100688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATGT TTGACCTGCC CGATTCAAAA ACCCTTTACG ACGCGCTGGT GGCGCGGGAT 
GCAAGCTATG ACGGGCGCGC CTATGTGGGC GTGTCCTCGA CCGGGATCTT TTGTCGCCTG
ACCTGCCCCG CGCGCAAGCC GAAGTATGAG AATTGCCAGT TTTTTGCCAC TCCCGGTGCC
TGCATCGAGG CTGGGTTTCG CGCCTGCAAG CGTTGTCACC CCCTGAATGC GGCGGCGGGG
AATGATCCGC TGGTGCGCCG CCTGCTGGAG GCGCTGGAGG CACGGCCCTT CTATCGCTGG
CGCGAGGAGG ATCTGGTGGC CATGGGCGCA GACCCTTCTA CCATCCGCCG GGCGTTCAAG
CGTCAGTTTG GCATGACCTT TCTGGAAATG GCGCGGCAAC GCCGCTTGCG CGAGGGGTTC
ACCACGCTCA GTGCCGGGCA GCCCGTGATC TCGGCGCAGC TTGAGGCCGG TTTTGAAAGC
CCCAGCGCTT TTCGCGCGGC CTTTGCCCGA TTGGTCGGGA TTGCGCCAGG GCAGTTTCGC
AAGGATGCGC AGCTTTTGGC GGATTGGATC GAGACCCCGC TGGGCGCGAT GGTGGCGGTG
GCCTGCCAGC ACAAGCTGCA TCTGTTGGAG TTTGCAGACC GCAAGGCTTT GCCGCGCGAA
GTGGCAAAGC TGCAAAAACG CCAGCCCGGC GGCATCGGCT TTGGCCGGCC CGAGGTGGTG
GATCAGGCGG CAGAGGAGTT GGGTGCCTAT TTTACGGGGC GGGCAGCGCG GTTCGACACG
CCGCTGGCCT ATCACGGCAC GGCCTTTGAG GCAGACGTCT GGCGTGCGCT CAGGGAGATC
CCGGTGGGGC AGACCCGCAG CTACGGCGCC TTGGCGCAGA GCCTTGGGCG ACCGGGATCA
AGCCGTGCGG TGGCGCGGGC CAACGGGGCC AACCAGATCG CGGTGATGAT CCCCTGCCAC
AGGGTTCTCG GCGCGGATGG GGCATTGACA GGATATGGCG GTGGGCTCTG GCGCAAACAG
AGGCTTATTG AAATCGAACG CGACCTCAGC GCGGCCACTG GATGA
 
Protein sequence
MMMFDLPDSK TLYDALVARD ASYDGRAYVG VSSTGIFCRL TCPARKPKYE NCQFFATPGA 
CIEAGFRACK RCHPLNAAAG NDPLVRRLLE ALEARPFYRW REEDLVAMGA DPSTIRRAFK
RQFGMTFLEM ARQRRLREGF TTLSAGQPVI SAQLEAGFES PSAFRAAFAR LVGIAPGQFR
KDAQLLADWI ETPLGAMVAV ACQHKLHLLE FADRKALPRE VAKLQKRQPG GIGFGRPEVV
DQAAEELGAY FTGRAARFDT PLAYHGTAFE ADVWRALREI PVGQTRSYGA LAQSLGRPGS
SRAVARANGA NQIAVMIPCH RVLGADGALT GYGGGLWRKQ RLIEIERDLS AATG