Gene TM1040_1444 details


Gene Information

Locus tag: TM1040_1444
Symbol: dmdA
ID: 4078074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1542571
End bp: 1543728
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 60%
IMG OID: 638006755
Product: putative dimethyl sulfoniopropionate demethylase
Protein accession: YP_613439
Protein GI: 99081285
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID:
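
The coordinate and length fields above can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1542571, 1543728)
print(length_bp, protein_length(length_bp))  # prints "1158 385"
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.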


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.8924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.101905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGCT CGCCCTCAAC TCATGGTATG AAAGGGTTTG CCATGACCGT TCTGAAACCT 
GCCATTTCCC TCTCTCGGCG CCTGCGTCGA ACCCCTTTTT CGGAGGGCGT CGAAGCCGCT
GGAGTCAAAG GCTACACCGT CTACAATCAC ATGCTGCTGC CCACGGTGTT TGAGAGTGTC
GAAGCCGACT ACCACCACCT CAAGCGTCAT GTGCAAGTCT GGGACGTCGC CTGCGAACGC
CAGGTGGAAC TGCGCGGCCC CGACGCCGGA CGCCTGATGC AGATGCTGAC CCCGCGCGAT
CTGCGTGGCA TGATGCCCGG TCAATGTTAC TATGTCCCCA TCGTCGATGA GACCGGCGGG
ATGCTCAATG ATCCCGTAGC CGTAAAGCTC GCAGAGGATC GTTGGTGGAT CTCCATTGCG
GACAGTGATC TGTTGTACTG GGTCAAGGGG ATCGCAAACG GCTGGCGCCT TGATGTGCTG
GTGGATGAAC CGGATGTTTC GCCGCTTGCG GTGCAGGGCC CCAAGGCAGA GGACTTGATG
GCGCGTGTCT TCGGCGAGAC TGTGCGCGCG ATCCGGTTTT TCCGCTTTGG CGTCTACCAG
TTCGAAGGAC GCGATCTGGT GGTGGCAAGG TCGGGCTACT CAAAGCAGGG TGGGTTCGAG
ATCTACGTCG AAGGCGGCGA TCTTGGCATG CCGCTCTGGA ACCGTCTGTT TGAGGCTGGC
GCAGATCTCG AGGTGCGTGC GGGCTGTCCC AACCTCATTG AGCGGATCGA GAGCGGTCTT
CTGAGCTACG GCAATGATAT GACCGACGAC AACACACCGC ACGAATGCGG CCTTGGGCGG
TTCTGTAACA CCCACACGGC CATTGGGTGT ATCGGGCGTG ATGCGCTGCT GCGGGTGGCC
AAGGAAGGCC CGGTGCAGCA GATCCGCCCG ATCGAGATTT CCGGCGAAGC GGTGCCGCCC
TGTGATCAAT TCTGGCCGCT CGTTGCAAAT GGGCGTCGTG TCGGTCGGGT CTCCTCGGCC
ACCTGGTCGC CGGATCATGC CACGAATGTT GCGATCGGCA TGGTCAAGAT GACGCATTGG
GATGCGGGGA CGCAGCTAGA GGTGGAGACA CCGGATGGAA TGCGTACTGC TCTGGTGCGC
GAAAATTTCT GGAATTAA
 
Protein sequence
MRRSPSTHGM KGFAMTVLKP AISLSRRLRR TPFSEGVEAA GVKGYTVYNH MLLPTVFESV 
EADYHHLKRH VQVWDVACER QVELRGPDAG RLMQMLTPRD LRGMMPGQCY YVPIVDETGG
MLNDPVAVKL AEDRWWISIA DSDLLYWVKG IANGWRLDVL VDEPDVSPLA VQGPKAEDLM
ARVFGETVRA IRFFRFGVYQ FEGRDLVVAR SGYSKQGGFE IYVEGGDLGM PLWNRLFEAG
ADLEVRAGCP NLIERIESGL LSYGNDMTDD NTPHECGLGR FCNTHTAIGC IGRDALLRVA
KEGPVQQIRP IEISGEAVPP CDQFWPLVAN GRRVGRVSSA TWSPDHATNV AIGMVKMTHW
DAGTQLEVET PDGMRTALVR ENFWN
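
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCGCCGCT CGCCCTCAAC TCATGGTATG AAAGGGTTTG CCATGACCGT TCTGAAACCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two helpers would give a value comparable to the GC content field of the record.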