Gene TM1040_0971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0971 
Symbol 
ID4077267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1037740 
End bp1040775 
Gene Length3036 bp 
Protein Length1011 aa 
Translation table11 
GC content62% 
IMG OID638006274 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_612966 
Protein GI99080812 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC GTCTCGCCAA ACAGGGTCGT CTCATCGACC GCTCCAAGCA GATCGAGTTT 
TCCTTCAACG GCAAAACCCT CAAGGGGTTT GCTGGCGATA CTCTGGCCTC TGCGCTTCTC
GCCAACGATC AGATGCTGGT CGGGCGGTCC TTCAAGTACC ACCGCCCGCG TGGCCTCGTG
GCCTCCGGAT CTGAGGAGCC GAACGCGCTG GTGGGCCTCG GTGTCGGCGA CCGCTTCGAG
CCGAACCAGC GCGCCACTAC CACCGAGCTT TTCTCGGGCC TCAAGGCGCA ATCGCAGAAC
CACTGGCCGA GCCTCGAGTT CGACGTGGGC GAGATCAACT CCAAACTGTT CTCGCGCTTC
CTGACTGCGG GCTTCTACTA TAAGATGTTC ATCCACCCGC GTCCCTTCTG GAAGCATGTG
TATGAGCCCT TTATTCGCCA GTCCGCAGGG CTCGGCAAAG CCCCCAACAA GGAACTGAAG
GACGCCGACA CCTACGAGCA TTTCTATTAT TTCTGTGACG TGCTGGTGAT TGGCGGCGGT
GTGGCTGGCC TTCAGGCTGC AAAGACCGCC GCGGCGACCG GCGCCAAGGT TCTGGTGCTC
GAGGAAAAAT CCTATTGGGG TGGTCGGACG CCCGTCGACG GCGGTACCAT CGAAGGCCAA
TCCGCCGAAG CCTGGATCAG CAAGACCGTG GCCGAGCTTG AGGCAATGGA CAATGTCACC
CTGCGCACCC GCACCATGGG GGCAGGCGTT TACGATCATG GCTATATCCT TGGTTATGAA
CGCCTCACCG ACCACGCGCC GGGGCAGGGC GGTCCGCGCC ATCGCCTCTG GCGTATCCGT
GCGACGCAAA CCGTCACCGC AACCGGTGCC ATCGAACGCC CGCTGTCCTT TGCGGGCAAC
GACGTGCCGG GTGTGATGCT GGCCGCTTCC ATGCGCGACT ATGTGGTGAA CTGGGGGGTG
ACTCCGGGGC AGAAGGTTGT GGTCGCCACC AACAATGATG ATGCTTACCG CACCGCAATC
ACCCTGCATG AGGCAGGCGT CGAGGTCGTG CGCGTTCTGG ACACGCGCGA GAGCGGCGGC
GGCGATCTCG CTGACAAGGT CCGCGCGCTT GGTATCCGCG TCGAATGCGG CCGTGCCATC
GCCAAGGTGA AGGACGGCAA ACGCGTCACC AAGGTCGCCA TCTGCGCCCA GAACGGCGAG
GGCGGCGCCC ATGAGGAGAT CGAAGCCGAT GCTGTGGCGA TGTCCGGTGG CTGGTCGCCC
GTGGTTCACC TCTGGTCCCA CTGCGGCGGC AAGCTCATCT GGGACGAGGC GCAGGCGCAT
TTCCGCCCCG ACCCGGATCG CGCGCCCACT GGCCATGATG GTCTGCCCTT TGTGGTGACG
GCTGGCGCAT CCTCTGGTCC GCTCACGCTC TCTGGCGTGA TGGCCGATGC GGCCGAAGCT
GGCGCGCGTG CTGCCGAGGC CGCCGGCCGT CCTGCGCAAA AAGTGTCGCC GGTTGCCACC
TCGGATCTCG CTGAAGCGCC TATGGAAGCC GTTTGGATGA TGCCCGCCAA GGCCGACATC
AAACTGCGCA TGAAAACCTG GCTCGACTAT CAAAACGACG TCAAGGTGTC CGACGTGCAG
CTTGCGGCGC GCGAGGGCTA TGAAAGCGTC GAGCACACCA AACGCTACAC CACGCTGGGC
ATGGCGACGG ATCAGGGGAA ACTCTCCAAC ATCAACGGTC TGGCTGTGCT GGCCGATAGT
CTGGGCGCCG AGATCCCGCA GGTCGGGACC ACCACGTTCC GCCCGCCGTA TCACCCGATT
TCCATGGGAG CGATTGGCGG TGAGGCGCGT GGCGAGATCT TCCAACCCCT GCGCAAGACG
CCGATGTATG ACTGGCACGA CAGCAACGGC GCCCATTGGG AGCCGGTGGG CCACTGGCGG
CGTCCCTACG CCTATGTGCG CTCGGGCGAG AGCGTCCATC AGGCGGTCAA TCGCGAGGTG
AAAAACACCC GCGAGAACCT TGGCCTTCTG GATGCCTCCA CCCTCGGCAA ACTCATCGTC
AAAGGGCCGG ATGCGGGCAA GTTCCTCGAC ATGCTCTACA CCAATATGAT GAGCACGCTG
AAGATCGGCA AATGCCGCTA TGGCCTGATG TGTTCCGAGA ACGGCTTTCT TGTGGATGAC
GGCGTGGTTG CGCGCATCGA CGAGGACACC TGGCTCTGCC ACACCACCAC CGGCGGCGCG
GATCGCATTC ACGCCCACAT GGAAGAATGG CTGCAGACCG AATGGTGGGA CTGGAAGGTC
TATGTCACCA ACGCCACCGA GCAACTGGCG CAGGTCGCCG TCGTTGGCCC CAACGCCCGC
AAGGTGCTGG AAAAGCTGAA TGAAAAAGCA GGCGGCGGTA TGGATCTCTC GAAAGAGGCG
CTCGCGTTCA TGGAGTGGAA AGACGGCGAG ATCGGCGGCT TCAAGGCGCG CGCTTACCGG
ATTTCCTTCT CGGGCGAGCT CTCTTATGAG ATCGCAGTTT CTGCGTCCGA GGGGCAGGCC
TTCTGGAATG CGCTCATCGA GGCGGGCAAG GAATTTGGCG TCATGCCCTA TGGCACCGAA
TGTCTGCATA TCCTGCGGGC CGAAAAAGGC TTCATCATGA TTGGTGACGA GACCGACGGC
ACCGTGATCC CGCAGGATCT TGGTCTGCAC TGGGCGCTCT CCAAGAAGAA AGAAGACTAT
CTTGGCAAGC GCGCGCAGCA GCGCAGCCAC ATGGCCGATC CCGACCGCTG GCAACTGGTG
GGTCTGGAAA CCGTAGATGG CTCTGTGTTG CCCGATGGCG CCTATGCGGT TGGCGATGGA
AACAACGCCA ATGGCCAGCG CAATACCATC GGCCGCGTGA CGTCGACCTA CTACTCCGCC
AACCTTGATC GCGGCATCGC GATGGGGCTG GTGAAACATG GCCCCAAACG CATGGGCGAG
GTGATCGACT TCCCCGGCCT TGATGGGAAA ATCTACAAAG CCAAGATCGT CGACCCGATC
TTCTATGACA AAGAAGGGGA CAAGCAGAAT GTCTAA
 
Protein sequence
MSTRLAKQGR LIDRSKQIEF SFNGKTLKGF AGDTLASALL ANDQMLVGRS FKYHRPRGLV 
ASGSEEPNAL VGLGVGDRFE PNQRATTTEL FSGLKAQSQN HWPSLEFDVG EINSKLFSRF
LTAGFYYKMF IHPRPFWKHV YEPFIRQSAG LGKAPNKELK DADTYEHFYY FCDVLVIGGG
VAGLQAAKTA AATGAKVLVL EEKSYWGGRT PVDGGTIEGQ SAEAWISKTV AELEAMDNVT
LRTRTMGAGV YDHGYILGYE RLTDHAPGQG GPRHRLWRIR ATQTVTATGA IERPLSFAGN
DVPGVMLAAS MRDYVVNWGV TPGQKVVVAT NNDDAYRTAI TLHEAGVEVV RVLDTRESGG
GDLADKVRAL GIRVECGRAI AKVKDGKRVT KVAICAQNGE GGAHEEIEAD AVAMSGGWSP
VVHLWSHCGG KLIWDEAQAH FRPDPDRAPT GHDGLPFVVT AGASSGPLTL SGVMADAAEA
GARAAEAAGR PAQKVSPVAT SDLAEAPMEA VWMMPAKADI KLRMKTWLDY QNDVKVSDVQ
LAAREGYESV EHTKRYTTLG MATDQGKLSN INGLAVLADS LGAEIPQVGT TTFRPPYHPI
SMGAIGGEAR GEIFQPLRKT PMYDWHDSNG AHWEPVGHWR RPYAYVRSGE SVHQAVNREV
KNTRENLGLL DASTLGKLIV KGPDAGKFLD MLYTNMMSTL KIGKCRYGLM CSENGFLVDD
GVVARIDEDT WLCHTTTGGA DRIHAHMEEW LQTEWWDWKV YVTNATEQLA QVAVVGPNAR
KVLEKLNEKA GGGMDLSKEA LAFMEWKDGE IGGFKARAYR ISFSGELSYE IAVSASEGQA
FWNALIEAGK EFGVMPYGTE CLHILRAEKG FIMIGDETDG TVIPQDLGLH WALSKKKEDY
LGKRAQQRSH MADPDRWQLV GLETVDGSVL PDGAYAVGDG NNANGQRNTI GRVTSTYYSA
NLDRGIAMGL VKHGPKRMGE VIDFPGLDGK IYKAKIVDPI FYDKEGDKQN V