Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0971 |
Symbol | |
ID | 4077267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1037740 |
End bp | 1040775 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006274 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_612966 |
Protein GI | 99080812 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC GTCTCGCCAA ACAGGGTCGT CTCATCGACC GCTCCAAGCA GATCGAGTTT TCCTTCAACG GCAAAACCCT CAAGGGGTTT GCTGGCGATA CTCTGGCCTC TGCGCTTCTC GCCAACGATC AGATGCTGGT CGGGCGGTCC TTCAAGTACC ACCGCCCGCG TGGCCTCGTG GCCTCCGGAT CTGAGGAGCC GAACGCGCTG GTGGGCCTCG GTGTCGGCGA CCGCTTCGAG CCGAACCAGC GCGCCACTAC CACCGAGCTT TTCTCGGGCC TCAAGGCGCA ATCGCAGAAC CACTGGCCGA GCCTCGAGTT CGACGTGGGC GAGATCAACT CCAAACTGTT CTCGCGCTTC CTGACTGCGG GCTTCTACTA TAAGATGTTC ATCCACCCGC GTCCCTTCTG GAAGCATGTG TATGAGCCCT TTATTCGCCA GTCCGCAGGG CTCGGCAAAG CCCCCAACAA GGAACTGAAG GACGCCGACA CCTACGAGCA TTTCTATTAT TTCTGTGACG TGCTGGTGAT TGGCGGCGGT GTGGCTGGCC TTCAGGCTGC AAAGACCGCC GCGGCGACCG GCGCCAAGGT TCTGGTGCTC GAGGAAAAAT CCTATTGGGG TGGTCGGACG CCCGTCGACG GCGGTACCAT CGAAGGCCAA TCCGCCGAAG CCTGGATCAG CAAGACCGTG GCCGAGCTTG AGGCAATGGA CAATGTCACC CTGCGCACCC GCACCATGGG GGCAGGCGTT TACGATCATG GCTATATCCT TGGTTATGAA CGCCTCACCG ACCACGCGCC GGGGCAGGGC GGTCCGCGCC ATCGCCTCTG GCGTATCCGT GCGACGCAAA CCGTCACCGC AACCGGTGCC ATCGAACGCC CGCTGTCCTT TGCGGGCAAC GACGTGCCGG GTGTGATGCT GGCCGCTTCC ATGCGCGACT ATGTGGTGAA CTGGGGGGTG ACTCCGGGGC AGAAGGTTGT GGTCGCCACC AACAATGATG ATGCTTACCG CACCGCAATC ACCCTGCATG AGGCAGGCGT CGAGGTCGTG CGCGTTCTGG ACACGCGCGA GAGCGGCGGC GGCGATCTCG CTGACAAGGT CCGCGCGCTT GGTATCCGCG TCGAATGCGG CCGTGCCATC GCCAAGGTGA AGGACGGCAA ACGCGTCACC AAGGTCGCCA TCTGCGCCCA GAACGGCGAG GGCGGCGCCC ATGAGGAGAT CGAAGCCGAT GCTGTGGCGA TGTCCGGTGG CTGGTCGCCC GTGGTTCACC TCTGGTCCCA CTGCGGCGGC AAGCTCATCT GGGACGAGGC GCAGGCGCAT TTCCGCCCCG ACCCGGATCG CGCGCCCACT GGCCATGATG GTCTGCCCTT TGTGGTGACG GCTGGCGCAT CCTCTGGTCC GCTCACGCTC TCTGGCGTGA TGGCCGATGC GGCCGAAGCT GGCGCGCGTG CTGCCGAGGC CGCCGGCCGT CCTGCGCAAA AAGTGTCGCC GGTTGCCACC TCGGATCTCG CTGAAGCGCC TATGGAAGCC GTTTGGATGA TGCCCGCCAA GGCCGACATC AAACTGCGCA TGAAAACCTG GCTCGACTAT CAAAACGACG TCAAGGTGTC CGACGTGCAG CTTGCGGCGC GCGAGGGCTA TGAAAGCGTC GAGCACACCA AACGCTACAC CACGCTGGGC ATGGCGACGG ATCAGGGGAA ACTCTCCAAC ATCAACGGTC TGGCTGTGCT GGCCGATAGT CTGGGCGCCG AGATCCCGCA GGTCGGGACC ACCACGTTCC GCCCGCCGTA TCACCCGATT TCCATGGGAG CGATTGGCGG TGAGGCGCGT GGCGAGATCT TCCAACCCCT GCGCAAGACG CCGATGTATG ACTGGCACGA CAGCAACGGC GCCCATTGGG AGCCGGTGGG CCACTGGCGG CGTCCCTACG CCTATGTGCG CTCGGGCGAG AGCGTCCATC AGGCGGTCAA TCGCGAGGTG AAAAACACCC GCGAGAACCT TGGCCTTCTG GATGCCTCCA CCCTCGGCAA ACTCATCGTC AAAGGGCCGG ATGCGGGCAA GTTCCTCGAC ATGCTCTACA CCAATATGAT GAGCACGCTG AAGATCGGCA AATGCCGCTA TGGCCTGATG TGTTCCGAGA ACGGCTTTCT TGTGGATGAC GGCGTGGTTG CGCGCATCGA CGAGGACACC TGGCTCTGCC ACACCACCAC CGGCGGCGCG GATCGCATTC ACGCCCACAT GGAAGAATGG CTGCAGACCG AATGGTGGGA CTGGAAGGTC TATGTCACCA ACGCCACCGA GCAACTGGCG CAGGTCGCCG TCGTTGGCCC CAACGCCCGC AAGGTGCTGG AAAAGCTGAA TGAAAAAGCA GGCGGCGGTA TGGATCTCTC GAAAGAGGCG CTCGCGTTCA TGGAGTGGAA AGACGGCGAG ATCGGCGGCT TCAAGGCGCG CGCTTACCGG ATTTCCTTCT CGGGCGAGCT CTCTTATGAG ATCGCAGTTT CTGCGTCCGA GGGGCAGGCC TTCTGGAATG CGCTCATCGA GGCGGGCAAG GAATTTGGCG TCATGCCCTA TGGCACCGAA TGTCTGCATA TCCTGCGGGC CGAAAAAGGC TTCATCATGA TTGGTGACGA GACCGACGGC ACCGTGATCC CGCAGGATCT TGGTCTGCAC TGGGCGCTCT CCAAGAAGAA AGAAGACTAT CTTGGCAAGC GCGCGCAGCA GCGCAGCCAC ATGGCCGATC CCGACCGCTG GCAACTGGTG GGTCTGGAAA CCGTAGATGG CTCTGTGTTG CCCGATGGCG CCTATGCGGT TGGCGATGGA AACAACGCCA ATGGCCAGCG CAATACCATC GGCCGCGTGA CGTCGACCTA CTACTCCGCC AACCTTGATC GCGGCATCGC GATGGGGCTG GTGAAACATG GCCCCAAACG CATGGGCGAG GTGATCGACT TCCCCGGCCT TGATGGGAAA ATCTACAAAG CCAAGATCGT CGACCCGATC TTCTATGACA AAGAAGGGGA CAAGCAGAAT GTCTAA
|
Protein sequence | MSTRLAKQGR LIDRSKQIEF SFNGKTLKGF AGDTLASALL ANDQMLVGRS FKYHRPRGLV ASGSEEPNAL VGLGVGDRFE PNQRATTTEL FSGLKAQSQN HWPSLEFDVG EINSKLFSRF LTAGFYYKMF IHPRPFWKHV YEPFIRQSAG LGKAPNKELK DADTYEHFYY FCDVLVIGGG VAGLQAAKTA AATGAKVLVL EEKSYWGGRT PVDGGTIEGQ SAEAWISKTV AELEAMDNVT LRTRTMGAGV YDHGYILGYE RLTDHAPGQG GPRHRLWRIR ATQTVTATGA IERPLSFAGN DVPGVMLAAS MRDYVVNWGV TPGQKVVVAT NNDDAYRTAI TLHEAGVEVV RVLDTRESGG GDLADKVRAL GIRVECGRAI AKVKDGKRVT KVAICAQNGE GGAHEEIEAD AVAMSGGWSP VVHLWSHCGG KLIWDEAQAH FRPDPDRAPT GHDGLPFVVT AGASSGPLTL SGVMADAAEA GARAAEAAGR PAQKVSPVAT SDLAEAPMEA VWMMPAKADI KLRMKTWLDY QNDVKVSDVQ LAAREGYESV EHTKRYTTLG MATDQGKLSN INGLAVLADS LGAEIPQVGT TTFRPPYHPI SMGAIGGEAR GEIFQPLRKT PMYDWHDSNG AHWEPVGHWR RPYAYVRSGE SVHQAVNREV KNTRENLGLL DASTLGKLIV KGPDAGKFLD MLYTNMMSTL KIGKCRYGLM CSENGFLVDD GVVARIDEDT WLCHTTTGGA DRIHAHMEEW LQTEWWDWKV YVTNATEQLA QVAVVGPNAR KVLEKLNEKA GGGMDLSKEA LAFMEWKDGE IGGFKARAYR ISFSGELSYE IAVSASEGQA FWNALIEAGK EFGVMPYGTE CLHILRAEKG FIMIGDETDG TVIPQDLGLH WALSKKKEDY LGKRAQQRSH MADPDRWQLV GLETVDGSVL PDGAYAVGDG NNANGQRNTI GRVTSTYYSA NLDRGIAMGL VKHGPKRMGE VIDFPGLDGK IYKAKIVDPI FYDKEGDKQN V
|
| |