Gene Information |
Locus tag | TM1040_2145 |
Symbol | |
ID | 4076459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2250829 |
End bp | 2253774 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638007465 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_614139 |
Protein GI | 99081985 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
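The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2250829, 2253774)
    print(length)                     # gene length in bp
    print(protein_length_aa(length))  # protein length in aa
```

With the values listed in this record, the sketch reproduces the stated gene length of 2946 bp and protein length of 981 aa.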
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAGTAT CTGGCAAGGG GCTTCTGTCC GGCAACCGGA CTGTGAATTT TTCCTTTAAT GGCAAGGCCT ACATGGGGCT GGAGGGGGAC ACGCTGGCCT CGGCGCTTTT GGCCAATGAT GTGCACTTGG TGGGGCGGTC GTTCAAATAC CACCGCCCGC GGGGTATCCT GACAGCAGGC TCCGAGGAGC CCAACGCGCT GGTGAGCGTG GGACGCGGGG CAGCGCGAGA TCCAAATATC CGCGCCACCC AGCAGGAAAT TTTTGAGGGG CTGGCAGCGC AGAGCCAGAA CTGCTGGCCT TCGGTCGAGC GCGATGTCAT GGCGGTGAAT GATCTGGCAG CGCCCTTCCT TGGGGCGGGG TTTTATTACA AGACCTTCAT GTGGCCAGCG CCCCTGTGGG AGAAGTTCTA CGAGCCGATC ATTCGCCGGG CGGCGGGGCT TGGGGCCTTG TCCGGGGCGG CCAACAAGGA CCCTTACGAG AAGGCCTTTG CCTTTTGCGA CCTGCTGGTG ATCGGGTCTG GCCCAGCCGG GCTGATGGCT GCACTGACGG CAGGGCGTGC GGGGGCGGAT GTGATCCTCG CCGAGGATGA CAGCCGGATG GGCGGGCGTC TTCTGGCGGA GATCGAGGAG ATCGACGGCA AGCCCGCGCA TGTCTGGGTG GCAGAGGTGC TGGCGGAGCT GGAGGCCATG GAGAACGTGC GTCTGATGCC GCGCACCGCG GTCACCGGCG TCTACGATCA AGGGACCTAT TCGGCGCTCG AGCGGGTTTC GCACCACCTT CCGCCGAGCC AGTCCCTGAA GACCGCGCAT CGCCCGCGCG AGTGTTTCTG GCGCATCGCC GCCAAAGCCG CGGTGCTGGC GGCGGGGGCT ATTGAACGCC CCATTGCCTT TCGCAACAAC GACCGCCCCG GCATCATGAT GGCGGGGGCG GTGCGGGCCT ATCTCAATCG CTGGGGAGTT GCGCCGGGCG AGCGGATCAC CGTGTTTGCC AACAATGATG ATGCCCATCG CACCGCGCGG GATCTGGTGG CGGAGGGCGT GCATGTCACC GCCGTTGTCG ACAGCCGCCA TGACGCGCCC GACAGTGGCG AATTCCCCGT CATCAAGGGC GCGGTGATCT GCGACAGCGC CGGGCGGCAG CGGCTGGAGA GTGTCACCAT CCGCAGCGTC AATGGCGAGG AAAAGCTGCA AACCGATTGT CTGGCCGTAT CTGGCGGCTG GAACCCCTCG GTGCATCTGA CCTGCCACAT GGGCGCGCGC CCGGAGTGGA ACCGGGAGCT GACGGCGTTC CTGCCCAAAC CGGATGCTGT GCCGGGTATG GTGGTCGCAG GGGCTGCCAA TGGGGTGTTT TCTACCCATG GTGCCCTTCG AGACGGGGCC GAGGCGGCCG CGACGGCATT GGGCAAGATC GGGAGGTCTG CTGCTGCGGT GGCTGTCCCC GAGGCCAGTG ATGCGCCTTA CCAGATCAGC CCTCTGTGGG CAGTGCCGGG CAAAGGGCGG GCGTGGCTGG ATTTCCAGAA CGACGTCACC GTGAAGGATG TGGCGCAGGC AGCGCGGGAG AACTTCCGCT CGGTCGAGCA TATGAAGCGT TACACCACAC AGGGCATGGC CACCGATCAG GGCAAGAACT CCAACGTGGC GGCCTTGGCG GTGCTGGCCG ATGCCACCGG GCGCAGCATA CCAGAGACCG GTGTGACGAC GTTCCGCCCG CCGTTCGTGC CTACCTCTAT CGCCGCAATG GGGGCAGGGG CCGAAGATCA GGGATTTTTG CCGCAGCGGT TCTTGACCTC GCACAAGGCA 
TCGCTGGCGC GTGGCGCCGC CAATATCGAG GTCGGACTGT GGTATCGCGC CAGCTATTTT GCCAAGGACG GCGAGAGCAC CTGGCGGCAA TCCTGCGACC GCGAGGTCAC CATGGTGCGC AATGCCGTGG GGGTCTGCGA TGTCTCGACC CTTGGCAAGA TCGACATTCA GGGACCAGAC GCGGCAGAGC TGCTGGATCT GGTGTATACC AATCTGTTCT CCACGCTGAA GCTGGGGCGC GTGCGCTATG GTCTGATGCT GCGCGAGGAC GGCTTTGTCA TGGATGATGG CACCACCGCG CGGCTGGGGG AAAACCACTA TGTGATGACC ACCACCACCG CGGCGGCGGG GCAGGTGATG GCGCATCTGG AATATCTCAC GCAGGTGGTG CGCCCGGACC TTGATGTGCG CTTCACGTCT GTGACCGATC AATGGGCGCA GTTCTCGGTG GCTGGCCCCA AGGCGCGCGA TCTGATCGAC GCGCTGGTGG ATGAGGACGT CAACGGTGAG ACCTTCCCCT TCATGGCCTG CGGGGTGATC ACCGTGCTGG GCGTGGCCGG GCGGCTGTTC CGGATATCGT TTTCGGGCGA GCACGCCTAT GAGATCGCTG TGCCCGCGCG CTACGGCGAG GCACTTTATG AGCGGCTGCT GGAGCGGGCT GAGGCGCTGG GCGGTGGACC CTACGGGATG GAAGCGTTGA ATGTGCTCCG GATCGAAAAG GGCTTTATCA CCCACGCGGA GATCAATGGC ACGGTGACGG CCTTTGACCT TGGTATGCAG GGGCTTGTGT CGAAAAACAA ATCCTGCTGG GGCAAGGCCT TGTCCGAACG GGATGGGTTG ATGCATGACG ACCGGATGCG GCTTGTGGGC CTCAAGCCCG TGGGCGCGGC ACAAGAGATG AGCGCAGGGG CGCATCTCTT TGATCCCGAC GCCACTGTCG AGCGGGTCAA TGACCTTGGC TATGTGACAT CGGTGGGGTT CTCACCGACG CTGGGGCATA TGATCGGGCT GGCGATGCTG TCGGGCGGGC CGGACAGGAT GGGCGACACT ATCCGTCTGG TGGATCACAC CCGGGGCATC GACACCCTGT GTGAAGTGGT GAACCCTGTG TTCTTTGACC CTGAGGGAGG GCGCGTTCGT GGCTGA
Protein sequence | MRVSGKGLLS GNRTVNFSFN GKAYMGLEGD TLASALLAND VHLVGRSFKY HRPRGILTAG SEEPNALVSV GRGAARDPNI RATQQEIFEG LAAQSQNCWP SVERDVMAVN DLAAPFLGAG FYYKTFMWPA PLWEKFYEPI IRRAAGLGAL SGAANKDPYE KAFAFCDLLV IGSGPAGLMA ALTAGRAGAD VILAEDDSRM GGRLLAEIEE IDGKPAHVWV AEVLAELEAM ENVRLMPRTA VTGVYDQGTY SALERVSHHL PPSQSLKTAH RPRECFWRIA AKAAVLAAGA IERPIAFRNN DRPGIMMAGA VRAYLNRWGV APGERITVFA NNDDAHRTAR DLVAEGVHVT AVVDSRHDAP DSGEFPVIKG AVICDSAGRQ RLESVTIRSV NGEEKLQTDC LAVSGGWNPS VHLTCHMGAR PEWNRELTAF LPKPDAVPGM VVAGAANGVF STHGALRDGA EAAATALGKI GRSAAAVAVP EASDAPYQIS PLWAVPGKGR AWLDFQNDVT VKDVAQAARE NFRSVEHMKR YTTQGMATDQ GKNSNVAALA VLADATGRSI PETGVTTFRP PFVPTSIAAM GAGAEDQGFL PQRFLTSHKA SLARGAANIE VGLWYRASYF AKDGESTWRQ SCDREVTMVR NAVGVCDVST LGKIDIQGPD AAELLDLVYT NLFSTLKLGR VRYGLMLRED GFVMDDGTTA RLGENHYVMT TTTAAAGQVM AHLEYLTQVV RPDLDVRFTS VTDQWAQFSV AGPKARDLID ALVDEDVNGE TFPFMACGVI TVLGVAGRLF RISFSGEHAY EIAVPARYGE ALYERLLERA EALGGGPYGM EALNVLRIEK GFITHAEING TVTAFDLGMQ GLVSKNKSCW GKALSERDGL MHDDRMRLVG LKPVGAAQEM SAGAHLFDPD ATVERVNDLG YVTSVGFSPT LGHMIGLAML SGGPDRMGDT IRLVDHTRGI DTLCEVVNPV FFDPEGGRVR G
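The sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the coding sequence. It assumes the whitespace is purely presentational and uses the standard codon assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in its permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove layout whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        # Codons with ambiguity codes are reported as "X" (an assumption).
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, `translate("ATGAGAGTAT CTGGCAAGGG GCTTCTGTCC")` returns `MRVSGKGLLS`, which matches the first ten residues of the protein sequence listed above.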