Gene TM1040_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2145 
Symbol 
ID4076459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2250829 
End bp2253774 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content64% 
IMG OID638007465 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_614139 
Protein GI99081985 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTAT CTGGCAAGGG GCTTCTGTCC GGCAACCGGA CTGTGAATTT TTCCTTTAAT 
GGCAAGGCCT ACATGGGGCT GGAGGGGGAC ACGCTGGCCT CGGCGCTTTT GGCCAATGAT
GTGCACTTGG TGGGGCGGTC GTTCAAATAC CACCGCCCGC GGGGTATCCT GACAGCAGGC
TCCGAGGAGC CCAACGCGCT GGTGAGCGTG GGACGCGGGG CAGCGCGAGA TCCAAATATC
CGCGCCACCC AGCAGGAAAT TTTTGAGGGG CTGGCAGCGC AGAGCCAGAA CTGCTGGCCT
TCGGTCGAGC GCGATGTCAT GGCGGTGAAT GATCTGGCAG CGCCCTTCCT TGGGGCGGGG
TTTTATTACA AGACCTTCAT GTGGCCAGCG CCCCTGTGGG AGAAGTTCTA CGAGCCGATC
ATTCGCCGGG CGGCGGGGCT TGGGGCCTTG TCCGGGGCGG CCAACAAGGA CCCTTACGAG
AAGGCCTTTG CCTTTTGCGA CCTGCTGGTG ATCGGGTCTG GCCCAGCCGG GCTGATGGCT
GCACTGACGG CAGGGCGTGC GGGGGCGGAT GTGATCCTCG CCGAGGATGA CAGCCGGATG
GGCGGGCGTC TTCTGGCGGA GATCGAGGAG ATCGACGGCA AGCCCGCGCA TGTCTGGGTG
GCAGAGGTGC TGGCGGAGCT GGAGGCCATG GAGAACGTGC GTCTGATGCC GCGCACCGCG
GTCACCGGCG TCTACGATCA AGGGACCTAT TCGGCGCTCG AGCGGGTTTC GCACCACCTT
CCGCCGAGCC AGTCCCTGAA GACCGCGCAT CGCCCGCGCG AGTGTTTCTG GCGCATCGCC
GCCAAAGCCG CGGTGCTGGC GGCGGGGGCT ATTGAACGCC CCATTGCCTT TCGCAACAAC
GACCGCCCCG GCATCATGAT GGCGGGGGCG GTGCGGGCCT ATCTCAATCG CTGGGGAGTT
GCGCCGGGCG AGCGGATCAC CGTGTTTGCC AACAATGATG ATGCCCATCG CACCGCGCGG
GATCTGGTGG CGGAGGGCGT GCATGTCACC GCCGTTGTCG ACAGCCGCCA TGACGCGCCC
GACAGTGGCG AATTCCCCGT CATCAAGGGC GCGGTGATCT GCGACAGCGC CGGGCGGCAG
CGGCTGGAGA GTGTCACCAT CCGCAGCGTC AATGGCGAGG AAAAGCTGCA AACCGATTGT
CTGGCCGTAT CTGGCGGCTG GAACCCCTCG GTGCATCTGA CCTGCCACAT GGGCGCGCGC
CCGGAGTGGA ACCGGGAGCT GACGGCGTTC CTGCCCAAAC CGGATGCTGT GCCGGGTATG
GTGGTCGCAG GGGCTGCCAA TGGGGTGTTT TCTACCCATG GTGCCCTTCG AGACGGGGCC
GAGGCGGCCG CGACGGCATT GGGCAAGATC GGGAGGTCTG CTGCTGCGGT GGCTGTCCCC
GAGGCCAGTG ATGCGCCTTA CCAGATCAGC CCTCTGTGGG CAGTGCCGGG CAAAGGGCGG
GCGTGGCTGG ATTTCCAGAA CGACGTCACC GTGAAGGATG TGGCGCAGGC AGCGCGGGAG
AACTTCCGCT CGGTCGAGCA TATGAAGCGT TACACCACAC AGGGCATGGC CACCGATCAG
GGCAAGAACT CCAACGTGGC GGCCTTGGCG GTGCTGGCCG ATGCCACCGG GCGCAGCATA
CCAGAGACCG GTGTGACGAC GTTCCGCCCG CCGTTCGTGC CTACCTCTAT CGCCGCAATG
GGGGCAGGGG CCGAAGATCA GGGATTTTTG CCGCAGCGGT TCTTGACCTC GCACAAGGCA
TCGCTGGCGC GTGGCGCCGC CAATATCGAG GTCGGACTGT GGTATCGCGC CAGCTATTTT
GCCAAGGACG GCGAGAGCAC CTGGCGGCAA TCCTGCGACC GCGAGGTCAC CATGGTGCGC
AATGCCGTGG GGGTCTGCGA TGTCTCGACC CTTGGCAAGA TCGACATTCA GGGACCAGAC
GCGGCAGAGC TGCTGGATCT GGTGTATACC AATCTGTTCT CCACGCTGAA GCTGGGGCGC
GTGCGCTATG GTCTGATGCT GCGCGAGGAC GGCTTTGTCA TGGATGATGG CACCACCGCG
CGGCTGGGGG AAAACCACTA TGTGATGACC ACCACCACCG CGGCGGCGGG GCAGGTGATG
GCGCATCTGG AATATCTCAC GCAGGTGGTG CGCCCGGACC TTGATGTGCG CTTCACGTCT
GTGACCGATC AATGGGCGCA GTTCTCGGTG GCTGGCCCCA AGGCGCGCGA TCTGATCGAC
GCGCTGGTGG ATGAGGACGT CAACGGTGAG ACCTTCCCCT TCATGGCCTG CGGGGTGATC
ACCGTGCTGG GCGTGGCCGG GCGGCTGTTC CGGATATCGT TTTCGGGCGA GCACGCCTAT
GAGATCGCTG TGCCCGCGCG CTACGGCGAG GCACTTTATG AGCGGCTGCT GGAGCGGGCT
GAGGCGCTGG GCGGTGGACC CTACGGGATG GAAGCGTTGA ATGTGCTCCG GATCGAAAAG
GGCTTTATCA CCCACGCGGA GATCAATGGC ACGGTGACGG CCTTTGACCT TGGTATGCAG
GGGCTTGTGT CGAAAAACAA ATCCTGCTGG GGCAAGGCCT TGTCCGAACG GGATGGGTTG
ATGCATGACG ACCGGATGCG GCTTGTGGGC CTCAAGCCCG TGGGCGCGGC ACAAGAGATG
AGCGCAGGGG CGCATCTCTT TGATCCCGAC GCCACTGTCG AGCGGGTCAA TGACCTTGGC
TATGTGACAT CGGTGGGGTT CTCACCGACG CTGGGGCATA TGATCGGGCT GGCGATGCTG
TCGGGCGGGC CGGACAGGAT GGGCGACACT ATCCGTCTGG TGGATCACAC CCGGGGCATC
GACACCCTGT GTGAAGTGGT GAACCCTGTG TTCTTTGACC CTGAGGGAGG GCGCGTTCGT
GGCTGA
 
Protein sequence
MRVSGKGLLS GNRTVNFSFN GKAYMGLEGD TLASALLAND VHLVGRSFKY HRPRGILTAG 
SEEPNALVSV GRGAARDPNI RATQQEIFEG LAAQSQNCWP SVERDVMAVN DLAAPFLGAG
FYYKTFMWPA PLWEKFYEPI IRRAAGLGAL SGAANKDPYE KAFAFCDLLV IGSGPAGLMA
ALTAGRAGAD VILAEDDSRM GGRLLAEIEE IDGKPAHVWV AEVLAELEAM ENVRLMPRTA
VTGVYDQGTY SALERVSHHL PPSQSLKTAH RPRECFWRIA AKAAVLAAGA IERPIAFRNN
DRPGIMMAGA VRAYLNRWGV APGERITVFA NNDDAHRTAR DLVAEGVHVT AVVDSRHDAP
DSGEFPVIKG AVICDSAGRQ RLESVTIRSV NGEEKLQTDC LAVSGGWNPS VHLTCHMGAR
PEWNRELTAF LPKPDAVPGM VVAGAANGVF STHGALRDGA EAAATALGKI GRSAAAVAVP
EASDAPYQIS PLWAVPGKGR AWLDFQNDVT VKDVAQAARE NFRSVEHMKR YTTQGMATDQ
GKNSNVAALA VLADATGRSI PETGVTTFRP PFVPTSIAAM GAGAEDQGFL PQRFLTSHKA
SLARGAANIE VGLWYRASYF AKDGESTWRQ SCDREVTMVR NAVGVCDVST LGKIDIQGPD
AAELLDLVYT NLFSTLKLGR VRYGLMLRED GFVMDDGTTA RLGENHYVMT TTTAAAGQVM
AHLEYLTQVV RPDLDVRFTS VTDQWAQFSV AGPKARDLID ALVDEDVNGE TFPFMACGVI
TVLGVAGRLF RISFSGEHAY EIAVPARYGE ALYERLLERA EALGGGPYGM EALNVLRIEK
GFITHAEING TVTAFDLGMQ GLVSKNKSCW GKALSERDGL MHDDRMRLVG LKPVGAAQEM
SAGAHLFDPD ATVERVNDLG YVTSVGFSPT LGHMIGLAML SGGPDRMGDT IRLVDHTRGI
DTLCEVVNPV FFDPEGGRVR G