Gene TM1040_3377 details

Gene Information

Locus tag: TM1040_3377
Symbol:
ID: 4075276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 390581
End bp: 393535
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 60%
IMG OID: 638004885
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_611611
Protein GI: 99078353
COG category: [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
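
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below is only an illustration and rests on two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the gene length includes a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 390581, End bp 393535.
length = gene_length_bp(390581, 393535)
print(length, protein_length_aa(length))
```

Under those assumptions the result matches the 2955 bp and 984 aa reported in the record.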


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.958217
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGG TGAACCGTAT ATCCGGTGGG CTGATCGACC GATCTACCGA GTTGAACTTC 
ACCTTTGATG GTAAGAACTA TCAGGGATAC GCGGGCGACA CGCTGGCCTC GGCACTGCTC
GCCAATGGGG TGCGCCTCAT GGGACGGTCG TTCAAATATC ACCGCCCGCG AGGCGTACTG
GCCGCCGGAT CAGAAGAACC CAATGCGCTG GTCGAGTTGC GCTCCGGGGG GCGCCAGGAA
CCCAATACCC GCGCCACTGT GGCAGAAATC TATGAGGGCC TCAGCGCCAA TTCACAGAAC
CGCTGGCCCT CGCTCAAGCA TGACGTCATG GCGATCAATG ATCGCTTCTC GGCATTCCTG
AGTGCTGGTT TCTATTACAA GACTTTCATG TGGCCACGCG CATTCTGGGA GAAACTTTAC
GAGCCGGTAA TCCGCAAGGC TGCCGGGTTG GGGAGCCTTT CAGGCGAAGG CGATCCAGAT
GCCTATGACA AGGGTTACCT GCATTGCGAC CTGCTGGTGA TCGGCGCAGG GCCAGCCGGC
CTGTCCGCCG CCCTGACCGC AGGACGCGGC GGGGCGCAGG TGATCCTCGC CGATGAGGAT
TTTCAACTTG GCGGGCGCCT GCTGTCCGAT GCCCAGTCTC TTTGTAATCA ATCGAATGCG
GAATGGGTGG CAGCAACTCA GGCCGAATTG ATTGCACTCC CGAATGTCCG CGTCATGCCA
CGCACCACCG TGTTTGGCGC CTATGATCAC GGTGTCTATG GCGCAGTGGA GCGGAATGCG
GATCACTTGG TTGCCCCTGA AGAGAATAAG CCGCGTCAGA CGCTCTGGCG GATTTACTCC
AGACGCGCGG TGGTTGCCAT CGGCGCAATC GAACGCCCCA TCGCGTTTGA AAACAACGAC
CGTCCGGGCG TGATGCTGGC CGGGGCCACC CGCGCCTATG CGAACCGCTG GGCTGTGACG
CCTGCGCGCT CTGTCGTGGT TTTCGCCAAC AATGATGACG CGCATCAAAC AGCTAAGGAC
CTGATCGCCA AGGGGATCGA GGTGCATGCC GTTGTGGATA CCCGCTCCGA TGCCCCAGGG
ATCGAGGGCA CCGAGCTTTT GGCCGGCGCG CAGATCATCG GCACCAAAGG GCGACTGGGC
CTGACATCGG TTACGGTGCG ACTCTTGGAT GGTCGGACGC GCGATATTAC CTGTGGCGCC
CTCGCCATGT CGGGTGGCTG GAATCCAAAT CTGGGCCTCA CCTGTCATCA GCGCGGGCGC
CCGGTGTGGC GCGAAGACAT TCACGCCTTT GTTCCGGGTT CAGATTTGCC CGCAGGCCAA
TCTGTTGTTG GCGCTGCAAT GGGCGAGATG TCGACACACG CCGCACTCCG GACTGGCGCA
GAGACCGCGC GCGAGGCGCT CTCTGATCTG GGCTTTACTG CGCCCGGCGT TGAGACACCC
AAGGCAGAGG ACGCGCCCAT TTCTCTGACC CCCTTCTGGC ATGTCGCAGA TGCAAAACGC
GCCTGGCTCG ACTTTCAGAA CGACGTGACC GTCAAGGATG TGAAGCTGGC GCATCAGGAG
AACTTCACAT CCGTCGAGCA TTTGAAGCGC TACACCACGC TGGGCATGGC GACCGATCAG
GGCAAAACTT CAAATGTCGG AGCCTTGGCT GTGATGGCAG AGCTGACCGG CAAGCCGATC
CCGGAAACCG GCACAACCAT CTTTCGGCCG CCCTATACGC CCGTCTCGAT GGGGGCTCTT
GCGGGTCGGG CCGTCGGCAA GGATTTCCAC CCAACCCGTC TGACTCCCAG CCACAAATGG
GCCGAGGAAC AGGGCGCCGT ATTTGTCGAG GTCGGCAACT GGCTGCGCGC GCAATGGTTC
CCAAAAGCGG GTGAAACCCA TTGGCGTCAG TCCGTGGATC GCGAGGTACT CGCCACCCGT
AATTCCGTCG GCATCTGCGA CGTGACAACG CTGGGTAAGA TTGACGTACA GGGTACGGAT
GCAGCGGAGT TCCTCAACAA GATCTACGCA AATGGATTTG CCAAGCTGCC GGTGGGCAAG
GTCCGCTATG GCCTGATGTT GCGCGAGGAT GGAGTGGCCT ATGATGATGG CACCGCCGCC
CGACTGGCCG AGGATCACTT CGTTGTCACG ACGACGACCG CCAATGCCGT TCTTGTCTAT
CGAAACATGG AATTCGCGCG CCAATGCCTG TGGCCTGATC TCGACGTGCA GCTGATCTCC
ACCACCGAGG CATGGGCACA ATATGCGGTG GCTGGTCCAA ACTCTCGCAA GCTGCTGCAA
AAAATTGTCG ACCCGGAGTT CGACATCTCC AACGCCGCCT TTCCGTTTAT GGGCTGTCGC
GAGATCACGG TCTGCGGTGG TCTGCGGGCG CGGCTGTTTC GGATCTCGTT CTCGGGAGAG
CTTGCCTATG AAATCGCGGT TCCAACCCGC TACGGAGATG CGCTGATGCG CGAAATGATG
ACTGCAGGCG CAGAGTTTGA TGTCACGCCC TACGGCACAG AAGCGCTCGG CGTGATGCGG
ATCGAGAAAG GCCATGCAGC TGGCAATGAA CTGAACGGCA CCACCACCGC GTTGAATCTC
GGTCTGGACC GTATGGTGTC CACCAAGAAG GACTTCATCG GCAATGTGTT GTCCCGTCGC
GAAGGCATGA ACGCCAAGGA CGCACTGAAC CTTGTGGGGG TGCGCCCGGT CGACCCATCT
CATTCCCTGC CCGCAGGCGG CCATCTCATG CGCAGGTCCG GTCCAGTCGA TGCAACGCAG
GATCAGGGCT ATGTCACATC TGCCGCCTAT TCTCCTACTC TCAAGAGTGC GATTGGGCTG
GGCTTTGTGA AATCGGGGTT TGAGCGAATG GGGGAACAGC TCCGGCTCGT GAACCCGCTC
GAAGGTCAAG AAATCCTTGT TGAAATCGTC AGCCCCCATT TTGTCGACCC AGAAGGAGAA
AAGCTCCGTG CATAG
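
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as a string in the 10-base block layout used above. The function name `gc_content` is hypothetical, and how the database rounds the value it reports is not known.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example with the first line of the gene sequence only.
first_line = "ATGACACAGG TGAACCGTAT ATCCGGTGGG CTGATCGACC GATCTACCGA GTTGAACTTC"
print(f"{gc_content(first_line):.1%}")
```

Passing the complete gene sequence instead of the first line gives a value to compare with the reported GC content.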
 
Protein sequence
MTQVNRISGG LIDRSTELNF TFDGKNYQGY AGDTLASALL ANGVRLMGRS FKYHRPRGVL 
AAGSEEPNAL VELRSGGRQE PNTRATVAEI YEGLSANSQN RWPSLKHDVM AINDRFSAFL
SAGFYYKTFM WPRAFWEKLY EPVIRKAAGL GSLSGEGDPD AYDKGYLHCD LLVIGAGPAG
LSAALTAGRG GAQVILADED FQLGGRLLSD AQSLCNQSNA EWVAATQAEL IALPNVRVMP
RTTVFGAYDH GVYGAVERNA DHLVAPEENK PRQTLWRIYS RRAVVAIGAI ERPIAFENND
RPGVMLAGAT RAYANRWAVT PARSVVVFAN NDDAHQTAKD LIAKGIEVHA VVDTRSDAPG
IEGTELLAGA QIIGTKGRLG LTSVTVRLLD GRTRDITCGA LAMSGGWNPN LGLTCHQRGR
PVWREDIHAF VPGSDLPAGQ SVVGAAMGEM STHAALRTGA ETAREALSDL GFTAPGVETP
KAEDAPISLT PFWHVADAKR AWLDFQNDVT VKDVKLAHQE NFTSVEHLKR YTTLGMATDQ
GKTSNVGALA VMAELTGKPI PETGTTIFRP PYTPVSMGAL AGRAVGKDFH PTRLTPSHKW
AEEQGAVFVE VGNWLRAQWF PKAGETHWRQ SVDREVLATR NSVGICDVTT LGKIDVQGTD
AAEFLNKIYA NGFAKLPVGK VRYGLMLRED GVAYDDGTAA RLAEDHFVVT TTTANAVLVY
RNMEFARQCL WPDLDVQLIS TTEAWAQYAV AGPNSRKLLQ KIVDPEFDIS NAAFPFMGCR
EITVCGGLRA RLFRISFSGE LAYEIAVPTR YGDALMREMM TAGAEFDVTP YGTEALGVMR
IEKGHAAGNE LNGTTTALNL GLDRMVSTKK DFIGNVLSRR EGMNAKDALN LVGVRPVDPS
HSLPAGGHLM RRSGPVDATQ DQGYVTSAAY SPTLKSAIGL GFVKSGFERM GEQLRLVNPL
EGQEILVEIV SPHFVDPEGE KLRA
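
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes that the codon assignments of translation table 11 match the standard genetic code and ignores the alternative start codons that table 11 allows, which is enough here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest), paired with the
# standard amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Last 15 bases of the gene sequence, ending in the TAG stop codon.
print(translate("AAGCTCCGTG CATAG"))  # KLRA
```

The same function applied to the full gene sequence can be compared against the listed protein sequence after removing the spaces between its 10-residue blocks.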