Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3377 |
Symbol | |
ID | 4075276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 390581 |
End bp | 393535 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004885 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_611611 |
Protein GI | 99078353 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.958217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGG TGAACCGTAT ATCCGGTGGG CTGATCGACC GATCTACCGA GTTGAACTTC ACCTTTGATG GTAAGAACTA TCAGGGATAC GCGGGCGACA CGCTGGCCTC GGCACTGCTC GCCAATGGGG TGCGCCTCAT GGGACGGTCG TTCAAATATC ACCGCCCGCG AGGCGTACTG GCCGCCGGAT CAGAAGAACC CAATGCGCTG GTCGAGTTGC GCTCCGGGGG GCGCCAGGAA CCCAATACCC GCGCCACTGT GGCAGAAATC TATGAGGGCC TCAGCGCCAA TTCACAGAAC CGCTGGCCCT CGCTCAAGCA TGACGTCATG GCGATCAATG ATCGCTTCTC GGCATTCCTG AGTGCTGGTT TCTATTACAA GACTTTCATG TGGCCACGCG CATTCTGGGA GAAACTTTAC GAGCCGGTAA TCCGCAAGGC TGCCGGGTTG GGGAGCCTTT CAGGCGAAGG CGATCCAGAT GCCTATGACA AGGGTTACCT GCATTGCGAC CTGCTGGTGA TCGGCGCAGG GCCAGCCGGC CTGTCCGCCG CCCTGACCGC AGGACGCGGC GGGGCGCAGG TGATCCTCGC CGATGAGGAT TTTCAACTTG GCGGGCGCCT GCTGTCCGAT GCCCAGTCTC TTTGTAATCA ATCGAATGCG GAATGGGTGG CAGCAACTCA GGCCGAATTG ATTGCACTCC CGAATGTCCG CGTCATGCCA CGCACCACCG TGTTTGGCGC CTATGATCAC GGTGTCTATG GCGCAGTGGA GCGGAATGCG GATCACTTGG TTGCCCCTGA AGAGAATAAG CCGCGTCAGA CGCTCTGGCG GATTTACTCC AGACGCGCGG TGGTTGCCAT CGGCGCAATC GAACGCCCCA TCGCGTTTGA AAACAACGAC CGTCCGGGCG TGATGCTGGC CGGGGCCACC CGCGCCTATG CGAACCGCTG GGCTGTGACG CCTGCGCGCT CTGTCGTGGT TTTCGCCAAC AATGATGACG CGCATCAAAC AGCTAAGGAC CTGATCGCCA AGGGGATCGA GGTGCATGCC GTTGTGGATA CCCGCTCCGA TGCCCCAGGG ATCGAGGGCA CCGAGCTTTT GGCCGGCGCG CAGATCATCG GCACCAAAGG GCGACTGGGC CTGACATCGG TTACGGTGCG ACTCTTGGAT GGTCGGACGC GCGATATTAC CTGTGGCGCC CTCGCCATGT CGGGTGGCTG GAATCCAAAT CTGGGCCTCA CCTGTCATCA GCGCGGGCGC CCGGTGTGGC GCGAAGACAT TCACGCCTTT GTTCCGGGTT CAGATTTGCC CGCAGGCCAA TCTGTTGTTG GCGCTGCAAT GGGCGAGATG TCGACACACG CCGCACTCCG GACTGGCGCA GAGACCGCGC GCGAGGCGCT CTCTGATCTG GGCTTTACTG CGCCCGGCGT TGAGACACCC AAGGCAGAGG ACGCGCCCAT TTCTCTGACC CCCTTCTGGC ATGTCGCAGA TGCAAAACGC GCCTGGCTCG ACTTTCAGAA CGACGTGACC GTCAAGGATG TGAAGCTGGC GCATCAGGAG AACTTCACAT CCGTCGAGCA TTTGAAGCGC TACACCACGC TGGGCATGGC GACCGATCAG GGCAAAACTT CAAATGTCGG AGCCTTGGCT GTGATGGCAG AGCTGACCGG CAAGCCGATC CCGGAAACCG GCACAACCAT CTTTCGGCCG CCCTATACGC CCGTCTCGAT GGGGGCTCTT GCGGGTCGGG CCGTCGGCAA GGATTTCCAC CCAACCCGTC TGACTCCCAG CCACAAATGG GCCGAGGAAC AGGGCGCCGT ATTTGTCGAG GTCGGCAACT GGCTGCGCGC GCAATGGTTC CCAAAAGCGG GTGAAACCCA TTGGCGTCAG TCCGTGGATC GCGAGGTACT CGCCACCCGT AATTCCGTCG GCATCTGCGA CGTGACAACG CTGGGTAAGA TTGACGTACA GGGTACGGAT GCAGCGGAGT TCCTCAACAA GATCTACGCA AATGGATTTG CCAAGCTGCC GGTGGGCAAG GTCCGCTATG GCCTGATGTT GCGCGAGGAT GGAGTGGCCT ATGATGATGG CACCGCCGCC CGACTGGCCG AGGATCACTT CGTTGTCACG ACGACGACCG CCAATGCCGT TCTTGTCTAT CGAAACATGG AATTCGCGCG CCAATGCCTG TGGCCTGATC TCGACGTGCA GCTGATCTCC ACCACCGAGG CATGGGCACA ATATGCGGTG GCTGGTCCAA ACTCTCGCAA GCTGCTGCAA AAAATTGTCG ACCCGGAGTT CGACATCTCC AACGCCGCCT TTCCGTTTAT GGGCTGTCGC GAGATCACGG TCTGCGGTGG TCTGCGGGCG CGGCTGTTTC GGATCTCGTT CTCGGGAGAG CTTGCCTATG AAATCGCGGT TCCAACCCGC TACGGAGATG CGCTGATGCG CGAAATGATG ACTGCAGGCG CAGAGTTTGA TGTCACGCCC TACGGCACAG AAGCGCTCGG CGTGATGCGG ATCGAGAAAG GCCATGCAGC TGGCAATGAA CTGAACGGCA CCACCACCGC GTTGAATCTC GGTCTGGACC GTATGGTGTC CACCAAGAAG GACTTCATCG GCAATGTGTT GTCCCGTCGC GAAGGCATGA ACGCCAAGGA CGCACTGAAC CTTGTGGGGG TGCGCCCGGT CGACCCATCT CATTCCCTGC CCGCAGGCGG CCATCTCATG CGCAGGTCCG GTCCAGTCGA TGCAACGCAG GATCAGGGCT ATGTCACATC TGCCGCCTAT TCTCCTACTC TCAAGAGTGC GATTGGGCTG GGCTTTGTGA AATCGGGGTT TGAGCGAATG GGGGAACAGC TCCGGCTCGT GAACCCGCTC GAAGGTCAAG AAATCCTTGT TGAAATCGTC AGCCCCCATT TTGTCGACCC AGAAGGAGAA AAGCTCCGTG CATAG
|
Protein sequence | MTQVNRISGG LIDRSTELNF TFDGKNYQGY AGDTLASALL ANGVRLMGRS FKYHRPRGVL AAGSEEPNAL VELRSGGRQE PNTRATVAEI YEGLSANSQN RWPSLKHDVM AINDRFSAFL SAGFYYKTFM WPRAFWEKLY EPVIRKAAGL GSLSGEGDPD AYDKGYLHCD LLVIGAGPAG LSAALTAGRG GAQVILADED FQLGGRLLSD AQSLCNQSNA EWVAATQAEL IALPNVRVMP RTTVFGAYDH GVYGAVERNA DHLVAPEENK PRQTLWRIYS RRAVVAIGAI ERPIAFENND RPGVMLAGAT RAYANRWAVT PARSVVVFAN NDDAHQTAKD LIAKGIEVHA VVDTRSDAPG IEGTELLAGA QIIGTKGRLG LTSVTVRLLD GRTRDITCGA LAMSGGWNPN LGLTCHQRGR PVWREDIHAF VPGSDLPAGQ SVVGAAMGEM STHAALRTGA ETAREALSDL GFTAPGVETP KAEDAPISLT PFWHVADAKR AWLDFQNDVT VKDVKLAHQE NFTSVEHLKR YTTLGMATDQ GKTSNVGALA VMAELTGKPI PETGTTIFRP PYTPVSMGAL AGRAVGKDFH PTRLTPSHKW AEEQGAVFVE VGNWLRAQWF PKAGETHWRQ SVDREVLATR NSVGICDVTT LGKIDVQGTD AAEFLNKIYA NGFAKLPVGK VRYGLMLRED GVAYDDGTAA RLAEDHFVVT TTTANAVLVY RNMEFARQCL WPDLDVQLIS TTEAWAQYAV AGPNSRKLLQ KIVDPEFDIS NAAFPFMGCR EITVCGGLRA RLFRISFSGE LAYEIAVPTR YGDALMREMM TAGAEFDVTP YGTEALGVMR IEKGHAAGNE LNGTTTALNL GLDRMVSTKK DFIGNVLSRR EGMNAKDALN LVGVRPVDPS HSLPAGGHLM RRSGPVDATQ DQGYVTSAAY SPTLKSAIGL GFVKSGFERM GEQLRLVNPL EGQEILVEIV SPHFVDPEGE KLRA
|
| |