Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_0994 |
Symbol | |
ID | 5161707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 1099400 |
End bp | 1102396 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640552911 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001234130 |
Protein GI | 148260003 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR00528] glycine cleavage system T protein [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA GCTTCCGGCT TGCCGAGGGC GGGCGGATCG ACCGCTCCCG CACCCTCGCC TTCCGTTTCG ACGGCCGCGC CTATGCCGGC CATCCGGGCG ACACGCTGGC CTCGGCCCTG CTCGCCAACG GCGTCCACCT GGTCGGCCGT TCCTTCAAGT ATCACCGTCC GCGCGGCATC CTGACCGCCG GCGCCGAGGA GCCGAACGCC CTGGTCGGCG TCGGCCGCGA CGAGGGGCAC TACACCCCCA ACCTGCGCGC GACCCAGGTC GTCCTGCATG ACGGCCTGGT CGCCGAAAGC CAGAACCGCT CCCCCTCGCT CGAACGCGAC TTCGCCGGCC TCACCGACCT GTTCTGGAAA TTCATCCCCG CCGGCTTCTA CTACAAGACC TTCATGTGGC CGAAAGCGGC GTGGACGCGG TTCTTCGAGC CGCGCATCCG CGCCATGGCC GGCCTCGGCC GCGCCCCGGA GGCGGCGGAC GCCGCCTGCT ACGCCCAGCG CTACGCCCAT TGCGACGTGC TGGTGGTCGG CGCCGGCCCC GCCGGCATCG AGGCCGCCCT CGCCGCCGCG TCCTCCGGCG CGCGGGTGAT CCTGTGCGAC GAACAGGCCG AGCTTGGCGG CGCCCTCCTC GCCGAGACGG ATGCGACGCT CGATGGCACC AATGCCGCGA GCTTCCTCGC CACCCGCCTG AGCTGGCTGC AGGCCGCCGG CCAGGTCACC ATCCTGCCGC GCACCATCGC CTTCGGCTAT TTCCCGCACA ACATGATCGG CCTCGCCCAG GACCTCACCG ATCATCTCGG CGAACCCGAC GACACCCAGC CCCGCGGCCG GCTCTGGCAG GTCCGCGCCC GCGAGGTGGT GATCGCAACC GGCGCCATCG AGCGCCCCCT CGCCTTCCCC GACAACGACC GCCCCGGCAT CATGCTGGCC GACGCCGCGC GCACCTACGT CACCCGCTAC GGCGTGCTGC CGGGCCGCAA CGCCGTCGTC TTCACCGCGC ACGATTCCGC CTATGCCGCC GCCCTCGCCC TGCACCGCGC CGGCGCCCGC ATCGCCGCGA TCGCCGATCT CCGCCCCGAT CCGTCCGGCG AACTGGTCGA GGCCGCCCGC GCCGCCGGCC TGCCGGTCCG CCCCGGCTGC ACCCTCACCG GCACCGAGGG GCGGCTGCGC GTCACCGCCG CCACCATCGC GCGGCGCGAT GGCGGCGCCG GCGAGCGCCT CTCCTGCGAC CTCGTGCTGA TGTCCGGCGG CTTCACCCCG AGCGTGCATC TCTTCTCGCA GAGCCGCGGC AAGCTCCGCT TCGATCCCGC GCTCGACGCC TTCATCCCCG GCGAGCCGGC CGAGGCCTGC CGCGCCGCCG GTGCCGCCGC CGGCACCACC TCGCTGGCCG ACGCCCTCGC CTCCGGCCGC GCCGCCGGCG AGGCGGCGGC CGCCGCCGCC GGCTTCACCG CCCCGCCCGC CGCGGCGATC GCGGTCGCCA ACGCCCCCGC CGCCACCGGC GGCTTCCTCG GCGCCACGCC GCACGGCCGC AACCCCGGCT CGGTCCGCGC CTTCATCGAC TTCCAGAACG ACGTCACCGC GAAGGACATC TCGCTCGCCC TGCGCGAGGG CTTCCGCTCG GTCGAGCACG TCAAGCGCTA CACCACCAAC GGCATGGCGA CCGACCAGGG CAAGCTCTCC AACATGAACG CGCTCGGCAT CATGTCGGCC GAGCTCGGCC GCCCGATCCC CGAGATCGGC ACCACCACCT TCCGCATGCC CTACACGCCG GTCCCCTTCG GCTATTTCGC CGGCTACGCC CGCGGCGCGC TGTTCGAGCC CGAGCGCCAC ACCCCGATCC ACGACTGGGC GGCGGAACAG GGCGCGGTGT TCGAGGATGT CGGCATCTGG AAACGCGCCC GCTACTTCCC GCGCGGCGGC GAGACGATGC GCAGCGCCGT CGCCCGCGAA TGCCGCGCCG TGCGCGCGAG CGTCGGCATC TTCGACGCCT CGACCCTCGG CAAGATCGAG GTCGTGGGGC CGGACGCCGC CGAGTTCCTC AACCGGATGT ACGTCAACGC CTGGACCAAG CTGAAACCCG GCCGCCTGCG CTACGGCGTG CTGCTGCGCG AGGACGGGTT CGTGATCGAC GACGGCGTGA TCGGCCGCCT GTCCGACACC CGCTTCCACG TCACCACCAC CACCGGCGGC GCCCCGCGCG TGCTCGCCAT GATGGAGGAC TACCTGCAGA CCGAGTTCCC CGAGCTCGAT GTCTGGCTCA CCTCCACCAC CGAGCAATAC GCGGTCATCG CCGTCCAGGG CCCCCGCGCG CGCGACGTGA TCGCCCCGCT CGTCGAGGGG GCGGACATCT CCGGCGCCGC CATGCCGCAC ATGAGCATGG TCGAATGCCG CGTCGCCGGC ATCCCGGCCC GGCTCTTCCG CGTCAGCTTC ACCGGCGAGC TCGGCTTCGA GATCAACGTC CCCGCCGATT ACGGCCGCGC CGTCTGGGAG GCGGTGTTCG ACGCCGGCCG CCGGCACGAC ATCACCGCCT ACGGCACCGA GACGATGCAC GTTCTGCGCG CCGAGAAGGG CTACATCATC GTCGGCCAGG AGACCGACGG CACGGCGACG CCCGACGATG TCGGCCTCGC CTGGGCGATC GGCAAGGCGA AGCCCGATTT CGTCGGCAAG CGCGCGCTCG ACCGCGCCGC CTTCGCCGGC CAGACCGGGC GCAAGCAGCT CGTCGGACTG TTCACCGAGC CCGGCGATAT CGTGCTCGAG GAAGGATCCC ACCTCGTCGC CGATCCCAGC CGGCCGCCGC CCGCCGAGAT CCTCGGCCAC GTCACCTCCG CCTACTGGAG CGAGACGCTC GGCCGCTCCA TCGCCCTCGC CCTGGTGCGC GGCGGGCGCG ACCGCATCGG CGACACGCTG CACGTGAAAC TCGCGGACCG CGCGATCCCG GTCCGCCTCA CCGATCCGGT ATTCTACGAC CGTGAAGGAG CCAGGCTCGA TGGCTGA
|
Protein sequence | MSQSFRLAEG GRIDRSRTLA FRFDGRAYAG HPGDTLASAL LANGVHLVGR SFKYHRPRGI LTAGAEEPNA LVGVGRDEGH YTPNLRATQV VLHDGLVAES QNRSPSLERD FAGLTDLFWK FIPAGFYYKT FMWPKAAWTR FFEPRIRAMA GLGRAPEAAD AACYAQRYAH CDVLVVGAGP AGIEAALAAA SSGARVILCD EQAELGGALL AETDATLDGT NAASFLATRL SWLQAAGQVT ILPRTIAFGY FPHNMIGLAQ DLTDHLGEPD DTQPRGRLWQ VRAREVVIAT GAIERPLAFP DNDRPGIMLA DAARTYVTRY GVLPGRNAVV FTAHDSAYAA ALALHRAGAR IAAIADLRPD PSGELVEAAR AAGLPVRPGC TLTGTEGRLR VTAATIARRD GGAGERLSCD LVLMSGGFTP SVHLFSQSRG KLRFDPALDA FIPGEPAEAC RAAGAAAGTT SLADALASGR AAGEAAAAAA GFTAPPAAAI AVANAPAATG GFLGATPHGR NPGSVRAFID FQNDVTAKDI SLALREGFRS VEHVKRYTTN GMATDQGKLS NMNALGIMSA ELGRPIPEIG TTTFRMPYTP VPFGYFAGYA RGALFEPERH TPIHDWAAEQ GAVFEDVGIW KRARYFPRGG ETMRSAVARE CRAVRASVGI FDASTLGKIE VVGPDAAEFL NRMYVNAWTK LKPGRLRYGV LLREDGFVID DGVIGRLSDT RFHVTTTTGG APRVLAMMED YLQTEFPELD VWLTSTTEQY AVIAVQGPRA RDVIAPLVEG ADISGAAMPH MSMVECRVAG IPARLFRVSF TGELGFEINV PADYGRAVWE AVFDAGRRHD ITAYGTETMH VLRAEKGYII VGQETDGTAT PDDVGLAWAI GKAKPDFVGK RALDRAAFAG QTGRKQLVGL FTEPGDIVLE EGSHLVADPS RPPPAEILGH VTSAYWSETL GRSIALALVR GGRDRIGDTL HVKLADRAIP VRLTDPVFYD REGARLDG
|
| |