Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3593 |
Symbol | |
ID | 7295074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3994259 |
End bp | 3997201 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643591999 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002489638 |
Protein GI | 220914329 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTCCC AGAACGCCCG CCTGGCCACC GGCGGCCGCA TCGACCGCAC CATTTCCTGG CGCTTCACCG TGGACGGCGA GGAGTTCACC GGACACCCCG GCGATACCCT GGCCTCCGCA CTGCTCGCCA ACGGCCGCAT CGCTGCCGGT AACTCGCTTT ACGAGGACCG CCACCGCGGC ATCATGTCCG CCGGCGTCGA AGAAGCCAAC GCGCTGGTCC GCGTGGAAGC CCGCTTCCCC GGTCACGTGG CCGAGTCCAT GCTCCCTGCT ACCACCGTTT CCCTCGTTGA CGGACTCCAG GCCCTCCAGC TCAACGGCCT GGGCAAGCTG GATCCGGCTG AGGACCGCGC CGAATACGAC AAGAAGTACG TCCACACCGA CGTCCTGGTC ATCGGCGGCG GCCCTGCCGG CCTTGCCGCA GCCCGCGAAG CCGTCCGCAC CGGTGCCCGC GTCATGCTGC TCGATGACCA GCCCGAACTC GGCGGCTCCC TGCTCTCCGG CTCCATGGCC GAGGGCCTGG CCGAAACCAT CGAAGGCAAG CCCGCCCTCG AATGGGTGGC CGACGTCGAA GCCGAACTGG TTTCCGGCGC GGAATCCACG GTCCTGAACC GCACCACCGC GTTCGGCGCC TACGACGCCA ACTACGTCAT CGCCGTCCAG AACCGCACCG ACCACCTCAC CAGCCCCGCT GCCCCCGGCG TCTCCCGCCA GCGGATTTGG CACATCCGTG CCAGTCAGGT GGTCCTCGCC CCCGGGGCGC ATGAGCGTCC CCTGGTGTTC GAGAACAACG ACCGCCCCGG CATCATGCTC GCCTCGGCCG TCCGCAGCTA CCTGAACCGG TACGCCGTGG CCGCTGGCCA GCGCGTGGTT ATCAGCACCA CCAACGACAG CGCCTACGCC ACCGCCGCCG ACCTCGCAGC AGCTGGCGTC AAGGTCGCAG CCGTCGTTGA CGCCCGTCCC AAGCTCACCG CCGTGGCAAC CGCCGCCGTC GAATCCGGGA TCCGGGTGCT GATCGGCAGC GCGGTGGCCA ACACCAGCGC TGATTCCGCC GGCCGGCTGG ACGGCGTCAC CGTCCGCAGC ATCAACGACG ACGGCGAACT CACCTCCGGC GTCGAGCAGA TCGCAGCAGA CCTGCTGGCC GTCTCCGGCG GCTGGAGCCC GCTGGTGCAC CTGCACTCCC AGCGACAGGG CAAGCTGCGC TGGGACGACG AGCTGGCAGC CTTCGTGCCC AGCACCGAGG TTCCCAACCA GCAGACCATC GGCTCCGGCC GCGGCTCGTT CGCGACCGAA GACTGCCTCG CCGAGGGCAT CTCCGCCGGC GCGAAGGCGG CCATCGCCGC GGGCTTCGAA TCCGCCGTCG AGCCTTCCGT CCTCCCGGAG CTGAAGGCTT CCGCCCCCAC CCGCCAGCTG TGGCTGGTAC CGGGCGAAGA GGGTACCCCG GACGACTGGC ACCACCACTT CGTGGACTTC CAGCGCGACC AGTCAGTGGC GGACGTCCTC CGCTCCACCG GCGCGGGAAT GCGTTCGGTG GAACACATCA AGCGGTACAC CTCCATCAGC ACCGCCAACG ACCAGGGCAA GACCTCCGGC GTGAACGCCA TCGGCGTGAT CGCGGCGGCC CTGCGCACGG CCGGCGAGGC TTCGCGCGGC ATCGGTGACA TCGGCACCAC CACCTACCGC GCACCGTTCA CCCCGGTGGC CTTCGCGGCC CTCGCCGGAC GCCAGCGCGG TGAGCTCTTC GACCCCGCCC GCATCACCTC GATCCAGCCA TGGCACGTTG CCAAGGGTGC GCTCTTCGAG GACGTCGGGC AGTGGAAGCG CCCCTGGTAC TACCCGCAGG GCGGGGAAGA CATGGACGCC GCAGTGCTGC GCGAATGCGC CGCCGTCCGC GACTCGGTGG GCTTCATGGA CGCCACCACC CTGGGCAAGA TCGAAATCCG CGGCAAGGAT GCGGGCGAGT TCCTGAACCG CGTCTACACC AACGCCTTCA AGAAGCTGGC CCCGGGCTCG GCACGCTACG GCGTCATGTG CCTGGCCGAC GGCATGATCT TCGACGACGG CGTGACCCTC CGGTTGGACG AGGACACCTT CTTCATGACC ACCACCACCG GCGGCGCCGC CAAGGTGCTG GACCACCTGG AGGAATGGCT GCAGACCGAA TGGCCTGAGC TGGACGTGCA GTGCACCTCG GTGACCGAGC AGTGGAACAC CATTGCCGTC GTGGGGCCCA AGTCCCGCGA AGTGATCGCC AAGGTGGCCC CGGAACTGGC CGCCAACGGC GGACTGGATG CTGAAAACTT CCCGTTCATG ACCTTCCGTG AGACCACCCT CGCCTCCGGC GTCCGGGCAC GGGTCTGCCG GATCTCCTTC TCCGGCGAAC TCGCCTACGA GATCAATGTT CCGGCCTGGT ACGGCCTGAA CACCTGGGAG TCCGTGGCCG CAGCAGGTGC CGAGTTCAAC ATCACCCCGT ACGGCACCGA AACCATGCAC GTCCTCCGCG CCGAAAAGGG CTACCCGATC GTCGGGCAGG ACACCGACGG CACTGTAACC CCGCAGGATG CCGGCATGGA GTGGATCGTC TCCAAGGCCA AGGACTTCAT CGGCAAGCGC TCCTACTCCC GCGTGGACGC CCAGCGTGAA GACCGCAAGC ACCTGGTCAG CGTCCTTCCC GTGGACCGCA CGCTGCGGCT GCCCGAAGGC ACCCAGTTGG TGGAAAAGGG ACGCTCCACC AACCCCGCCT ACGGCCCCGT GCCGATGGAA GGGTTCGTCA CCTCCAGCTA CCACAGCGCA GCGCTGGGCC GTTCCTTCGG CCTGGCCCTG ATCAAGAACG GACGCAACCG CATCGGCGAA ACGCTGATTG CTGCCGCCGG CGACCAGCTG GTGGACGTTG TTGTTGCAGA GACAGTGCTT TTTGACTCCG AAGGGACCCG CAAAGATGGC TGA
|
Protein sequence | MTSQNARLAT GGRIDRTISW RFTVDGEEFT GHPGDTLASA LLANGRIAAG NSLYEDRHRG IMSAGVEEAN ALVRVEARFP GHVAESMLPA TTVSLVDGLQ ALQLNGLGKL DPAEDRAEYD KKYVHTDVLV IGGGPAGLAA AREAVRTGAR VMLLDDQPEL GGSLLSGSMA EGLAETIEGK PALEWVADVE AELVSGAEST VLNRTTAFGA YDANYVIAVQ NRTDHLTSPA APGVSRQRIW HIRASQVVLA PGAHERPLVF ENNDRPGIML ASAVRSYLNR YAVAAGQRVV ISTTNDSAYA TAADLAAAGV KVAAVVDARP KLTAVATAAV ESGIRVLIGS AVANTSADSA GRLDGVTVRS INDDGELTSG VEQIAADLLA VSGGWSPLVH LHSQRQGKLR WDDELAAFVP STEVPNQQTI GSGRGSFATE DCLAEGISAG AKAAIAAGFE SAVEPSVLPE LKASAPTRQL WLVPGEEGTP DDWHHHFVDF QRDQSVADVL RSTGAGMRSV EHIKRYTSIS TANDQGKTSG VNAIGVIAAA LRTAGEASRG IGDIGTTTYR APFTPVAFAA LAGRQRGELF DPARITSIQP WHVAKGALFE DVGQWKRPWY YPQGGEDMDA AVLRECAAVR DSVGFMDATT LGKIEIRGKD AGEFLNRVYT NAFKKLAPGS ARYGVMCLAD GMIFDDGVTL RLDEDTFFMT TTTGGAAKVL DHLEEWLQTE WPELDVQCTS VTEQWNTIAV VGPKSREVIA KVAPELAANG GLDAENFPFM TFRETTLASG VRARVCRISF SGELAYEINV PAWYGLNTWE SVAAAGAEFN ITPYGTETMH VLRAEKGYPI VGQDTDGTVT PQDAGMEWIV SKAKDFIGKR SYSRVDAQRE DRKHLVSVLP VDRTLRLPEG TQLVEKGRST NPAYGPVPME GFVTSSYHSA ALGRSFGLAL IKNGRNRIGE TLIAAAGDQL VDVVVAETVL FDSEGTRKDG
|
| |