Gene Achl_3593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3593 
Symbol 
ID7295074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3994259 
End bp3997201 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content68% 
IMG OID643591999 
Productsarcosine oxidase, alpha subunit family 
Protein accessionYP_002489638 
Protein GI220914329 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTCCC AGAACGCCCG CCTGGCCACC GGCGGCCGCA TCGACCGCAC CATTTCCTGG 
CGCTTCACCG TGGACGGCGA GGAGTTCACC GGACACCCCG GCGATACCCT GGCCTCCGCA
CTGCTCGCCA ACGGCCGCAT CGCTGCCGGT AACTCGCTTT ACGAGGACCG CCACCGCGGC
ATCATGTCCG CCGGCGTCGA AGAAGCCAAC GCGCTGGTCC GCGTGGAAGC CCGCTTCCCC
GGTCACGTGG CCGAGTCCAT GCTCCCTGCT ACCACCGTTT CCCTCGTTGA CGGACTCCAG
GCCCTCCAGC TCAACGGCCT GGGCAAGCTG GATCCGGCTG AGGACCGCGC CGAATACGAC
AAGAAGTACG TCCACACCGA CGTCCTGGTC ATCGGCGGCG GCCCTGCCGG CCTTGCCGCA
GCCCGCGAAG CCGTCCGCAC CGGTGCCCGC GTCATGCTGC TCGATGACCA GCCCGAACTC
GGCGGCTCCC TGCTCTCCGG CTCCATGGCC GAGGGCCTGG CCGAAACCAT CGAAGGCAAG
CCCGCCCTCG AATGGGTGGC CGACGTCGAA GCCGAACTGG TTTCCGGCGC GGAATCCACG
GTCCTGAACC GCACCACCGC GTTCGGCGCC TACGACGCCA ACTACGTCAT CGCCGTCCAG
AACCGCACCG ACCACCTCAC CAGCCCCGCT GCCCCCGGCG TCTCCCGCCA GCGGATTTGG
CACATCCGTG CCAGTCAGGT GGTCCTCGCC CCCGGGGCGC ATGAGCGTCC CCTGGTGTTC
GAGAACAACG ACCGCCCCGG CATCATGCTC GCCTCGGCCG TCCGCAGCTA CCTGAACCGG
TACGCCGTGG CCGCTGGCCA GCGCGTGGTT ATCAGCACCA CCAACGACAG CGCCTACGCC
ACCGCCGCCG ACCTCGCAGC AGCTGGCGTC AAGGTCGCAG CCGTCGTTGA CGCCCGTCCC
AAGCTCACCG CCGTGGCAAC CGCCGCCGTC GAATCCGGGA TCCGGGTGCT GATCGGCAGC
GCGGTGGCCA ACACCAGCGC TGATTCCGCC GGCCGGCTGG ACGGCGTCAC CGTCCGCAGC
ATCAACGACG ACGGCGAACT CACCTCCGGC GTCGAGCAGA TCGCAGCAGA CCTGCTGGCC
GTCTCCGGCG GCTGGAGCCC GCTGGTGCAC CTGCACTCCC AGCGACAGGG CAAGCTGCGC
TGGGACGACG AGCTGGCAGC CTTCGTGCCC AGCACCGAGG TTCCCAACCA GCAGACCATC
GGCTCCGGCC GCGGCTCGTT CGCGACCGAA GACTGCCTCG CCGAGGGCAT CTCCGCCGGC
GCGAAGGCGG CCATCGCCGC GGGCTTCGAA TCCGCCGTCG AGCCTTCCGT CCTCCCGGAG
CTGAAGGCTT CCGCCCCCAC CCGCCAGCTG TGGCTGGTAC CGGGCGAAGA GGGTACCCCG
GACGACTGGC ACCACCACTT CGTGGACTTC CAGCGCGACC AGTCAGTGGC GGACGTCCTC
CGCTCCACCG GCGCGGGAAT GCGTTCGGTG GAACACATCA AGCGGTACAC CTCCATCAGC
ACCGCCAACG ACCAGGGCAA GACCTCCGGC GTGAACGCCA TCGGCGTGAT CGCGGCGGCC
CTGCGCACGG CCGGCGAGGC TTCGCGCGGC ATCGGTGACA TCGGCACCAC CACCTACCGC
GCACCGTTCA CCCCGGTGGC CTTCGCGGCC CTCGCCGGAC GCCAGCGCGG TGAGCTCTTC
GACCCCGCCC GCATCACCTC GATCCAGCCA TGGCACGTTG CCAAGGGTGC GCTCTTCGAG
GACGTCGGGC AGTGGAAGCG CCCCTGGTAC TACCCGCAGG GCGGGGAAGA CATGGACGCC
GCAGTGCTGC GCGAATGCGC CGCCGTCCGC GACTCGGTGG GCTTCATGGA CGCCACCACC
CTGGGCAAGA TCGAAATCCG CGGCAAGGAT GCGGGCGAGT TCCTGAACCG CGTCTACACC
AACGCCTTCA AGAAGCTGGC CCCGGGCTCG GCACGCTACG GCGTCATGTG CCTGGCCGAC
GGCATGATCT TCGACGACGG CGTGACCCTC CGGTTGGACG AGGACACCTT CTTCATGACC
ACCACCACCG GCGGCGCCGC CAAGGTGCTG GACCACCTGG AGGAATGGCT GCAGACCGAA
TGGCCTGAGC TGGACGTGCA GTGCACCTCG GTGACCGAGC AGTGGAACAC CATTGCCGTC
GTGGGGCCCA AGTCCCGCGA AGTGATCGCC AAGGTGGCCC CGGAACTGGC CGCCAACGGC
GGACTGGATG CTGAAAACTT CCCGTTCATG ACCTTCCGTG AGACCACCCT CGCCTCCGGC
GTCCGGGCAC GGGTCTGCCG GATCTCCTTC TCCGGCGAAC TCGCCTACGA GATCAATGTT
CCGGCCTGGT ACGGCCTGAA CACCTGGGAG TCCGTGGCCG CAGCAGGTGC CGAGTTCAAC
ATCACCCCGT ACGGCACCGA AACCATGCAC GTCCTCCGCG CCGAAAAGGG CTACCCGATC
GTCGGGCAGG ACACCGACGG CACTGTAACC CCGCAGGATG CCGGCATGGA GTGGATCGTC
TCCAAGGCCA AGGACTTCAT CGGCAAGCGC TCCTACTCCC GCGTGGACGC CCAGCGTGAA
GACCGCAAGC ACCTGGTCAG CGTCCTTCCC GTGGACCGCA CGCTGCGGCT GCCCGAAGGC
ACCCAGTTGG TGGAAAAGGG ACGCTCCACC AACCCCGCCT ACGGCCCCGT GCCGATGGAA
GGGTTCGTCA CCTCCAGCTA CCACAGCGCA GCGCTGGGCC GTTCCTTCGG CCTGGCCCTG
ATCAAGAACG GACGCAACCG CATCGGCGAA ACGCTGATTG CTGCCGCCGG CGACCAGCTG
GTGGACGTTG TTGTTGCAGA GACAGTGCTT TTTGACTCCG AAGGGACCCG CAAAGATGGC
TGA
 
Protein sequence
MTSQNARLAT GGRIDRTISW RFTVDGEEFT GHPGDTLASA LLANGRIAAG NSLYEDRHRG 
IMSAGVEEAN ALVRVEARFP GHVAESMLPA TTVSLVDGLQ ALQLNGLGKL DPAEDRAEYD
KKYVHTDVLV IGGGPAGLAA AREAVRTGAR VMLLDDQPEL GGSLLSGSMA EGLAETIEGK
PALEWVADVE AELVSGAEST VLNRTTAFGA YDANYVIAVQ NRTDHLTSPA APGVSRQRIW
HIRASQVVLA PGAHERPLVF ENNDRPGIML ASAVRSYLNR YAVAAGQRVV ISTTNDSAYA
TAADLAAAGV KVAAVVDARP KLTAVATAAV ESGIRVLIGS AVANTSADSA GRLDGVTVRS
INDDGELTSG VEQIAADLLA VSGGWSPLVH LHSQRQGKLR WDDELAAFVP STEVPNQQTI
GSGRGSFATE DCLAEGISAG AKAAIAAGFE SAVEPSVLPE LKASAPTRQL WLVPGEEGTP
DDWHHHFVDF QRDQSVADVL RSTGAGMRSV EHIKRYTSIS TANDQGKTSG VNAIGVIAAA
LRTAGEASRG IGDIGTTTYR APFTPVAFAA LAGRQRGELF DPARITSIQP WHVAKGALFE
DVGQWKRPWY YPQGGEDMDA AVLRECAAVR DSVGFMDATT LGKIEIRGKD AGEFLNRVYT
NAFKKLAPGS ARYGVMCLAD GMIFDDGVTL RLDEDTFFMT TTTGGAAKVL DHLEEWLQTE
WPELDVQCTS VTEQWNTIAV VGPKSREVIA KVAPELAANG GLDAENFPFM TFRETTLASG
VRARVCRISF SGELAYEINV PAWYGLNTWE SVAAAGAEFN ITPYGTETMH VLRAEKGYPI
VGQDTDGTVT PQDAGMEWIV SKAKDFIGKR SYSRVDAQRE DRKHLVSVLP VDRTLRLPEG
TQLVEKGRST NPAYGPVPME GFVTSSYHSA ALGRSFGLAL IKNGRNRIGE TLIAAAGDQL
VDVVVAETVL FDSEGTRKDG