Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3678 |
Symbol | |
ID | 7295160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 4089797 |
End bp | 4090927 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643592084 |
Product | Sarcosine oxidase |
Protein accession | YP_002489722 |
Protein GI | 220914413 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 101 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAACAA CCCTTGACAC CGTGGTGATC GGCGGCGGCG CCATGGGCTC CGCCGCGGCG TGGGCGCTGT CCCGGCGGGG ACGCCAGGTG ACCCTGGTGG AGCAGTTCGG GCCGGGACAC ACGATCGGGG CGTCGCACGG CACCACACGG AACTTCAACC CCGGCTACCA CCGGCCCGAG TACGTCGCCA TGGTGGCCGA ATCGCTGGAC CTCTGGAACG AGCTGGAACA GGAGAGCGGC CAGACGCTCC TGGCACGGAC CGGCATCGTC ACGCACGGGC CCGAGCCCAT GCTGCCGGAC GCCGCGGCCG CACTTGCCCA GGCCGGCCTG CGCGCCGAAT TCCTGCACCC GGACGAAGCC GGCGAGCGCT GGCGCGGCAT TCGGTTCGAC CAGCAGGTCC TGTACATGCC CGACGGCGGC CAACTCAACC CGGAAGCAGC CCTGCCGGCA TTCCAGCGCC TCGCCGCAGC CCGGGGCGCC GACATCCGGC ACCACACCAA AGTGGTGTCC TTCGAGGTGG CGGACGACGG CGTCCGGCTG GGGCTGGAAT CGGTTGCCGG CACCGAGATG GTCACCGCTG CGCAGGTTGT GGTGACGGCC GGCGGCTGGA CGGAGAAGCT TCTGGGCGCT GCCGTGGGCG GACGCCTGCG GACGCCGAAG CTCAGGGTGA CGCAGGAACA GCCCGCGCAT TTCCGGATTA CCGATTCCGA TGCGGTGTGG CCGGGCTTCA ACCACTACCC GGGCGGCGGG TCACAGTACG CGGGGTGGTA CTCCCCGGTC TACGGCATGC ACACCCCCGG CGAGGGCATC AAGGCAGGCT GGCATGGTGT TGGCCCGGTG GTGGATCCAG ACCGGCGCAG CTTCGAGCCG GAGCCGCAGC AGCTCGCTGC CCTGCAAACC TACGCGAGGA CCTGGCTGCC CGGCGTGGAC GCGGACGCCT TCGAGGCCAT CAGCTGCACC TACACCACCA CGCCGGACGA GGACTTCATC CTGGACCGGA TGGGGCCCGT GGTGATCGGC GCGGGGTTCT CCGGGCACGG GTTCAAGTTC ACTCCCGTGG TGGGCCGGAT CCTTGCCGAC CTCGCCACGG GCACCCGCCC TGCCCCCGCT ATCTTCAGCG CCTCCCGCTA G
|
Protein sequence | MTTTLDTVVI GGGAMGSAAA WALSRRGRQV TLVEQFGPGH TIGASHGTTR NFNPGYHRPE YVAMVAESLD LWNELEQESG QTLLARTGIV THGPEPMLPD AAAALAQAGL RAEFLHPDEA GERWRGIRFD QQVLYMPDGG QLNPEAALPA FQRLAAARGA DIRHHTKVVS FEVADDGVRL GLESVAGTEM VTAAQVVVTA GGWTEKLLGA AVGGRLRTPK LRVTQEQPAH FRITDSDAVW PGFNHYPGGG SQYAGWYSPV YGMHTPGEGI KAGWHGVGPV VDPDRRSFEP EPQQLAALQT YARTWLPGVD ADAFEAISCT YTTTPDEDFI LDRMGPVVIG AGFSGHGFKF TPVVGRILAD LATGTRPAPA IFSASR
|
| |