Gene Achl_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3521 
Symbol 
ID7295002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3899136 
End bp3900305 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content72% 
IMG OID643591927 
ProductSarcosine oxidase 
Protein accessionYP_002489566 
Protein GI220914257 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0264649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACC CGCTTCCGGA CCCGTCCGAT TATGTGGTGG TGGGCGCCGG CCTGGCCGGC 
GCCGCCACCG CCTGGCAGCT GGCCGCCCGC GGGCACCAGG TCACCCTGCT GGAGCGCGAC
GTTCCGGCCG CACACAACGC CAGCTCCCAC GGCTCCGCCA GGATCTTCCG CTACGCCTAC
CCGGACCCCT TCTACACGCA GGCCGTGCTG GACTCGAAGG CCCTCTGGGA CAGCCTGGCC
GCCGAGGCCG GAACTGAGCT GATCACGCCC TTCGGGGCGG TGGACTACGG CCCCACGCGG
CAGCCTGCCT TGCTGGCCGG CGTCCTGGCA GGCGCCGGGA TCGGCCACGA GCTGCTCTCC
GCCGCCGAGG CCCGGTCCCG CTGGCCCCAG ATTGCCTTCG ACACCGAGGT CCTCTGGCAC
CCCGGGGCAG GGGTCATCGA TGCCGAGACC TCCGTGAACG CGATGGTGGC GCTCGCCGTC
CGCAACGGCG CCCGGGTGCT GACCGGCTGG ACTGTGGAGC GGGTGGAACG CCTCGGCCGC
GGGACCGGCG CCGGCTACCG GCTGCACTCC GCCGCAGGGG AAACGTTCGA CGCCGGCAAC
GTCGTCATCA GCGCCGGCGG CTGGCTGCCG CGGCTGCTGG ACTCCCTGCC GCTGCCGGCC
GGCTTCCTCG CCGGCCTGCC CGAGTTCACC GTCCGGCAGG AGCAGGCGTT CCACTTCCGC
TACCGCGACG GCTACCCCGG CGCCACCTGG CCCACGTTCA TCCACAAGGC CGCGGACATC
CAGGCCTACG GACTTCCCGG CGGCCGGGAC GCCGGATTCG CCGGCCAGAA AGTGGCCGAA
TACAACGGCG GACCGCTGAT CCCGTCGGCC GCGGACCAGA CCGGCCAGGT GGACCCGGCC
AACCGCGCCC GCGTGGTGGA CTACGTCAGC CGATACCTGC CCGGCCTGGA CCCTGAGCCG
TACGCCGAAA CCACCTGCCT GTTCACCAAC ACCCCCAATG AGGACTTCCT CATCGACCGG
GCGGACAACC TCACCGTCGT CTCGCCCTGC TCCGGGCACG GCGCTAAATT CGCCCCGCTG
ATCGGACAGT GGGCGGCGGA CCTGGCCACC GGCGCAGCGG TGGTCCCGGA CCGGTTCCGC
AGCACCGCCT CACCAGCACT CACCACGTAG
 
Protein sequence
MGDPLPDPSD YVVVGAGLAG AATAWQLAAR GHQVTLLERD VPAAHNASSH GSARIFRYAY 
PDPFYTQAVL DSKALWDSLA AEAGTELITP FGAVDYGPTR QPALLAGVLA GAGIGHELLS
AAEARSRWPQ IAFDTEVLWH PGAGVIDAET SVNAMVALAV RNGARVLTGW TVERVERLGR
GTGAGYRLHS AAGETFDAGN VVISAGGWLP RLLDSLPLPA GFLAGLPEFT VRQEQAFHFR
YRDGYPGATW PTFIHKAADI QAYGLPGGRD AGFAGQKVAE YNGGPLIPSA ADQTGQVDPA
NRARVVDYVS RYLPGLDPEP YAETTCLFTN TPNEDFLIDR ADNLTVVSPC SGHGAKFAPL
IGQWAADLAT GAAVVPDRFR STASPALTT