Gene Achl_3678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3678 
Symbol 
ID7295160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4089797 
End bp4090927 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content70% 
IMG OID643592084 
ProductSarcosine oxidase 
Protein accessionYP_002489722 
Protein GI220914413 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAACAA CCCTTGACAC CGTGGTGATC GGCGGCGGCG CCATGGGCTC CGCCGCGGCG 
TGGGCGCTGT CCCGGCGGGG ACGCCAGGTG ACCCTGGTGG AGCAGTTCGG GCCGGGACAC
ACGATCGGGG CGTCGCACGG CACCACACGG AACTTCAACC CCGGCTACCA CCGGCCCGAG
TACGTCGCCA TGGTGGCCGA ATCGCTGGAC CTCTGGAACG AGCTGGAACA GGAGAGCGGC
CAGACGCTCC TGGCACGGAC CGGCATCGTC ACGCACGGGC CCGAGCCCAT GCTGCCGGAC
GCCGCGGCCG CACTTGCCCA GGCCGGCCTG CGCGCCGAAT TCCTGCACCC GGACGAAGCC
GGCGAGCGCT GGCGCGGCAT TCGGTTCGAC CAGCAGGTCC TGTACATGCC CGACGGCGGC
CAACTCAACC CGGAAGCAGC CCTGCCGGCA TTCCAGCGCC TCGCCGCAGC CCGGGGCGCC
GACATCCGGC ACCACACCAA AGTGGTGTCC TTCGAGGTGG CGGACGACGG CGTCCGGCTG
GGGCTGGAAT CGGTTGCCGG CACCGAGATG GTCACCGCTG CGCAGGTTGT GGTGACGGCC
GGCGGCTGGA CGGAGAAGCT TCTGGGCGCT GCCGTGGGCG GACGCCTGCG GACGCCGAAG
CTCAGGGTGA CGCAGGAACA GCCCGCGCAT TTCCGGATTA CCGATTCCGA TGCGGTGTGG
CCGGGCTTCA ACCACTACCC GGGCGGCGGG TCACAGTACG CGGGGTGGTA CTCCCCGGTC
TACGGCATGC ACACCCCCGG CGAGGGCATC AAGGCAGGCT GGCATGGTGT TGGCCCGGTG
GTGGATCCAG ACCGGCGCAG CTTCGAGCCG GAGCCGCAGC AGCTCGCTGC CCTGCAAACC
TACGCGAGGA CCTGGCTGCC CGGCGTGGAC GCGGACGCCT TCGAGGCCAT CAGCTGCACC
TACACCACCA CGCCGGACGA GGACTTCATC CTGGACCGGA TGGGGCCCGT GGTGATCGGC
GCGGGGTTCT CCGGGCACGG GTTCAAGTTC ACTCCCGTGG TGGGCCGGAT CCTTGCCGAC
CTCGCCACGG GCACCCGCCC TGCCCCCGCT ATCTTCAGCG CCTCCCGCTA G
 
Protein sequence
MTTTLDTVVI GGGAMGSAAA WALSRRGRQV TLVEQFGPGH TIGASHGTTR NFNPGYHRPE 
YVAMVAESLD LWNELEQESG QTLLARTGIV THGPEPMLPD AAAALAQAGL RAEFLHPDEA
GERWRGIRFD QQVLYMPDGG QLNPEAALPA FQRLAAARGA DIRHHTKVVS FEVADDGVRL
GLESVAGTEM VTAAQVVVTA GGWTEKLLGA AVGGRLRTPK LRVTQEQPAH FRITDSDAVW
PGFNHYPGGG SQYAGWYSPV YGMHTPGEGI KAGWHGVGPV VDPDRRSFEP EPQQLAALQT
YARTWLPGVD ADAFEAISCT YTTTPDEDFI LDRMGPVVIG AGFSGHGFKF TPVVGRILAD
LATGTRPAPA IFSASR