Gene Arth_0077 details

Gene Information

Locus tag: Arth_0077
Symbol:
ID: 4447479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 78923
End bp: 80023
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 70%
IMG OID: 639687872
Product: sarcosine oxidase
Protein accession: YP_829578
Protein GI: 116668645
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01377] sarcosine oxidase, monomeric form
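
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 78923
end_bp = 80023
gene_length_bp = 1101
protein_length_aa = 366


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 80023 - 78923 + 1 = 1101 and 1101 / 3 - 1 = 366.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```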


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGTTG ACGTCGTCGT GGTCGGCGGA GGGGCGATGG GGTCCGCTGC TGCGTGGCAG 
CTTGCACGCC GCGGCAGGTC CGTTGTTCTC CTGGAGCAAT TCGAACAGGG GCATCACATC
GGCGCCTCCC ACGGCGCGAC CCGCAATTTC AACATGGCCT ACGCCGAGGG CGATTACCTG
GACCTGGTCA CCGAGGCCAA GGATCTCTGG GACGAGCTCG AGGGTGCAAC GGGCATGCAG
CTCCTGGACC TCGTGGGCCT GGTGAACCAC GGCAACGTCC GGCGGCTGCG GGACGTCCGG
TCGTCACACG CCGAGCGCGG CATTGAGAGC CACTTCCTTC CCGCAACAGA GGCCGCAGAG
CGCTGGCGGG GGATGAACTT CAGGGGTGAC GTCCTGGTGG TGCCCGGCTC CGGACGGGTC
CGTGCCGCTG ACGCGCTGCT GGCGCTTCGC CACGCCGCCG AGGCGCACGG CGCCCGCTTT
GAATACTCGA CGCCGGCCCG CGACATCCGC GTTGAGGGCG ACCGCGCCGT CGTTGTCATT
GACTCCGGCG AGATCACCGC GCGCCGTGTG GTGGTCACCG CCGGCGCATG GACCAGCAAG
CTTCTCGGGA GCACGGTCCC GCTCCCGAGG CTCGTGGTCA CGCAGGAGCA GCCGGCGCAC
TTCACGCCCT TGGACGACTC GCTGACCTGG CCCAGCTTCA ACCACAACCC CGATCCGGAC
GACCCCCGCG ACGCGTACTG GTACGGCCCC GTCTATGGCA TGCTCACCCC GGGCGAGGGC
ATCAAGGCAG GCTGGCACGG CGTGGGGCCG GTGACGGACC CGGACGGGCG CAGCTTCACG
CCCGAACCTG TCCAGCTGGA GGCGCTGGTG CGCTACGTCC GGGAGTGGCT GCCGGGCGTG
GATGCGGAGT CAGCGGCTCC CATGAGTTGC ACGTACACCA GCACCGCCAA CGAGGACTTC
GTGCTGGACC GTTTCGGTCC CGTAGTGGTG GGGGCCGGCT TCTCCGGCCA CGGGTTCAAG
TTCACCCCGG CCGTTGGCCG GGTGCTTGCA GACCTGGCCG ACGGCGGGGG CGCACCCGCC
CGTTTCACCG CCCGGCGCTA G
 
Protein sequence
MEVDVVVVGG GAMGSAAAWQ LARRGRSVVL LEQFEQGHHI GASHGATRNF NMAYAEGDYL 
DLVTEAKDLW DELEGATGMQ LLDLVGLVNH GNVRRLRDVR SSHAERGIES HFLPATEAAE
RWRGMNFRGD VLVVPGSGRV RAADALLALR HAAEAHGARF EYSTPARDIR VEGDRAVVVI
DSGEITARRV VVTAGAWTSK LLGSTVPLPR LVVTQEQPAH FTPLDDSLTW PSFNHNPDPD
DPRDAYWYGP VYGMLTPGEG IKAGWHGVGP VTDPDGRSFT PEPVQLEALV RYVREWLPGV
DAESAAPMSC TYTSTANEDF VLDRFGPVVV GAGFSGHGFK FTPAVGRVLA DLADGGGAPA
RFTARR
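
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, measured for GC content, and translated is given below. It assumes the sequence is shown in the coding direction and relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGAAGTTG ACGTCGTCGT GGTCGGCGGA GGGGCGATGG GGTCCGCTGC TGCGTGGCAG"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MEVDVVVVGGGAMGSAAAWQ
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field in the record.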