Gene Sare_4279 details

Gene Information

Locus tag: Sare_4279
Symbol:
ID: 5706991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4854504
End bp: 4855700
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 75%
IMG OID: 641273698
Product: glycine oxidase ThiO
Protein accession: YP_001539051
Protein GI: 159039798
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR02352] glycine oxidase ThiO
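
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4854504
end_bp = 4855700
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1197 bp is 399 codons, of which 398 encode amino acids.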


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.196313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00513964
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCTGACCG GTGCACCCGG TGTAGCCCCG CAGTTCGGTC GGAACCCGCA GCACGGGCCT 
GACGTGGCGG TGGTGGGGGC GGGGCCGATC GGGCTGGCGA TCGCCTGGCG ATGCGCAGCG
CGCGGGCTGC GGGTCGTGGT GTACGACCCG GCCCCTGGTT CGGGCGCGGC GCACGCCGCC
GCGGGGATGC TCGCGCCGGT CGCCGAGGCG TACTTCGGCG AGCACGAGCT GACCGGCCTG
CTCACCGAGT CGGCGGCCCG CTGGCCGGCG TTCGCTGCCG AGTTGTCCGC CGCATCCGGC
ACCGATACGG GCTACCGCGG TGAGGGCACG TTGATGGTCG GGCTCACCGC CGACGATCTC
GCCGTGGCCC GCCGGTTGTG GGCCTACCAA CAGGGGCTGG GGTTGCCAGT CACCCCGCTG
CGTCCCTCCG AACTACGAGA CCGTGAGCCG GCGCTGTCAC CTCGCACGCG TGGTGGCGCC
TACGCCGGTA CCGATCACCA GGTGGACCCG CGTCGGCTGG TGGCGGCACT GCGTACCGCC
ACCGAGCGGG CCGGGGGGAC GCTGGTGCCG GCCCCGGTCC ACCGGTTGGC CGACCTGACC
GCGGGAATCA CGGTGGTCGC CGCCGGCTGT GGCGCCGCCG CGCTGACCGG GCTGCCGGTA
CGCCCGGTGA AGGGTCAGGT GCTTCGGCTC CGCGCCCCCG GTGCGCCGGG CTTCCAGCAC
GTGATCCGGG GATTCGCCGA CGGCGAGCAG GTATATCTGG TTCCCCGGGA GGACGGGGAG
GTCGTGGTCG GGGCGACCTC GGAGGAGCGC ACTGACACCA CGGTGACCAG TGGTGCGGTG
CTGCGGTTGC TCCGGGCCGC CACCGACCTG GTGCCCGAGG TGGCCGAGTA CGAGCTGATC
GAGGCACTCG CCGGGCTGCG TCCGGGTACC CCCGACAACG CGCCGATCCT CGGCCCGCTG
CCCGGGCGGC CGGCGGTACT CGCCGCGACC GGGCACCACC GGCACGGGAT CGTGCTCACC
CCGGTCACCG CCGACCTGAT TGCCGACCTG ATCGTCACCG GTACGCCAGA CCCGCTGCTC
GCCCCGTTCA CGCCGGAGCG CCTCGGGCCG GCCGCGTCCA GCCAGCCAGT CACCGCCGCC
GCGGCCCGCG GACCCGCCGG GGCCCGTCCG ACCACACAGG AGGAATCGTG GAACTGA
 
Protein sequence
MLTGAPGVAP QFGRNPQHGP DVAVVGAGPI GLAIAWRCAA RGLRVVVYDP APGSGAAHAA 
AGMLAPVAEA YFGEHELTGL LTESAARWPA FAAELSAASG TDTGYRGEGT LMVGLTADDL
AVARRLWAYQ QGLGLPVTPL RPSELRDREP ALSPRTRGGA YAGTDHQVDP RRLVAALRTA
TERAGGTLVP APVHRLADLT AGITVVAAGC GAAALTGLPV RPVKGQVLRL RAPGAPGFQH
VIRGFADGEQ VYLVPREDGE VVVGATSEER TDTTVTSGAV LRLLRAATDL VPEVAEYELI
EALAGLRPGT PDNAPILGPL PGRPAVLAAT GHHRHGIVLT PVTADLIADL IVTGTPDPLL
APFTPERLGP AASSQPVTAA AARGPAGARP TTQEESWN
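
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates this on the first 30 and last 27 nucleotides of the gene sequence. It is a simplified, hypothetical helper (`translate_cds` is not part of any library) and handles only a subset of the table 11 start codons (ATG, GTG, TTG).

```python
from itertools import product

# Standard amino-acid assignments in TCAG order; translation table 11 uses
# the same assignments and differs mainly in which codons may start a CDS.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiation codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, is_cds_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        # An initiation codon at the very start of a CDS is read as methionine.
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 27 nt of the gene sequence in this record.
first_30 = "GTGCTGACCG GTGCACCCGG TGTAGCCCCG"
last_27 = "ACCACACAGG AGGAATCGTG GAACTGA"

print(translate_cds(first_30))                     # MLTGAPGVAP
print(translate_cds(last_27, is_cds_start=False))  # TTQEESWN
```

Both fragments match the corresponding ends of the protein sequence listed above, and the final TGA codon is the stop that accounts for the 398 aa length.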