Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4279 |
Symbol | |
ID | 5706991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4854504 |
End bp | 4855700 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641273698 |
Product | glycine oxidase ThiO |
Protein accession | YP_001539051 |
Protein GI | 159039798 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.196313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00513964 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTGACCG GTGCACCCGG TGTAGCCCCG CAGTTCGGTC GGAACCCGCA GCACGGGCCT GACGTGGCGG TGGTGGGGGC GGGGCCGATC GGGCTGGCGA TCGCCTGGCG ATGCGCAGCG CGCGGGCTGC GGGTCGTGGT GTACGACCCG GCCCCTGGTT CGGGCGCGGC GCACGCCGCC GCGGGGATGC TCGCGCCGGT CGCCGAGGCG TACTTCGGCG AGCACGAGCT GACCGGCCTG CTCACCGAGT CGGCGGCCCG CTGGCCGGCG TTCGCTGCCG AGTTGTCCGC CGCATCCGGC ACCGATACGG GCTACCGCGG TGAGGGCACG TTGATGGTCG GGCTCACCGC CGACGATCTC GCCGTGGCCC GCCGGTTGTG GGCCTACCAA CAGGGGCTGG GGTTGCCAGT CACCCCGCTG CGTCCCTCCG AACTACGAGA CCGTGAGCCG GCGCTGTCAC CTCGCACGCG TGGTGGCGCC TACGCCGGTA CCGATCACCA GGTGGACCCG CGTCGGCTGG TGGCGGCACT GCGTACCGCC ACCGAGCGGG CCGGGGGGAC GCTGGTGCCG GCCCCGGTCC ACCGGTTGGC CGACCTGACC GCGGGAATCA CGGTGGTCGC CGCCGGCTGT GGCGCCGCCG CGCTGACCGG GCTGCCGGTA CGCCCGGTGA AGGGTCAGGT GCTTCGGCTC CGCGCCCCCG GTGCGCCGGG CTTCCAGCAC GTGATCCGGG GATTCGCCGA CGGCGAGCAG GTATATCTGG TTCCCCGGGA GGACGGGGAG GTCGTGGTCG GGGCGACCTC GGAGGAGCGC ACTGACACCA CGGTGACCAG TGGTGCGGTG CTGCGGTTGC TCCGGGCCGC CACCGACCTG GTGCCCGAGG TGGCCGAGTA CGAGCTGATC GAGGCACTCG CCGGGCTGCG TCCGGGTACC CCCGACAACG CGCCGATCCT CGGCCCGCTG CCCGGGCGGC CGGCGGTACT CGCCGCGACC GGGCACCACC GGCACGGGAT CGTGCTCACC CCGGTCACCG CCGACCTGAT TGCCGACCTG ATCGTCACCG GTACGCCAGA CCCGCTGCTC GCCCCGTTCA CGCCGGAGCG CCTCGGGCCG GCCGCGTCCA GCCAGCCAGT CACCGCCGCC GCGGCCCGCG GACCCGCCGG GGCCCGTCCG ACCACACAGG AGGAATCGTG GAACTGA
|
Protein sequence | MLTGAPGVAP QFGRNPQHGP DVAVVGAGPI GLAIAWRCAA RGLRVVVYDP APGSGAAHAA AGMLAPVAEA YFGEHELTGL LTESAARWPA FAAELSAASG TDTGYRGEGT LMVGLTADDL AVARRLWAYQ QGLGLPVTPL RPSELRDREP ALSPRTRGGA YAGTDHQVDP RRLVAALRTA TERAGGTLVP APVHRLADLT AGITVVAAGC GAAALTGLPV RPVKGQVLRL RAPGAPGFQH VIRGFADGEQ VYLVPREDGE VVVGATSEER TDTTVTSGAV LRLLRAATDL VPEVAEYELI EALAGLRPGT PDNAPILGPL PGRPAVLAAT GHHRHGIVLT PVTADLIADL IVTGTPDPLL APFTPERLGP AASSQPVTAA AARGPAGARP TTQEESWN
|
| |