Gene Information |
Locus tag | Htur_0024 |
Symbol | |
ID | 8740587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 23438 |
End bp | 24583 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646510587 |
Product | Sarcosine oxidase |
Protein accession | YP_003401598 |
Protein GI | 284163319 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
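The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 23438
end_bp = 24583
protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 1146 0 381
```

Under these assumptions the computed values agree with the listed Gene Length (1146 bp) and Protein Length (381 aa).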
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGCAA CAGGAACCCG ATACGACGTT ATCGTGATCG GCGTCGGCGG AATGGGCAGC GCGACGGCCG CTCACCTCGC CGACCGCGGC AGCGACGTGC TCGGCCTCGA GCGCTACGAC GTGCCAAATA CGATGGGCTC TTCCCACGGG ATCACCCGGA TCATCCGGCG CGCCTACTAC GAGCACCCTT CCTACATTCC GCTCGTCGAA CGGGCCTACG AGCTCTGGGA CGACCTCGCG GACGAGACCG GCCGCGACGT GATCCACCGG ACGGGATCGA TCGACGCCGG TCCCCCGGAT AACATCGTCT TCGAGGGGTC GCTGCGCTCC TGCGAGGAAC ACGACATTCC CCACGAGGTC CTCACGAGCG CGGAGGTCGC CGAGCGGTTC CCCGGCTACG ACCTCCCCGA GGGGTACAAG GCCTTGTACC AGCCCGATGG CGGGTTCGTG GTCCCCGAAC AGGCGATCGT CGGCCACGTC GAGACGGCCC AGGCGGCGGG CGCCGAGGTG CGCGCCCGCG AGCGCGTCCT CGAGTGGGAG TCGACGTCGG ACGAGGGCGT CCGCGTCGAA ACCGATCGCG GGACCTACGA AGCCGAGAAC ATGGTGCTCG CCGCGGGGGC GTGGAACTAC AAGTTCGCCG ACGTGCTCGA GGATCTCGCG GTCCCCGAGC GGCAGGTACT CGGCTGGTTC CAGCCCGATC GGCCGTCGAC GTTCGAACCC GAGAACTTCC CGGTCTGGAA CCTCAAGGTC CCCGAAGGCC GCTTCTACGG GCTGCCGATC TACGACGTGC CGGGGTTCAA GATCGGCAAG TACCACCACC GGGACAAACA GGTCGATCCC GACGACTACG AGAGGGAGCC GAACCGCGAG GACGAGCGAC TCCTCCGCGA GGTTACTGAG AACTACTTCG CCGACGCCGC CGGGACGACG ATGCGGCTCG CGACCTGCAT GTTCACCAAT TCGCCCGACG AGCACTTCAT CCTCGATACG CTCCCCGAGC ACCCACAGGT GGCCGTCGGC GCGGGCTTCT CGGGCCACGG CTTCAAGTTC GCCAGCGTCA TCGGCGAGAT CCTCGCCGAC CTCGCGATCG ACGGCGACAC CGACCACCCG GTCGACATGT TCCGGTTCGA TCGGTTCGAC GTCTGA
|
Protein sequence | MVATGTRYDV IVIGVGGMGS ATAAHLADRG SDVLGLERYD VPNTMGSSHG ITRIIRRAYY EHPSYIPLVE RAYELWDDLA DETGRDVIHR TGSIDAGPPD NIVFEGSLRS CEEHDIPHEV LTSAEVAERF PGYDLPEGYK ALYQPDGGFV VPEQAIVGHV ETAQAAGAEV RARERVLEWE STSDEGVRVE TDRGTYEAEN MVLAAGAWNY KFADVLEDLA VPERQVLGWF QPDRPSTFEP ENFPVWNLKV PEGRFYGLPI YDVPGFKIGK YHHRDKQVDP DDYEREPNRE DERLLREVTE NYFADAAGTT MRLATCMFTN SPDEHFILDT LPEHPQVAVG AGFSGHGFKF ASVIGEILAD LAIDGDTDHP VDMFRFDRFD V
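The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the display grouping, compute a GC fraction, and check that a nucleotide sequence has the shape of a complete CDS. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical and not part of any database tool; the example input is only the first three blocks of the gene sequence, so its GC value is not the gene-wide figure listed in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(text: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an ungapped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three display blocks of the gene sequence above (an excerpt only).
excerpt = clean_sequence("ATGGTCGCAA CAGGAACCCG ATACGACGTT")
print(len(excerpt), round(gc_fraction(excerpt), 2))
```

To check the full gene, paste the complete Gene sequence value into `clean_sequence` and pass the result to `gc_fraction` and `looks_like_complete_cds`.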