Gene Htur_0024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_0024 
Symbol 
ID8740587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp23438 
End bp24583 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content67% 
IMG OID646510587 
ProductSarcosine oxidase 
Protein accessionYP_003401598 
Protein GI284163319 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGCAA CAGGAACCCG ATACGACGTT ATCGTGATCG GCGTCGGCGG AATGGGCAGC 
GCGACGGCCG CTCACCTCGC CGACCGCGGC AGCGACGTGC TCGGCCTCGA GCGCTACGAC
GTGCCAAATA CGATGGGCTC TTCCCACGGG ATCACCCGGA TCATCCGGCG CGCCTACTAC
GAGCACCCTT CCTACATTCC GCTCGTCGAA CGGGCCTACG AGCTCTGGGA CGACCTCGCG
GACGAGACCG GCCGCGACGT GATCCACCGG ACGGGATCGA TCGACGCCGG TCCCCCGGAT
AACATCGTCT TCGAGGGGTC GCTGCGCTCC TGCGAGGAAC ACGACATTCC CCACGAGGTC
CTCACGAGCG CGGAGGTCGC CGAGCGGTTC CCCGGCTACG ACCTCCCCGA GGGGTACAAG
GCCTTGTACC AGCCCGATGG CGGGTTCGTG GTCCCCGAAC AGGCGATCGT CGGCCACGTC
GAGACGGCCC AGGCGGCGGG CGCCGAGGTG CGCGCCCGCG AGCGCGTCCT CGAGTGGGAG
TCGACGTCGG ACGAGGGCGT CCGCGTCGAA ACCGATCGCG GGACCTACGA AGCCGAGAAC
ATGGTGCTCG CCGCGGGGGC GTGGAACTAC AAGTTCGCCG ACGTGCTCGA GGATCTCGCG
GTCCCCGAGC GGCAGGTACT CGGCTGGTTC CAGCCCGATC GGCCGTCGAC GTTCGAACCC
GAGAACTTCC CGGTCTGGAA CCTCAAGGTC CCCGAAGGCC GCTTCTACGG GCTGCCGATC
TACGACGTGC CGGGGTTCAA GATCGGCAAG TACCACCACC GGGACAAACA GGTCGATCCC
GACGACTACG AGAGGGAGCC GAACCGCGAG GACGAGCGAC TCCTCCGCGA GGTTACTGAG
AACTACTTCG CCGACGCCGC CGGGACGACG ATGCGGCTCG CGACCTGCAT GTTCACCAAT
TCGCCCGACG AGCACTTCAT CCTCGATACG CTCCCCGAGC ACCCACAGGT GGCCGTCGGC
GCGGGCTTCT CGGGCCACGG CTTCAAGTTC GCCAGCGTCA TCGGCGAGAT CCTCGCCGAC
CTCGCGATCG ACGGCGACAC CGACCACCCG GTCGACATGT TCCGGTTCGA TCGGTTCGAC
GTCTGA
 
Protein sequence
MVATGTRYDV IVIGVGGMGS ATAAHLADRG SDVLGLERYD VPNTMGSSHG ITRIIRRAYY 
EHPSYIPLVE RAYELWDDLA DETGRDVIHR TGSIDAGPPD NIVFEGSLRS CEEHDIPHEV
LTSAEVAERF PGYDLPEGYK ALYQPDGGFV VPEQAIVGHV ETAQAAGAEV RARERVLEWE
STSDEGVRVE TDRGTYEAEN MVLAAGAWNY KFADVLEDLA VPERQVLGWF QPDRPSTFEP
ENFPVWNLKV PEGRFYGLPI YDVPGFKIGK YHHRDKQVDP DDYEREPNRE DERLLREVTE
NYFADAAGTT MRLATCMFTN SPDEHFILDT LPEHPQVAVG AGFSGHGFKF ASVIGEILAD
LAIDGDTDHP VDMFRFDRFD V