Gene Htur_0018 details


Gene Information

Locus tag: Htur_0018
Symbol:
ID: 8740581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 18922
End bp: 20274
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 69%
IMG OID: 646510581
Product: Tryptophanase
Protein accession: YP_003401592
Protein GI: 284163313
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID:
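
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both are assumptions about how this record is reported, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(18922, 20274)
print(length)                     # 1353
print(protein_length_aa(length))  # 450
```

Under these assumptions the computed values agree with the reported gene length of 1353 bp and protein length of 450 aa.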


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATGG TCGCGTACAA GACGAAAGTA GCGGAACGGA TTCACCTCCC CTCTCGAGAC 
CGGCGCGAAC GGGCGCTGGC CGAGGCGGGG TACAACGTCT TCAATCTCGA CGCCGAGGAC
GTCTTCGTCG ACCTCCTGAC CGACAGCGGC ACCGGCGCGA TGAGCGACGC CCAGTGGGCG
GCCGTAATGC GCGGCGACGA GTCCTACGCC GGCTCGCGCA GCTTCGACGA CCTCGAGTCG
GCCGTCCGGG ACGTAATGGG TTTCTCGCGC GTCGTCCCGA CCCACCAGGG TCGCGGCGCG
GAGAACGTCC TCTACGGCAC GCTGCTCTCG GAGGGCGACG TCGCGCTCAA CAACACCCAC
TTCGACACGA CGCGGGCCCA CGTCGCGAAC CAGGGTGCCG ACCCGGTCGA TTGCCCCGTC
GAGGGGGCTC GCGACCTCGA GTCGGACGAG CCGTTCAAGG GGAACTTCTC GCTCGAGCGC
GCTCGCTCGG TCGTCGACGA GGTGGGCGCC GAGCGCGTGC CGCTGGTGAT CTTGACGATC
ACGAACAACT CGACGGCGGG TCAGCCGGTC TCCGTCGAGA ACACCCGCCG CGTCCGCGAC
TTCGCCGACG AGATCGGGGC GACGTTCGTC ATCGACGCCT GCCGGTTCGC CGAGAACGCC
GGCTTCGTCC GGCGGCGCGA GGACGAGTTC ACGGACGCCG ATATCGACGA GATCGCCCGC
GAACAACTCT CCTACGCCGA CGCGATCGTC ATGAGCGGCA AGAAGGACGG GCTGGCCAAC
GCCGGCGGCT TCGTCGCGAC CGACGACGAG GCGCTGTTCG AGCGGTGCAA GCAGCGAGCG
ATCCTCTACG AGGGCTTTCC CACGTACGGC GGCATGTCCG GCCGGGACGT CGCCGCGCTG
GCCGTCGGCC TCCGCGAGGC CGTCGAGGAG GCTTACGTCG CCGACCGCCT CGACGGCGTC
CGCGCGTTCG CGGACCTGCT CGAGGACGCC GGCGTCCCGA TCTACACGCC GCCCGGTGGT
CACGCCGTCT ACCTCGACGC CGGGACCGCA CTCCCGCACC TCGCACCCGA CGAGTTCCCT
GGTCAGGCAC TGGTCTGTGA ACTGTATCGA GAAGGCGGCG TCCGCGGGGT CGAACTCGGG
AGCTTCGCGT TCCCCGATAC GGACCGGCCG GAACTGGTCC GCCTCGCGGT GCCGCGTCGC
ACCTATCACA CCGAACACTT CGAACACGTC GCCGAGACCG CCGCGACGGT CCTCGAGAAG
CGAGAGGCGG TCTCCGGGCT CGAGATCGTT TCCGAGCCGG AAAACCGCGA GTTACGTCAC
TTCACGGCCG ACCTCGAGCC GCTGTCTGTA TGA
 
Protein sequence
MRMVAYKTKV AERIHLPSRD RRERALAEAG YNVFNLDAED VFVDLLTDSG TGAMSDAQWA 
AVMRGDESYA GSRSFDDLES AVRDVMGFSR VVPTHQGRGA ENVLYGTLLS EGDVALNNTH
FDTTRAHVAN QGADPVDCPV EGARDLESDE PFKGNFSLER ARSVVDEVGA ERVPLVILTI
TNNSTAGQPV SVENTRRVRD FADEIGATFV IDACRFAENA GFVRRREDEF TDADIDEIAR
EQLSYADAIV MSGKKDGLAN AGGFVATDDE ALFERCKQRA ILYEGFPTYG GMSGRDVAAL
AVGLREAVEE AYVADRLDGV RAFADLLEDA GVPIYTPPGG HAVYLDAGTA LPHLAPDEFP
GQALVCELYR EGGVRGVELG SFAFPDTDRP ELVRLAVPRR TYHTEHFEHV AETAATVLEK
REAVSGLEIV SEPENRELRH FTADLEPLSV
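
The sequences above are printed in space-separated blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the sketch assumes the sequence contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping one trailing stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGCGTATGG TCGCGTACAA GACGAAAGTA"))      # MRMVAYKTKV
print(translate("TTCACGGCCG ACCTCGAGCC GCTGTCTGTA TGA"))  # FTADLEPLSV
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field of the record.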