Gene Hmuk_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2053 
Symbol 
ID8411588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1962652 
End bp1963701 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content70% 
IMG OID645020391 
ProductThreonine aldolase 
Protein accessionYP_003177873 
Protein GI257388100 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGAC ACGACGACCC GATCGACCTC CGATCAGACA CGGTGACTAG GCCCTCGACA 
GCCATGCGGG AGGCCGCCCG CAACGCCCCA GTCGGCGACG ACGTGTACGG CGAAGATCCC
ACCGTCGCCG AACTGGAGGC CCGTGCCGCC GGTCTGCTCG GGAAAGCCGA CGCGCTCTTC
GTTCCCAGCG GCACCATGGG CAACCAGATC GCCGTCCGGG CCCACACCGA ACGGGGACAG
GAACTCCTGC TGGATCGTGA GTCACACATC TATCGCTGGG AACTCGGCGG GACCGCCCAG
CACGCACAGG TCCAGTGTCG CACGGTCGAC GCCAGCGAGC GCTGCGTACC GACCCCCGAA
CAGATCAGCG AGGCGTTCGT CGCCGAGGAC CTGCACCGAC CGGGGACCGG TCTGGTGACC
CTGGAGAACA CGCACAACTA CCGCGGCGGC GTCGCGGTCC CCGAATCCCA CGTCGACGCC
GCGTGTGACG CGGCACACGC TCTCGGCGTG CCGGTCCACC TCGACGGCGC GCGGCTGTGG
AACGCCGCGG TCGCGCTCGA CACCGCGCCG GCCGCGCTCG CCCGAGAAGC GGACTCGGTG
ATGGCCTGCC TCTCCAAGGG ACTGGGCGCA CCCGTCGGCT CGGTCCTCGC GGGCACCGAG
TCGTTCGTCG ACGAGGCCCG TCGCCTCCGG AAGCTGTTTG GCGGCGGAAT GCGCCAGGCG
GGCATGATCG CGGCACCCGG CCTCGAAGCG CTCGACAACG TCGACCGGCT CGCCGACGAC
CACGAGAACG CACGGCGGCT GGCGACCGGT CTCGACGCGA TAGACGGCCT CCGCGTGCCG
ACACCGGAGA CCAACATCGT CGTCGTCGAC AGCGAACCCG CCGGGATCAC CAGCGACGCC
TTCGTCGAGG GCTGTGTGGC GCGTGGCGTT CGCTGTGGGA GCGTCTCCGA GTACACGACG
CGGCTGTGTA CCAACCTCGA CGTGGACCGC GCCGACGTCG ACGCGGCGAT CGATCGGATC
GGGCGCGTGG TCCGAGCGGC CACCGAATAG
 
Protein sequence
MSGHDDPIDL RSDTVTRPST AMREAARNAP VGDDVYGEDP TVAELEARAA GLLGKADALF 
VPSGTMGNQI AVRAHTERGQ ELLLDRESHI YRWELGGTAQ HAQVQCRTVD ASERCVPTPE
QISEAFVAED LHRPGTGLVT LENTHNYRGG VAVPESHVDA ACDAAHALGV PVHLDGARLW
NAAVALDTAP AALAREADSV MACLSKGLGA PVGSVLAGTE SFVDEARRLR KLFGGGMRQA
GMIAAPGLEA LDNVDRLADD HENARRLATG LDAIDGLRVP TPETNIVVVD SEPAGITSDA
FVEGCVARGV RCGSVSEYTT RLCTNLDVDR ADVDAAIDRI GRVVRAATE