Gene Hmuk_2239 details

Gene Information

Locus tag: Hmuk_2239
Symbol:
ID: 8411779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2158799
End bp: 2159881
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 67%
IMG OID: 645020582
Product: carboxylate-amine ligase
Protein accession: YP_003178059
Protein GI: 257388286
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02050] uncharacterized enzyme
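
As a quick consistency check on the fields above, the gene and protein lengths can be derived from the coordinates. The sketch below assumes that the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2158799
end_bp = 2159881

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the derived values agree with the Gene Length and Protein Length fields.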


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGAGA CGGGTTCCGC CGCCGCGTTC GACCGGATGG GCACGCTCGG GATCGAAGAG 
GAGTTCTACG TCGTCGACGA GTACGGCCGG CCGACTGCAG GGTCGGACGA GCTGGTCTAC
GAGAGCGACC CGCCGACGAT CCTCGACGGA CGGCTCGACC ACGAACTGTT CAAGACCGTC
GTCGAGACTC AGACGCCGAC ACTCGACGGG CTCGATGACG CACGATCGGC TCTGGAAGAC
GTTCGCAACG CACTGGTCGA CCACGCCGAG GCCAACGGCT TCCGGATCGC GGCGGCCGGG
CTCCACCCGC TCGCGAAGTG GCGCGAACTC GAACACGCCC AGAAACCCCG CTACCGCTCC
CAGCTGGATC GCATCCAGTA CCCCCAGCAC CGCAACACCA CAGCGGGCCT CCACGTCCAC
GTCGGCGTCG ACGACGCGGA CAAGGCCGTC TGGATCGCCA ACGAACTCCG CTGGCACCTC
CCGCTCGTGC TCGCGCTGTC TGCGAACTCG CCGTACTGGA ACGGCTACGA CACCGGGCTC
GCGTCCGCTC GTGCGAAGAT CTTCGAGGGG CTCCCGAACA CGGGAATGCC GACCGCGTTC
GAGTCGTACG CGGCCTACGA GCAGTTCGAG CGCCGGATGG TCGAGACCGA CAGCATCCGC
GACCGGGGAG AGCTGTGGTT CGACGTGCGA CCACACTCCG GACACGGTAC CGTCGAAGTC
CGGACGCCGG ACGGCCAGCG GAACCCCGAG TACGTGCTGG CCTTCGTCGA GTACGTCCAC
GCGCTGGTCG CGTCCCTGGC CGACCAGTTC GAAGACGGTG CCTCGGGAAC CGATACCCGG
CGTGAGTATC TGGACGAGAA CAAGTGGCGA GCGATGCGCC ACGGTCACGA CGCCTCGCTG
CTCACCCGAA CGGGGTCGAC GGCTCCGCTG GGAGAACTGG TCGACCGGGA GTGTGACCGA
CTCGGAATCG ACGGCCTCCG GACGCTCTAC GACCGCGAGA GCGGTGCGAA TCGCCAGCGC
CGCCTCCGGA AGCAAGGCGT CGCCACTCTC GCAGACGACC TCGTCTTGCA AAAGAACGCG
TAA
 
Protein sequence
MEETGSAAAF DRMGTLGIEE EFYVVDEYGR PTAGSDELVY ESDPPTILDG RLDHELFKTV 
VETQTPTLDG LDDARSALED VRNALVDHAE ANGFRIAAAG LHPLAKWREL EHAQKPRYRS
QLDRIQYPQH RNTTAGLHVH VGVDDADKAV WIANELRWHL PLVLALSANS PYWNGYDTGL
ASARAKIFEG LPNTGMPTAF ESYAAYEQFE RRMVETDSIR DRGELWFDVR PHSGHGTVEV
RTPDGQRNPE YVLAFVEYVH ALVASLADQF EDGASGTDTR REYLDENKWR AMRHGHDASL
LTRTGSTAPL GELVDRECDR LGIDGLRTLY DRESGANRQR RLRKQGVATL ADDLVLQKNA
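
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard genetic code, whose amino-acid assignments are the same as those of translation table 11 (the two differ only in permitted start codons).

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases).
first_line = "ATGGAGGAGA CGGGTTCCGC CGCCGCGTTC GACCGGATGG GCACGCTCGG GATCGAAGAG"

print(translate(first_line))  # MEETGSAAAFDRMGTLGIEE
print(f"{gc_fraction(first_line):.1%}")
```

The translated fragment matches the first 20 residues of the listed protein sequence. Passing the full gene sequence instead of the first line would allow the result to be compared against the complete protein sequence and the GC content field.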