Gene Hmuk_0281 details


Gene Information

Locus tag: Hmuk_0281
Symbol:
ID: 8409779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 275649
End bp: 277178
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 66%
IMG OID: 645018606
Product: Carboxypeptidase Taq
Protein accession: YP_003176125
Protein GI: 257386352
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
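
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 275649
end_bp = 277178

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1530 bp
print(protein_length, "aa")  # 509 aa
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.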


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.053767
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.142613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGG AAGCCGACGC TGACGACGCA GAGACGCCCG ACACCTACGA GCAGTTCCGC 
GCTCACGTCG AGCAGCTGAC CTACGTCGGC GACGCCGCCG GTGTCCTCCA GTGGGATCAG
GAAGTGATGA TGCCCGACGA GGGGACACCC GCCCGCTCGA AGCAGTCGGC GGCGCTGTCG
ACGCTCTCGC ACGACCTCCT GACCGACGAC GACGTGGCCG AGTGGCTGGA CGAACTGGAG
GGATCGGACC TCGATCCCGA GCGGGAAGCG GTCGTCCGCG AGATTCGCCG CCAGCACGAC
CGCGCCGCGA AGGTGCCCAG CGACCTCGTC CAGCGCATCT CGGAGGCCAC CTCGAACGCG
CTGCCGGTCT GGAAGGAAGC CAAGGCCGAG GACGACTTCG AGATCTACGC CGACACGCTC
GAAGAGCTGG TCCAGCTCAA GCGCGAGTAC GCCGAGGCGA TCGATCCCGA CCGAGACCCC
TACGCGGTCC TGTTCGAGGA GTACGAACCG TACCTCGGGC TCGACACCGC CGAAGCGGCA
CTCGAAGACC TCCGCGACAC GCTCGTCCCG CTGATCGACG ACATCAAAGA CAGCGACGTG
ACGCTGGCCG ACCCCTTCGC CGGTGGCAGC TACGACGAGG CGTCACAGGA GGACCTCGTT
CGGTCGGCGC TTGACTACCT GGGCTACGAC TGGGACCGCG GGCGACTCGA CACTGCGCCA
CATCCCTTCT CGACCGGGAC GCAGTTCGAC GCCCGCGTGA CCACGCGGTT CGATCCCGAG
GATCCGCTGG GTGCGCTCAG TTCGACCATC CACGAGTTCG GCCACGCGAC GTACACGCTC
GGGCTCCCCG ACGAACACTA CGGGACGCCG CTGGGCGAGT CTCGAGACCT CTCGGTCCAC
GAGTCCCAGT CCCGACTCTG GGAGAACCAC GTCGGGCGTT CCCGGCCGTT CTGGGAGGGC
TTTGCCCCGA CTGTCGAGGA CCACCTCGCC ACGTCGGCCA CGCCCCGAGA GTACTACGAG
GCGGCCAACA CGGTCCACCC GGACAACTGC ATCCGCGTCG AGGCCGACGA ACTGACCTAC
CACATGCACA TCGTCCTGCG CTTCGAGATC GAGCGGGACC TGATCCACGG CGACCTCGAC
GTGAGCGAGG TACCGCAGGT CTGGAACGAC AAGATGGAGG AGTACCTCGG AGTCCGGCCC
GAGACCGACG CCGAGGGGTG CCTACAGGAC ATCCACTGGA GCCACGGCTC CTTCGGCTAC
TTCCCGACGT ACTCCCTGGG GTCGGTGCTC GCCGCACAAC TGTTCGCCGC CGCCGAAGAC
GACATCGGCG ATCTGGACGG ACAGCTCCGC GACGGCGAGT TCGACGACCT CCACGAGTGG
CTCACGGACA ACGTCCACAG CCACGGCGCA CGCTACGAGA CCGACGACCT CATCGAGGAA
GCGACCGGCG AGCCCTTCAC CGCCGACTAC TTCCTCGAAT ACGCCGAGTC GAAGTACCGT
GACCTGTACG ACTGCTATAG TAACAATTGA
 
Protein sequence
MATEADADDA ETPDTYEQFR AHVEQLTYVG DAAGVLQWDQ EVMMPDEGTP ARSKQSAALS 
TLSHDLLTDD DVAEWLDELE GSDLDPEREA VVREIRRQHD RAAKVPSDLV QRISEATSNA
LPVWKEAKAE DDFEIYADTL EELVQLKREY AEAIDPDRDP YAVLFEEYEP YLGLDTAEAA
LEDLRDTLVP LIDDIKDSDV TLADPFAGGS YDEASQEDLV RSALDYLGYD WDRGRLDTAP
HPFSTGTQFD ARVTTRFDPE DPLGALSSTI HEFGHATYTL GLPDEHYGTP LGESRDLSVH
ESQSRLWENH VGRSRPFWEG FAPTVEDHLA TSATPREYYE AANTVHPDNC IRVEADELTY
HMHIVLRFEI ERDLIHGDLD VSEVPQVWND KMEEYLGVRP ETDAEGCLQD IHWSHGSFGY
FPTYSLGSVL AAQLFAAAED DIGDLDGQLR DGEFDDLHEW LTDNVHSHGA RYETDDLIEE
ATGEPFTADY FLEYAESKYR DLYDCYSNN
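
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only, not the database's own method: the function names `clean`, `translate`, and `gc_fraction` are hypothetical, the codon table is the standard amino-acid assignment shared by NCBI translation table 11, and alternative start codons and ambiguous bases are not handled.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCAACGG AAGCCGACGC TGACGACGCA"
print(translate(first_blocks))  # MATEADADDA
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in this record.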