Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0281 |
Symbol | |
ID | 8409779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 275649 |
End bp | 277178 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645018606 |
Product | Carboxypeptidase Taq |
Protein accession | YP_003176125 |
Protein GI | 257386352 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.053767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.142613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGG AAGCCGACGC TGACGACGCA GAGACGCCCG ACACCTACGA GCAGTTCCGC GCTCACGTCG AGCAGCTGAC CTACGTCGGC GACGCCGCCG GTGTCCTCCA GTGGGATCAG GAAGTGATGA TGCCCGACGA GGGGACACCC GCCCGCTCGA AGCAGTCGGC GGCGCTGTCG ACGCTCTCGC ACGACCTCCT GACCGACGAC GACGTGGCCG AGTGGCTGGA CGAACTGGAG GGATCGGACC TCGATCCCGA GCGGGAAGCG GTCGTCCGCG AGATTCGCCG CCAGCACGAC CGCGCCGCGA AGGTGCCCAG CGACCTCGTC CAGCGCATCT CGGAGGCCAC CTCGAACGCG CTGCCGGTCT GGAAGGAAGC CAAGGCCGAG GACGACTTCG AGATCTACGC CGACACGCTC GAAGAGCTGG TCCAGCTCAA GCGCGAGTAC GCCGAGGCGA TCGATCCCGA CCGAGACCCC TACGCGGTCC TGTTCGAGGA GTACGAACCG TACCTCGGGC TCGACACCGC CGAAGCGGCA CTCGAAGACC TCCGCGACAC GCTCGTCCCG CTGATCGACG ACATCAAAGA CAGCGACGTG ACGCTGGCCG ACCCCTTCGC CGGTGGCAGC TACGACGAGG CGTCACAGGA GGACCTCGTT CGGTCGGCGC TTGACTACCT GGGCTACGAC TGGGACCGCG GGCGACTCGA CACTGCGCCA CATCCCTTCT CGACCGGGAC GCAGTTCGAC GCCCGCGTGA CCACGCGGTT CGATCCCGAG GATCCGCTGG GTGCGCTCAG TTCGACCATC CACGAGTTCG GCCACGCGAC GTACACGCTC GGGCTCCCCG ACGAACACTA CGGGACGCCG CTGGGCGAGT CTCGAGACCT CTCGGTCCAC GAGTCCCAGT CCCGACTCTG GGAGAACCAC GTCGGGCGTT CCCGGCCGTT CTGGGAGGGC TTTGCCCCGA CTGTCGAGGA CCACCTCGCC ACGTCGGCCA CGCCCCGAGA GTACTACGAG GCGGCCAACA CGGTCCACCC GGACAACTGC ATCCGCGTCG AGGCCGACGA ACTGACCTAC CACATGCACA TCGTCCTGCG CTTCGAGATC GAGCGGGACC TGATCCACGG CGACCTCGAC GTGAGCGAGG TACCGCAGGT CTGGAACGAC AAGATGGAGG AGTACCTCGG AGTCCGGCCC GAGACCGACG CCGAGGGGTG CCTACAGGAC ATCCACTGGA GCCACGGCTC CTTCGGCTAC TTCCCGACGT ACTCCCTGGG GTCGGTGCTC GCCGCACAAC TGTTCGCCGC CGCCGAAGAC GACATCGGCG ATCTGGACGG ACAGCTCCGC GACGGCGAGT TCGACGACCT CCACGAGTGG CTCACGGACA ACGTCCACAG CCACGGCGCA CGCTACGAGA CCGACGACCT CATCGAGGAA GCGACCGGCG AGCCCTTCAC CGCCGACTAC TTCCTCGAAT ACGCCGAGTC GAAGTACCGT GACCTGTACG ACTGCTATAG TAACAATTGA
|
Protein sequence | MATEADADDA ETPDTYEQFR AHVEQLTYVG DAAGVLQWDQ EVMMPDEGTP ARSKQSAALS TLSHDLLTDD DVAEWLDELE GSDLDPEREA VVREIRRQHD RAAKVPSDLV QRISEATSNA LPVWKEAKAE DDFEIYADTL EELVQLKREY AEAIDPDRDP YAVLFEEYEP YLGLDTAEAA LEDLRDTLVP LIDDIKDSDV TLADPFAGGS YDEASQEDLV RSALDYLGYD WDRGRLDTAP HPFSTGTQFD ARVTTRFDPE DPLGALSSTI HEFGHATYTL GLPDEHYGTP LGESRDLSVH ESQSRLWENH VGRSRPFWEG FAPTVEDHLA TSATPREYYE AANTVHPDNC IRVEADELTY HMHIVLRFEI ERDLIHGDLD VSEVPQVWND KMEEYLGVRP ETDAEGCLQD IHWSHGSFGY FPTYSLGSVL AAQLFAAAED DIGDLDGQLR DGEFDDLHEW LTDNVHSHGA RYETDDLIEE ATGEPFTADY FLEYAESKYR DLYDCYSNN
|
| |