Gene Information |
Locus tag | Hmuk_0031 |
Symbol | |
ID | 8409527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 26322 |
End bp | 28106 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645018368 |
Product | peptidase M3A and M3B thimet/oligopeptidase F |
Protein accession | YP_003175889 |
Protein GI | 257386116 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | |
|
|
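The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(26322, 28106)
print(length)                     # 1785
print(protein_length_aa(length))  # 594
```

Under those assumptions the values agree with the Gene Length (1785 bp) and Protein Length (594 aa) fields.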
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0426313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTAC CCGACCACGC AGACAGAGAC GAACAGTACA CCTGGGAGAC CGGTCAGATC TTCGCGACGC CCGCAGAATG GGATCGGTAC CGAGCGGAGC TGCAGGCCGA CCTCGACGCG GCCGATCCAC CCGCCGCCCC GATCGAATCG ACGGCCGCGG CGCGCGAGCT CGTCGACACG GTCGCCGACT GGTACCAACG CATCCAGCGA CTCGAACTGT ACGCGACGCT CCGGGACTGC ATCACGGACG ACCCGGCGGC GAGCGATCGA GGCGGGACCG CACGCAGCCT CTCGACGCGA GTCGAGACGG CCGTCGGCGA GCACCTCGAC CGGCTGCACA GCACAGACGC CGTGACGCTC GATCGGATCG AAGAGGCACT CGGAGAGCGA CGGCGCTACT TCGCATCGCT CCGTGCGCGG GCCGCTCGCC GTCGATCGCC GGCGGTCGAG GCCGTCGTCG AGCAGTTCGC CGAACAGTGT TCCAGCGCCG ACCGGATCGT CCGGGCCGTC AAGAACGACG ACTTCGACGC CCCGACCGTC GAGACGCCCG ACGGCGACGA GCGAACCGTC ACGCTGGGCC GGTACGCGAC CGAGCTCGCC GCTTCCGACC GCGACTATCG GCGGCGAGTG TACGAGTCCT CGCTGGATGG GCTCGCCACG TTCGAGGGGA CGCTGGCGAC GGCCTACGAC GAGAAGCTCA CGGCCGCCAG CACGTCCGCC GACGCCGTCG GCATCGACTC GATCCGGGAG CGACAGTTGA CGAATCGCTC GTACCCGGAG GGCGGTATCG AGTTTCAGTT CCCGACGGCG CTGCACGACC GGCTGATCGA CGCCGTCGGC GACGCGACGG GGCCCCGAGA GCGCGCTCGC GAGCGCCGCG CCCGACGGCT GGGGATCGAC GCCGTTCGAC CGTGGGACAC CCAGGTCTCA GTCGCCGACG CCCCCGAACC CGAGATCGAG TACGAGGCCG CCGTCGGACA CGTCGTCGAC GCCGTCGCAC CCCTCGGCGA GGCGTACCAG GAGCGCGCCC GCCGGTTCTT CGCGCAACGG CGCGTCGACG TGTACGAGTG TGCGGACAAA CGCAGCGACA TCCCCGCGTT CTGCCCCTCC TCGGCCGAAG ACGGTGCCTT CGTGCTCCTG AACTTCCAGC GGGACGTGCG AACGGTGTTT CACCTCTGTC ACGAACTCGG CCACGCGCTC CACGTCGAGC ACCACCGTGA GGGGCCCGCG ATGTACGCTA CCGGCCCGCG CCCGATCTCG GAGGTGCCGA GCGTCCTCCA CGAGGTGTTG CTGACCGAGC ACCTCGCCCA GCAGAACGGA CCACTGGCCG CCCACGCCCG CGAGCGACTG CTGCAGTCAC TCGAAATGCT GCTGTACGAG CAGGCCGCGA ACGCGGCGTT CAAGCGTCGC CTCGCCGCGA CCGTCGACGG CGGGGAGCGC CTCACCGCCG ACCGGATCGC CGACGCCTAC CGCGAGACAC TGGCGCGGTT CGACCCCGCG CTCGAACCGT GTGACCGGAC ACGGTTCGAG TGGCTCACCG GGGCGCTGTT CCGCGACGCC TTCCACCACT ACCAGTACGT TCTGGGTGCC GTCGGCGCGC TCCACGTCCG CGAGTCGCTC CGAGACGGAC GCCTCGATCC CGCGACCTAC CGAGAGTTCC TCCGCTCGAC GGGACGAGAC GACCCCGTGT CGCTGTTCGA ACGACTGGGC GTGGATCTGA CGACGAGTGC GCCCTACGAG CGGGCGGCGC AGGCGTTCGA GGGATATCTG GATCGCTGGA CGTAG
|
Protein sequence | MSLPDHADRD EQYTWETGQI FATPAEWDRY RAELQADLDA ADPPAAPIES TAAARELVDT VADWYQRIQR LELYATLRDC ITDDPAASDR GGTARSLSTR VETAVGEHLD RLHSTDAVTL DRIEEALGER RRYFASLRAR AARRRSPAVE AVVEQFAEQC SSADRIVRAV KNDDFDAPTV ETPDGDERTV TLGRYATELA ASDRDYRRRV YESSLDGLAT FEGTLATAYD EKLTAASTSA DAVGIDSIRE RQLTNRSYPE GGIEFQFPTA LHDRLIDAVG DATGPRERAR ERRARRLGID AVRPWDTQVS VADAPEPEIE YEAAVGHVVD AVAPLGEAYQ ERARRFFAQR RVDVYECADK RSDIPAFCPS SAEDGAFVLL NFQRDVRTVF HLCHELGHAL HVEHHREGPA MYATGPRPIS EVPSVLHEVL LTEHLAQQNG PLAAHARERL LQSLEMLLYE QAANAAFKRR LAATVDGGER LTADRIADAY RETLARFDPA LEPCDRTRFE WLTGALFRDA FHHYQYVLGA VGALHVRESL RDGRLDPATY REFLRSTGRD DPVSLFERLG VDLTTSAPYE RAAQAFEGYL DRWT
|
| |
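The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute a GC fraction, and translate the gene sequence is given below. The helper names are hypothetical and not part of any database API. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so a standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGTCACTAC CCGACCACGC AGACAGAGAC")
print(translate(prefix))  # MSLPDHADRD
```

Passing the full Gene sequence field through `clean_sequence` and `translate` should give a string that can be compared against the Protein sequence field once its spaces are removed in the same way.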