Gene Hmuk_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0031 
Symbol 
ID8409527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp26322 
End bp28106 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content70% 
IMG OID645018368 
Productpeptidase M3A and M3B thimet/oligopeptidase F 
Protein accessionYP_003175889 
Protein GI257386116 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0426313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTAC CCGACCACGC AGACAGAGAC GAACAGTACA CCTGGGAGAC CGGTCAGATC 
TTCGCGACGC CCGCAGAATG GGATCGGTAC CGAGCGGAGC TGCAGGCCGA CCTCGACGCG
GCCGATCCAC CCGCCGCCCC GATCGAATCG ACGGCCGCGG CGCGCGAGCT CGTCGACACG
GTCGCCGACT GGTACCAACG CATCCAGCGA CTCGAACTGT ACGCGACGCT CCGGGACTGC
ATCACGGACG ACCCGGCGGC GAGCGATCGA GGCGGGACCG CACGCAGCCT CTCGACGCGA
GTCGAGACGG CCGTCGGCGA GCACCTCGAC CGGCTGCACA GCACAGACGC CGTGACGCTC
GATCGGATCG AAGAGGCACT CGGAGAGCGA CGGCGCTACT TCGCATCGCT CCGTGCGCGG
GCCGCTCGCC GTCGATCGCC GGCGGTCGAG GCCGTCGTCG AGCAGTTCGC CGAACAGTGT
TCCAGCGCCG ACCGGATCGT CCGGGCCGTC AAGAACGACG ACTTCGACGC CCCGACCGTC
GAGACGCCCG ACGGCGACGA GCGAACCGTC ACGCTGGGCC GGTACGCGAC CGAGCTCGCC
GCTTCCGACC GCGACTATCG GCGGCGAGTG TACGAGTCCT CGCTGGATGG GCTCGCCACG
TTCGAGGGGA CGCTGGCGAC GGCCTACGAC GAGAAGCTCA CGGCCGCCAG CACGTCCGCC
GACGCCGTCG GCATCGACTC GATCCGGGAG CGACAGTTGA CGAATCGCTC GTACCCGGAG
GGCGGTATCG AGTTTCAGTT CCCGACGGCG CTGCACGACC GGCTGATCGA CGCCGTCGGC
GACGCGACGG GGCCCCGAGA GCGCGCTCGC GAGCGCCGCG CCCGACGGCT GGGGATCGAC
GCCGTTCGAC CGTGGGACAC CCAGGTCTCA GTCGCCGACG CCCCCGAACC CGAGATCGAG
TACGAGGCCG CCGTCGGACA CGTCGTCGAC GCCGTCGCAC CCCTCGGCGA GGCGTACCAG
GAGCGCGCCC GCCGGTTCTT CGCGCAACGG CGCGTCGACG TGTACGAGTG TGCGGACAAA
CGCAGCGACA TCCCCGCGTT CTGCCCCTCC TCGGCCGAAG ACGGTGCCTT CGTGCTCCTG
AACTTCCAGC GGGACGTGCG AACGGTGTTT CACCTCTGTC ACGAACTCGG CCACGCGCTC
CACGTCGAGC ACCACCGTGA GGGGCCCGCG ATGTACGCTA CCGGCCCGCG CCCGATCTCG
GAGGTGCCGA GCGTCCTCCA CGAGGTGTTG CTGACCGAGC ACCTCGCCCA GCAGAACGGA
CCACTGGCCG CCCACGCCCG CGAGCGACTG CTGCAGTCAC TCGAAATGCT GCTGTACGAG
CAGGCCGCGA ACGCGGCGTT CAAGCGTCGC CTCGCCGCGA CCGTCGACGG CGGGGAGCGC
CTCACCGCCG ACCGGATCGC CGACGCCTAC CGCGAGACAC TGGCGCGGTT CGACCCCGCG
CTCGAACCGT GTGACCGGAC ACGGTTCGAG TGGCTCACCG GGGCGCTGTT CCGCGACGCC
TTCCACCACT ACCAGTACGT TCTGGGTGCC GTCGGCGCGC TCCACGTCCG CGAGTCGCTC
CGAGACGGAC GCCTCGATCC CGCGACCTAC CGAGAGTTCC TCCGCTCGAC GGGACGAGAC
GACCCCGTGT CGCTGTTCGA ACGACTGGGC GTGGATCTGA CGACGAGTGC GCCCTACGAG
CGGGCGGCGC AGGCGTTCGA GGGATATCTG GATCGCTGGA CGTAG
 
Protein sequence
MSLPDHADRD EQYTWETGQI FATPAEWDRY RAELQADLDA ADPPAAPIES TAAARELVDT 
VADWYQRIQR LELYATLRDC ITDDPAASDR GGTARSLSTR VETAVGEHLD RLHSTDAVTL
DRIEEALGER RRYFASLRAR AARRRSPAVE AVVEQFAEQC SSADRIVRAV KNDDFDAPTV
ETPDGDERTV TLGRYATELA ASDRDYRRRV YESSLDGLAT FEGTLATAYD EKLTAASTSA
DAVGIDSIRE RQLTNRSYPE GGIEFQFPTA LHDRLIDAVG DATGPRERAR ERRARRLGID
AVRPWDTQVS VADAPEPEIE YEAAVGHVVD AVAPLGEAYQ ERARRFFAQR RVDVYECADK
RSDIPAFCPS SAEDGAFVLL NFQRDVRTVF HLCHELGHAL HVEHHREGPA MYATGPRPIS
EVPSVLHEVL LTEHLAQQNG PLAAHARERL LQSLEMLLYE QAANAAFKRR LAATVDGGER
LTADRIADAY RETLARFDPA LEPCDRTRFE WLTGALFRDA FHHYQYVLGA VGALHVRESL
RDGRLDPATY REFLRSTGRD DPVSLFERLG VDLTTSAPYE RAAQAFEGYL DRWT