Gene Hmuk_3199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3199 
Symbol 
ID8412752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3087043 
End bp3089124 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content66% 
IMG OID645021544 
ProductATP-dependent protease Lon 
Protein accessionYP_003179009 
Protein GI257389236 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.11002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ATACCGAGAT CGACGACACT CCCGACGACG AGCGGGAGGT GACCGGGACG 
CCGCCCGAAG ACGACGACCA CAGCGCGGCC GACGACGCTG GTGACACTGT GGAGAGTGGC
GGGGCCGAAG GTGGCGGCCT CGACGACCTC GGCAGTGAGG TCGAAGTCGA GAGCGGCGCG
TCCCTCGACG ACGAAGACGG GCTCCTCGGT GGGCTCGGCA TCGAGTCGAC CAGCGACATC
GAAGTCCCCG ACCGACTCGT CGATCAGGTC ATCGGGCAGG ACCACGCCCG TGACGTCATC
AAAAAGGCAG CCAAACAGCG CCGTCACGTG ATGATGATCG GCTCGCCCGG GACGGGCAAG
TCGATGCTGG CCAAAGCGAT GAGCCAACTG CTCCCGAAAG AGGAACTACA GGACGTTCTG
GTCTACCACA ACCCGGACGA CGGAAACGAG CCGAAGGTTC GAACCGTCCC CGCCGGGAAG
GGCGAACAGA TCGTCGACGC CCACAAGGAG GAGGCCCGCA AGCGAAACCA GATGCGGACC
TTCCTGATGT GGATCATCAT CGCCATCATC ATCGGTTACG CGCTGATCAT CCAGGGGTCG
GTCCTGCTGG GGATCCTCGC GGCCGGTGTC ATCTACCTGG CCTTCCGCTA TGGCTCGCGT
GGCAACGACT CGATGATCCC GAACCTGCTG GTCAACCGCG CGAACCAGAC CACCGCGCCC
TTCGAGGACG CGACCGGTGC CCACGCCGGT GCGCTGCTGG GCGACGTTCG CCACGACCCG
TTCCAGTCCG GTGGGATGGA GACGCCCAGC CACGACCGCG TCGAGGCCGG CGCGATCCAC
AAGGCCAACA AGGGCGTCCT GTTCGTCGAC GAGATCAACA CGCTGGACAT CCGCAGCCAG
CAGAAACTCA TGACCGCGAT CCAGGAGGGC GAGTTCTCCA TCACTGGCCA GTCCGAGCGC
TCCTCGGGCG CGATGGTCCA GACCGAGCCC GTCCCCTGTG ATTTCATCAT GATCGCGGCG
GGGAACCTCG ACGCCATGGA GAACATGCAC CCCGCGCTGC GCTCCCGGAT CAAGGGGTAC
GGCTACGAGG TGTACATGGA CGACACCATC GAGGACACCC CGGAGATGCG CCGGAAGCTC
ACCCGCTTCA TCGCACAGGA AGTCGAGAAC GACGGGCGAC TCCCCCACTT CGACCGCGAG
GCCGTCGAGG AGATCATCCT CGAAGCCCGC CGCCGCGCGG GCCGCAAAGG GCACTTGACC
CTGAAACTGC GCGAACTGGG CGGACTGGTC CGCGTCGCGG GCGACATCGC CCGCGCGGAG
GATCAGGACG TGACCACTCG CGAGGACGTG CTCCAGGCCA AAGGCCGGAG TCGCTCCATC
GAGCAACAGC TCGCAGACGA CTACATCGAA CGCCGCAAGG ACTACGAACT GACCGTCAAC
GAGGGCGACG TGGTCGGCCG CGTCAACGGT CTGGCCGTGA TGGGCGAGGA CAGCGGGATC
GTCCTCCCGG TCATGGCGGA GGTCACGCCC TCGCAGGGTC CCGGTCAGGT GATCGCCACC
GGCCAGCTCA AGGAGATGGC CGAGGAGGCA GTCCAGAACG TCTCGGCGAT CATCAAGAAG
TTCTCGGACG AGGACATCTC CGAGAAGGAC GTTCACATCC AGTTCGTTCA GGCCGGCGAG
GGCGGTGTCG ACGGCGACTC CGCCTCGATC ACGGTCGCGA CGGCGGTCAT CTCCGCGCTG
GAGAACGTTC CGATCGAGCA GAACCTCGCG ATGACCGGCT CGCTGTCGGT GCGGGGCGAC
GTGTTGCCCG TCGGCGGCGT GACCCACAAG ATCGAGGCCG CGGCCAAGTC CGGGCTCGAT
ACGGTCATCA TCCCCGAGGC AAACACCCAG GACGTGATGA TCGAAGAGGA GTACGAGGAG
ATGATCGAGA TCGTCCCGGT CTCGCACATC TCCGAGGTGC TGGAAGTGGC CCTGGCCGGC
GAGCCCGAGA AGGACTCGCT GGTCGACCGG CTCAAGTCCA TCACCGGCAA AGCGCTCGAA
CGCGAAGTGG GCCAGCAGAG CGGCTCACCC AGCCCGCAGT AG
 
Protein sequence
MSDNTEIDDT PDDEREVTGT PPEDDDHSAA DDAGDTVESG GAEGGGLDDL GSEVEVESGA 
SLDDEDGLLG GLGIESTSDI EVPDRLVDQV IGQDHARDVI KKAAKQRRHV MMIGSPGTGK
SMLAKAMSQL LPKEELQDVL VYHNPDDGNE PKVRTVPAGK GEQIVDAHKE EARKRNQMRT
FLMWIIIAII IGYALIIQGS VLLGILAAGV IYLAFRYGSR GNDSMIPNLL VNRANQTTAP
FEDATGAHAG ALLGDVRHDP FQSGGMETPS HDRVEAGAIH KANKGVLFVD EINTLDIRSQ
QKLMTAIQEG EFSITGQSER SSGAMVQTEP VPCDFIMIAA GNLDAMENMH PALRSRIKGY
GYEVYMDDTI EDTPEMRRKL TRFIAQEVEN DGRLPHFDRE AVEEIILEAR RRAGRKGHLT
LKLRELGGLV RVAGDIARAE DQDVTTREDV LQAKGRSRSI EQQLADDYIE RRKDYELTVN
EGDVVGRVNG LAVMGEDSGI VLPVMAEVTP SQGPGQVIAT GQLKEMAEEA VQNVSAIIKK
FSDEDISEKD VHIQFVQAGE GGVDGDSASI TVATAVISAL ENVPIEQNLA MTGSLSVRGD
VLPVGGVTHK IEAAAKSGLD TVIIPEANTQ DVMIEEEYEE MIEIVPVSHI SEVLEVALAG
EPEKDSLVDR LKSITGKALE REVGQQSGSP SPQ