Gene Hmuk_0061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0061 
Symbol 
ID8409558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp60716 
End bp63835 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content71% 
IMG OID645018399 
Productprotein of unknown function DUF214 
Protein accessionYP_003175919 
Protein GI257386146 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.221236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTATC GAAACGCACT GCTGTTCCGG TGGTCACGAC GCGACCGACT GACGATCGTC 
GTCGTCGCGG TGACGGCCGC GTTCCTCGTC GGGACCGCGC TCCTGTTGTT CACGGCGACG
ACCTACTCGG AGACGTTCGC GGAGCCGCTG TCGAACGCCG GGACGATCAG CTACGAGACC
GCGGACGGCG ACCGGCCGGC ATCGACCGAG CGACGGGTCG TCCTCCCACT GACGACCGCC
TCGATCGACG GCAAGTCGGC TCCCGTCGTC GGGATCCCAC CGGACGCGCC GCGGGTGATC
CAGAACGGCT CGGCGTCGTG GCAACAGGGC CGCTTGCCAG CGATGCCGTC CGACGCCGAC
GCACGCGGTC CGGTGTCGAG ACAGCGGACC CGAACGCTGT CCGGGCCCGA CGGGCAGGTG
ACACTCTCAG TCGTCCCACG AGAGCGGAGC AACAGCTTCC TCCAGCCGAC GTGGTACGTC
GCCAACGCCT CGGTCGTCGA CGCCATCGGG ACGACGGGCT ATCTCGTGAT CGACCGAGAC
AGCGAAGCCG ACAGTGGGAA CGCGATCCCC GAGACCGGCG TGCCCCTCGT GAGCGCGCTC
CTGTACGTGC TGGGCGGGAT CGAACAGGTG CTCTGGGCGC TGAGCATCGC CGTCGCTGCG
GGTGGCCTCC TCGTGCTCGT CGTCGTCTAC AGCGTGACGC GCATGAGCGT CCGCGACCGG
ACCGAGGCGA TCAGCGTGAT CCGCTCGACC GGGGCACCGG GGTGGCACGT CGGCCTCCTG
TTTACTGCGC GGGCGGCGTT GCTGGTCGCG GTCGGCGTGG CCATCGGCTA TGCGGGCGGG
TTGATCGCGA TCAAGGCGAT CGTCAACGCC GCGGTCTACC TCGGGCTCCC GATCGCGCTC
GACGTGACCG TGACCGGAGG GAGCGTCGGC GTCGTGGGCG GCATCGCCGG ACTCCTCGTC
GGGATGGGTG TCGTCGCCGG TGCGATCGCG GCGTATCCGG CCGCCTCCCG CCCGCCAGCG
ACGCTCGGAC ACAGACGCGC CCGCCTGCAG TCGTCGACCG GAGCGTCCGG AGGGCGGCTG
GCCCGACTGC GGTCGATCCT GAAGCCGACG CTGCTCTCGT GGCGCTCGCT CGTTCCGACT
GCGGCGACGC TTTCCGTGTT CGCGCTGACG GTCCTGCTGG TCGTCGCGAT CGCCGGCCTG
GCCTCGCCAC TCGGCGGTGA CGCCGGGGGA ACGGGCACGA TCACGGAAGC CGACGCCCCG
CATCCGCTGA ACAGTCGCCT CGACGCCGAC TACGCCCGCG CGCTCACGGC GAGTGGCACG
CCGGCCAGCC CGGAGATCAT CTACGCGCAA GTGCGGGACG GCCAGCCCTA CATGGCCCAC
GGGGCCGACT ACGAGATGTT CGCCAACGTC ACGAACGCGA CGGTCGTCGA GGGGCGTACA
CCGGCGACCG CCGACGAGGC CGTCGTCGGG ACCGACCTGG CACGGACGCT CGACCTCTCG
GTCGGCGACA CGGTCACGCT CGGTGGCAGC GTCGCTCCCG GCGTCCGCCA GTTCGAGGTC
GTCGGCGCGT ACGACGCCCA CGGAACGCTC GACGACCTCC TCGTCGTCCC CCTGCGTTCC
TCGTGGGGAC TGGCCACGGC GCGAGGGCAG GTCCACATGA TCCGCGTGGC CGGTGACGTG
CCATCGGGTG CGGAGTCGGG AACACCCGTC GGCGGGGAGT CCACCGATCA GACGGGCCTC
GCGATAACGG AGTTCACGGG TCCAGAGACG GTCACGCAGG GCGAGAACAT CACGCTCTCC
GTGACGGTTC GGAACTTCGG CGACACGGCG GGCTCGCGGG CGGTGCCCGT CGAGTACGGG
AACCAGCGCG CAAACCGGAC GGTATCGGTG CCGGCAGGGG GACAGACGAC CGTAGAGGTG
ACTGTCGTCG CCGAGCAGAC CGGCGAGGTG CGGGCCCGGA CCGGCGAGTA CACGCACACC
GTGACCGTCG TTTCACCGAA CGCGATCCGG ATCCCCGCCG AACTACCCGG GACCGCGCCA
CCGGGGAGCG GCCTGTACGT ACCCGTCGTC GACGGTACCG GTGACCCGGT CACTGACGCC
GCAGTCACCG TCGACGGCGT GACGGTACAG ACCCGCGATG AAGGGGTGGC GGTGGTACCG
CTACCGCGGA CGGAGGGCAA CTACACGATC ACCGCACAGC ACGAGAACCG GACCGCGACG
CACGCCCTCC GGATCGTCGC CGGGAGCGAA CGACGGCTCT CCGGGCGGCT CGACGTGTCG
CCCCAGTCGG GCAATGCGCT GACGAGCCCG ACGGTGACGG TCGAGCTGGG GAATCCGTGG
CAACAACAGC TCACGCGGAC GATCACCGTC GTCGGACCGA CGGGGACTCG CGAGCGCCAG
GTCACCCTGT CTCCCGGGAA CGGGACCCGG AGCGAGTTCA CCGCCGCTGC GGGGGCTCGC
ACCCAGCCCG GTGAGTACGC GTTCCGGCTG AGCTCGAACG GGACCCAGCT GGCGACGGCC
GACTACACCG TGACCGGAGA CGAGCGACTG GCAGCAGCGG TCGCGAGCAG CGGCCAGTAC
GCCTCCGGAA CGACGATCGA GCGATCGGTC GAAGGCGTCT TCGGGAACGT CCAGCTCGTC
CTCGTCGCAC TCGTCGTCCT GGCCGGCCTG AGCACAGTCG GCAGCACGAC GGCGACGTTC
GCACAGGCCG TCCACGCGCG GCGACAATCG ATCGGGATCC ATCGATCGGT GGGCGCGACC
CACGGACAGA TCCTGCGCAT CGTCCTCGGA GATGTCGTGC GAATCGCCGT TCCGGCGGCA
CTGCTCGCAG TCGCCGTTGG CGTCGCCGCG ATGCTGGCAC TGAACCGGGC CGGCTGGCTC
GTCTTCTTCG GGTTCCGGCT GTCGACGCCG ACTCCCCCGC TCGTGCTCGT GGGACTGGCG
CTCGCGGGCG TCGGACTCGC GGTGCTTGGC GCGCTCGCCG CGACGGTACC GTACCTGACA
GCGTCGCCCG TCTCGCTGCT CCCGGCGGGC GACCGGGTGC AGCTGCCGAC CGCCGAACGC
GGACGGCAGT CGGACGGTCG CGAACAGCGG CCACCGCCCG ACGCGTCAGA CGACGACTGA
 
Protein sequence
MGYRNALLFR WSRRDRLTIV VVAVTAAFLV GTALLLFTAT TYSETFAEPL SNAGTISYET 
ADGDRPASTE RRVVLPLTTA SIDGKSAPVV GIPPDAPRVI QNGSASWQQG RLPAMPSDAD
ARGPVSRQRT RTLSGPDGQV TLSVVPRERS NSFLQPTWYV ANASVVDAIG TTGYLVIDRD
SEADSGNAIP ETGVPLVSAL LYVLGGIEQV LWALSIAVAA GGLLVLVVVY SVTRMSVRDR
TEAISVIRST GAPGWHVGLL FTARAALLVA VGVAIGYAGG LIAIKAIVNA AVYLGLPIAL
DVTVTGGSVG VVGGIAGLLV GMGVVAGAIA AYPAASRPPA TLGHRRARLQ SSTGASGGRL
ARLRSILKPT LLSWRSLVPT AATLSVFALT VLLVVAIAGL ASPLGGDAGG TGTITEADAP
HPLNSRLDAD YARALTASGT PASPEIIYAQ VRDGQPYMAH GADYEMFANV TNATVVEGRT
PATADEAVVG TDLARTLDLS VGDTVTLGGS VAPGVRQFEV VGAYDAHGTL DDLLVVPLRS
SWGLATARGQ VHMIRVAGDV PSGAESGTPV GGESTDQTGL AITEFTGPET VTQGENITLS
VTVRNFGDTA GSRAVPVEYG NQRANRTVSV PAGGQTTVEV TVVAEQTGEV RARTGEYTHT
VTVVSPNAIR IPAELPGTAP PGSGLYVPVV DGTGDPVTDA AVTVDGVTVQ TRDEGVAVVP
LPRTEGNYTI TAQHENRTAT HALRIVAGSE RRLSGRLDVS PQSGNALTSP TVTVELGNPW
QQQLTRTITV VGPTGTRERQ VTLSPGNGTR SEFTAAAGAR TQPGEYAFRL SSNGTQLATA
DYTVTGDERL AAAVASSGQY ASGTTIERSV EGVFGNVQLV LVALVVLAGL STVGSTTATF
AQAVHARRQS IGIHRSVGAT HGQILRIVLG DVVRIAVPAA LLAVAVGVAA MLALNRAGWL
VFFGFRLSTP TPPLVLVGLA LAGVGLAVLG ALAATVPYLT ASPVSLLPAG DRVQLPTAER
GRQSDGREQR PPPDASDDD