Gene Information |
Locus tag | Hmuk_0061 |
Symbol | |
ID | 8409558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 60716 |
End bp | 63835 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645018399 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003175919 |
Protein GI | 257386146 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
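The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and is not part of any database tool.

```python
# Hypothetical helper (not a database API): cross-check the record's length fields.
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    # Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    span = end_bp - start_bp + 1
    # Assumption: the last codon is a stop codon and yields no amino acid.
    codons = gene_length_bp // 3
    return {
        "span_matches": span == gene_length_bp,
        "whole_codons": gene_length_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields for Hmuk_0061.
result = check_cds_lengths(60716, 63835, 3120, 1039)
print(result)
```

For this record, 63835 - 60716 + 1 = 3120 bp, and 3120 / 3 = 1040 codons, which leaves 1039 amino acids once the stop codon is excluded.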
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.221236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCTATC GAAACGCACT GCTGTTCCGG TGGTCACGAC GCGACCGACT GACGATCGTC GTCGTCGCGG TGACGGCCGC GTTCCTCGTC GGGACCGCGC TCCTGTTGTT CACGGCGACG ACCTACTCGG AGACGTTCGC GGAGCCGCTG TCGAACGCCG GGACGATCAG CTACGAGACC GCGGACGGCG ACCGGCCGGC ATCGACCGAG CGACGGGTCG TCCTCCCACT GACGACCGCC TCGATCGACG GCAAGTCGGC TCCCGTCGTC GGGATCCCAC CGGACGCGCC GCGGGTGATC CAGAACGGCT CGGCGTCGTG GCAACAGGGC CGCTTGCCAG CGATGCCGTC CGACGCCGAC GCACGCGGTC CGGTGTCGAG ACAGCGGACC CGAACGCTGT CCGGGCCCGA CGGGCAGGTG ACACTCTCAG TCGTCCCACG AGAGCGGAGC AACAGCTTCC TCCAGCCGAC GTGGTACGTC GCCAACGCCT CGGTCGTCGA CGCCATCGGG ACGACGGGCT ATCTCGTGAT CGACCGAGAC AGCGAAGCCG ACAGTGGGAA CGCGATCCCC GAGACCGGCG TGCCCCTCGT GAGCGCGCTC CTGTACGTGC TGGGCGGGAT CGAACAGGTG CTCTGGGCGC TGAGCATCGC CGTCGCTGCG GGTGGCCTCC TCGTGCTCGT CGTCGTCTAC AGCGTGACGC GCATGAGCGT CCGCGACCGG ACCGAGGCGA TCAGCGTGAT CCGCTCGACC GGGGCACCGG GGTGGCACGT CGGCCTCCTG TTTACTGCGC GGGCGGCGTT GCTGGTCGCG GTCGGCGTGG CCATCGGCTA TGCGGGCGGG TTGATCGCGA TCAAGGCGAT CGTCAACGCC GCGGTCTACC TCGGGCTCCC GATCGCGCTC GACGTGACCG TGACCGGAGG GAGCGTCGGC GTCGTGGGCG GCATCGCCGG ACTCCTCGTC GGGATGGGTG TCGTCGCCGG TGCGATCGCG GCGTATCCGG CCGCCTCCCG CCCGCCAGCG ACGCTCGGAC ACAGACGCGC CCGCCTGCAG TCGTCGACCG GAGCGTCCGG AGGGCGGCTG GCCCGACTGC GGTCGATCCT GAAGCCGACG CTGCTCTCGT GGCGCTCGCT CGTTCCGACT GCGGCGACGC TTTCCGTGTT CGCGCTGACG GTCCTGCTGG TCGTCGCGAT CGCCGGCCTG GCCTCGCCAC TCGGCGGTGA CGCCGGGGGA ACGGGCACGA TCACGGAAGC CGACGCCCCG CATCCGCTGA ACAGTCGCCT CGACGCCGAC TACGCCCGCG CGCTCACGGC GAGTGGCACG CCGGCCAGCC CGGAGATCAT CTACGCGCAA GTGCGGGACG GCCAGCCCTA CATGGCCCAC GGGGCCGACT ACGAGATGTT CGCCAACGTC ACGAACGCGA CGGTCGTCGA GGGGCGTACA CCGGCGACCG CCGACGAGGC CGTCGTCGGG ACCGACCTGG CACGGACGCT CGACCTCTCG GTCGGCGACA CGGTCACGCT CGGTGGCAGC GTCGCTCCCG GCGTCCGCCA GTTCGAGGTC GTCGGCGCGT ACGACGCCCA CGGAACGCTC GACGACCTCC TCGTCGTCCC CCTGCGTTCC TCGTGGGGAC TGGCCACGGC GCGAGGGCAG GTCCACATGA TCCGCGTGGC CGGTGACGTG CCATCGGGTG CGGAGTCGGG AACACCCGTC GGCGGGGAGT CCACCGATCA GACGGGCCTC GCGATAACGG AGTTCACGGG TCCAGAGACG GTCACGCAGG GCGAGAACAT CACGCTCTCC 
GTGACGGTTC GGAACTTCGG CGACACGGCG GGCTCGCGGG CGGTGCCCGT CGAGTACGGG AACCAGCGCG CAAACCGGAC GGTATCGGTG CCGGCAGGGG GACAGACGAC CGTAGAGGTG ACTGTCGTCG CCGAGCAGAC CGGCGAGGTG CGGGCCCGGA CCGGCGAGTA CACGCACACC GTGACCGTCG TTTCACCGAA CGCGATCCGG ATCCCCGCCG AACTACCCGG GACCGCGCCA CCGGGGAGCG GCCTGTACGT ACCCGTCGTC GACGGTACCG GTGACCCGGT CACTGACGCC GCAGTCACCG TCGACGGCGT GACGGTACAG ACCCGCGATG AAGGGGTGGC GGTGGTACCG CTACCGCGGA CGGAGGGCAA CTACACGATC ACCGCACAGC ACGAGAACCG GACCGCGACG CACGCCCTCC GGATCGTCGC CGGGAGCGAA CGACGGCTCT CCGGGCGGCT CGACGTGTCG CCCCAGTCGG GCAATGCGCT GACGAGCCCG ACGGTGACGG TCGAGCTGGG GAATCCGTGG CAACAACAGC TCACGCGGAC GATCACCGTC GTCGGACCGA CGGGGACTCG CGAGCGCCAG GTCACCCTGT CTCCCGGGAA CGGGACCCGG AGCGAGTTCA CCGCCGCTGC GGGGGCTCGC ACCCAGCCCG GTGAGTACGC GTTCCGGCTG AGCTCGAACG GGACCCAGCT GGCGACGGCC GACTACACCG TGACCGGAGA CGAGCGACTG GCAGCAGCGG TCGCGAGCAG CGGCCAGTAC GCCTCCGGAA CGACGATCGA GCGATCGGTC GAAGGCGTCT TCGGGAACGT CCAGCTCGTC CTCGTCGCAC TCGTCGTCCT GGCCGGCCTG AGCACAGTCG GCAGCACGAC GGCGACGTTC GCACAGGCCG TCCACGCGCG GCGACAATCG ATCGGGATCC ATCGATCGGT GGGCGCGACC CACGGACAGA TCCTGCGCAT CGTCCTCGGA GATGTCGTGC GAATCGCCGT TCCGGCGGCA CTGCTCGCAG TCGCCGTTGG CGTCGCCGCG ATGCTGGCAC TGAACCGGGC CGGCTGGCTC GTCTTCTTCG GGTTCCGGCT GTCGACGCCG ACTCCCCCGC TCGTGCTCGT GGGACTGGCG CTCGCGGGCG TCGGACTCGC GGTGCTTGGC GCGCTCGCCG CGACGGTACC GTACCTGACA GCGTCGCCCG TCTCGCTGCT CCCGGCGGGC GACCGGGTGC AGCTGCCGAC CGCCGAACGC GGACGGCAGT CGGACGGTCG CGAACAGCGG CCACCGCCCG ACGCGTCAGA CGACGACTGA
Protein sequence | MGYRNALLFR WSRRDRLTIV VVAVTAAFLV GTALLLFTAT TYSETFAEPL SNAGTISYET ADGDRPASTE RRVVLPLTTA SIDGKSAPVV GIPPDAPRVI QNGSASWQQG RLPAMPSDAD ARGPVSRQRT RTLSGPDGQV TLSVVPRERS NSFLQPTWYV ANASVVDAIG TTGYLVIDRD SEADSGNAIP ETGVPLVSAL LYVLGGIEQV LWALSIAVAA GGLLVLVVVY SVTRMSVRDR TEAISVIRST GAPGWHVGLL FTARAALLVA VGVAIGYAGG LIAIKAIVNA AVYLGLPIAL DVTVTGGSVG VVGGIAGLLV GMGVVAGAIA AYPAASRPPA TLGHRRARLQ SSTGASGGRL ARLRSILKPT LLSWRSLVPT AATLSVFALT VLLVVAIAGL ASPLGGDAGG TGTITEADAP HPLNSRLDAD YARALTASGT PASPEIIYAQ VRDGQPYMAH GADYEMFANV TNATVVEGRT PATADEAVVG TDLARTLDLS VGDTVTLGGS VAPGVRQFEV VGAYDAHGTL DDLLVVPLRS SWGLATARGQ VHMIRVAGDV PSGAESGTPV GGESTDQTGL AITEFTGPET VTQGENITLS VTVRNFGDTA GSRAVPVEYG NQRANRTVSV PAGGQTTVEV TVVAEQTGEV RARTGEYTHT VTVVSPNAIR IPAELPGTAP PGSGLYVPVV DGTGDPVTDA AVTVDGVTVQ TRDEGVAVVP LPRTEGNYTI TAQHENRTAT HALRIVAGSE RRLSGRLDVS PQSGNALTSP TVTVELGNPW QQQLTRTITV VGPTGTRERQ VTLSPGNGTR SEFTAAAGAR TQPGEYAFRL SSNGTQLATA DYTVTGDERL AAAVASSGQY ASGTTIERSV EGVFGNVQLV LVALVVLAGL STVGSTTATF AQAVHARRQS IGIHRSVGAT HGQILRIVLG DVVRIAVPAA LLAVAVGVAA MLALNRAGWL VFFGFRLSTP TPPLVLVGLA LAGVGLAVLG ALAATVPYLT ASPVSLLPAG DRVQLPTAER GRQSDGREQR PPPDASDDD
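The Sequence fields relate to the GC content and Translation table fields: GC content is the fraction of G and C bases in the gene sequence, and the protein sequence is the codon-by-codon translation of the gene sequence. The sketch below illustrates both with plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. Since this gene begins with ATG, no alternative start handling is included, which is a simplification.

```python
from itertools import product

# Codon table built in TCAG order; elongation assignments are shared by
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-letter blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Unknown codons (for example, ones containing N) become 'X'.
    Simplification: an alternative start codon is NOT converted to M.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGGGCTATC GAAACGCACT GCTGTTCCGG"
print(translate(first_blocks))  # MGYRNALLFR, the first protein block
print(gc_fraction(first_blocks))
```

Passing the full gene sequence field to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be compared against values recomputed from the nucleotide sequence.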