Gene Information |
Locus tag | Huta_1094 |
Symbol | |
ID | 8383368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1068962 |
End bp | 1071871 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644972155 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003130006 |
Protein GI | 257052173 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTCA TCGGCGTCTC GATCGCGTTT CTGACCGGGA GCGCGCTGCT TCTCGTGAGC GGGACCGAGC AGCTACAGAC GATCGCCGCT GACTTCGACA CGACGGGATA CGTGACCGGC TACCGATCCG CCGAGCTGGC GGCCCGCGCT GGCCGTGATG CGGTCTTTCC CATCGCAACG GTCCCGATCG ACGGCACGAA CGCGACCGTT CTGGGGGTTC CGCCGGGTGC CAACGAGACG ATCGCCGCGG CGACGCCGGC GACCGGGCTC GCGAGCGCCC TCACGAACGG CGTGGCGACC GCCGATGTCG GGGCAACGAC GGGTGTCGAC CAGGTCCGGA TCGCCGGCCG CCAGCACGTC GAGATGATTC CCGACAGCTG GTATCTCGCC GAGCGTGGCC TCGTCGACCA ACTCGATCCG GATGGCGCTG TCGTGATCGA CACCGGCACC GACGCGATCG AGCGGGCGAG CGACCGCGAT GCGATCCCGC TGCGAGGGGC TGCACGCTTC TTCGTCAGTG GGGCCCAAGA AGCGCTGTCT CTGTTCGGCG TGATCGTCGC CGGGGTGGCC GGCCTGGTGG GGGTCATCGT CTTCAGCGTC ACCCGGATGA GCGTCCAGGA TCGGGCGAAG ACGATCCGGA TCGTCCGGGC GACGGGAGCC ACGCCTCGTG CAGTCCGCGC CCTGTTCACG GCGCGAGCCA CGATCGTGAC GGTGGTCGGC GTCGCGATCG GCTATGCGGT CGGACTCATC GCCGCTCGGG TCGCGGTCAA CGCTTCGGTC TTTCTCGGCG TCCCTGTGTC GCTGGACCTC TCACTCTCCC GCCCGGCACT CGGCGTGTTG GTCCCACTCT ACCTGGGAGT GGTTCTCTTG GGGGCCATCG CCGGGTATCT CGCGTCCCGG CCGGTGAGTC GGATACCGCC CGCCGCGATC CGATCGAGCG GTGGCGGCGA TCCCACCACC GGTGGGTGGA TTCGGGAGTG GATTCCAGGC TGGGTCGACC TGACGCTGCT TCGCTTTCGG GCAGTGATTC CGACCATCGC GACGATCACT GTCTTTCTGA CGCTATTGGT CGTCCTCGTC TCGACCGGCA CAGCGGTGGC CCCCATGGTC GATTCGGGGG ACGCGACGAT CGTTGAACCG GGCTCCGTTC ATCCCGTCGC GAGCTCAGTC CCGGAGTCGT TCGCGACCGT CCTCGAGGAC CGGGGGATCG ATGCGAGTCC GGAGATCTTG CTGTTTCCGA TCGTCGACGG GGAACCGACG CTCGCCCGTG GCGTCGACTT CTCGTCGTTC GCGAACGTGT CCGGGGCATC GCTGGTGACC GGCCGCGCCC CGCGGTCGGC GGACGAAGCG GTCGTCGGCG AGTCCCTCGC CGCCCGGCGA AACCTCTCCG TCGGCGATAC CGTCCTGGTG GGCGGGAGCA CCCGATCGGC GTTCACGCGG ACGACGATCG TCGGCAGTTA CGACGCGCCG GGGATCTACG AGAGTCATCT GCTTGTCCCG TTGCCGACGG CCCGCGATCT CAGCACTCGC GCGCCGGGGC AAGTCCACGT CGTCCGAGCG ACCCGGCTGC CGGCGGCCGG GAGCGGGATC GACGTCGTCG ACGTCTCCGC GCCGGCGACG GTGGTGCGCA ACCGGTCGTT CCAGACCACC GTCACCGCCG TCAACGTCGG TCGGACGAAT GCAACCCGAA CCGTGTCCAT CCGGGTCGCG AACACGTCTC GGAACGTGAC CCTGGCGATC CCGCCGGGCG AACGAACCGA ACGGACGACG ACGCTGTCGG TCGCCCGACC CGGTATCTGG 
TCGATTCGGG CCGGATCGGC GACTCAGTCG ATAGCGGTCC GGCAGCCGAA CGCCCTCCAG GTTCGCTTCC CGTCGGCAGT TCGCGTCGGG GCTTCCCCAC GAGTCGCAGT TTCGACCGCC GCCGGCGAGC CGGTCGACAA TGCGACGGTG ACACTCGGCA ATCGAACTGT GCAAACCGAT TCGGCCGGTG TCGCCCGGAT CACGGTTCCG CCCGGCGCGG ACACGCTTAC CGTGACGGCC GATAGCCGAA CCGTGACCGA GTCTGTCACG GCGATCAGTG ACCGGGTTGA CCGCGACGAT CGGTCCGGCG GCGACGGTCG ACCGCTGGTT TCGGTCTCGA TCCAGCCCGA ATCCCCCGGA TTCCGAGTCC AGCCGACCGC CCGGATACAC CTCGAAAACC CCTGGAACCG GACGGTCGCT CCCGAGTTGA CGATCTCTGG GCCGACGAGC AGCCACGATC GGACAGTGTC ACTCGATCCC GGGGAAACGA CGACAGTTTC GGCCCAGCTA TCGCGCAATC CACCGGGCGA GTACGACGTG ACAGTGACAG ACGACACGGA CACTGAACTG GCCCGAACCA CGATGGTCGT GACGGGTAAC GAGCGCCTCG TCGCTGCGCT TGCAACCCAC GGCGAGCGGG GGAGTACACC GTTCTCCCGC GCCGTGTCGC TGGTGTTCGG GAACCTCACC CTGCTCGTCG GTGCCGTCGC CGGGCTCGGC GCACTCATGA CTGTCGGCGG GCTGACTGCC ATCTTCTCCC GGGGTGTCCA CGCCAGACGG CGGACGATCG GGATCTACCG GGCGACCGGT GCGACCCCGG GCCAGGTTTT CGTCCTCGTG CTTCGGGACG CCGGCGTGAT CGGCACGGTT TCGTTGCTGG TGGCGTTCCC GCTCACCTAT CTGCTCCTGG CGTGTCTCTC CTCGGCCGGC GTGCTTTCGG TCTTCGGCGT CGCCATCCAA CCGGTGTTCG CCCCCTGGAT CGTAGTGCTC GGGACAGCCA TCGTGCTCGC GCTCGTCGGT CTCGGTGCCG CTCTCGCCAC GGCAACGCTC GTTCGGACCG CCCCGGCGAG GACACTCCTC GGCGAGCGAA CCGGAGGGAT CGAACGATGA
|
Protein sequence | MVVIGVSIAF LTGSALLLVS GTEQLQTIAA DFDTTGYVTG YRSAELAARA GRDAVFPIAT VPIDGTNATV LGVPPGANET IAAATPATGL ASALTNGVAT ADVGATTGVD QVRIAGRQHV EMIPDSWYLA ERGLVDQLDP DGAVVIDTGT DAIERASDRD AIPLRGAARF FVSGAQEALS LFGVIVAGVA GLVGVIVFSV TRMSVQDRAK TIRIVRATGA TPRAVRALFT ARATIVTVVG VAIGYAVGLI AARVAVNASV FLGVPVSLDL SLSRPALGVL VPLYLGVVLL GAIAGYLASR PVSRIPPAAI RSSGGGDPTT GGWIREWIPG WVDLTLLRFR AVIPTIATIT VFLTLLVVLV STGTAVAPMV DSGDATIVEP GSVHPVASSV PESFATVLED RGIDASPEIL LFPIVDGEPT LARGVDFSSF ANVSGASLVT GRAPRSADEA VVGESLAARR NLSVGDTVLV GGSTRSAFTR TTIVGSYDAP GIYESHLLVP LPTARDLSTR APGQVHVVRA TRLPAAGSGI DVVDVSAPAT VVRNRSFQTT VTAVNVGRTN ATRTVSIRVA NTSRNVTLAI PPGERTERTT TLSVARPGIW SIRAGSATQS IAVRQPNALQ VRFPSAVRVG ASPRVAVSTA AGEPVDNATV TLGNRTVQTD SAGVARITVP PGADTLTVTA DSRTVTESVT AISDRVDRDD RSGGDGRPLV SVSIQPESPG FRVQPTARIH LENPWNRTVA PELTISGPTS SHDRTVSLDP GETTTVSAQL SRNPPGEYDV TVTDDTDTEL ARTTMVVTGN ERLVAALATH GERGSTPFSR AVSLVFGNLT LLVGAVAGLG ALMTVGGLTA IFSRGVHARR RTIGIYRATG ATPGQVFVLV LRDAGVIGTV SLLVAFPLTY LLLACLSSAG VLSVFGVAIQ PVFAPWIVVL GTAIVLALVG LGAALATATL VRTAPARTLL GERTGGIER
|
| |
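The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1068962
END_BP = 1071871
REPORTED_GENE_LENGTH = 2910      # bp
REPORTED_PROTEIN_LENGTH = 969    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates. The strand field does not affect the arithmetic, since the length of a feature is the same on either strand.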