Gene Huta_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1094 
Symbol 
ID8383368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1068962 
End bp1071871 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content68% 
IMG OID644972155 
Productprotein of unknown function DUF214 
Protein accessionYP_003130006 
Protein GI257052173 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTCA TCGGCGTCTC GATCGCGTTT CTGACCGGGA GCGCGCTGCT TCTCGTGAGC 
GGGACCGAGC AGCTACAGAC GATCGCCGCT GACTTCGACA CGACGGGATA CGTGACCGGC
TACCGATCCG CCGAGCTGGC GGCCCGCGCT GGCCGTGATG CGGTCTTTCC CATCGCAACG
GTCCCGATCG ACGGCACGAA CGCGACCGTT CTGGGGGTTC CGCCGGGTGC CAACGAGACG
ATCGCCGCGG CGACGCCGGC GACCGGGCTC GCGAGCGCCC TCACGAACGG CGTGGCGACC
GCCGATGTCG GGGCAACGAC GGGTGTCGAC CAGGTCCGGA TCGCCGGCCG CCAGCACGTC
GAGATGATTC CCGACAGCTG GTATCTCGCC GAGCGTGGCC TCGTCGACCA ACTCGATCCG
GATGGCGCTG TCGTGATCGA CACCGGCACC GACGCGATCG AGCGGGCGAG CGACCGCGAT
GCGATCCCGC TGCGAGGGGC TGCACGCTTC TTCGTCAGTG GGGCCCAAGA AGCGCTGTCT
CTGTTCGGCG TGATCGTCGC CGGGGTGGCC GGCCTGGTGG GGGTCATCGT CTTCAGCGTC
ACCCGGATGA GCGTCCAGGA TCGGGCGAAG ACGATCCGGA TCGTCCGGGC GACGGGAGCC
ACGCCTCGTG CAGTCCGCGC CCTGTTCACG GCGCGAGCCA CGATCGTGAC GGTGGTCGGC
GTCGCGATCG GCTATGCGGT CGGACTCATC GCCGCTCGGG TCGCGGTCAA CGCTTCGGTC
TTTCTCGGCG TCCCTGTGTC GCTGGACCTC TCACTCTCCC GCCCGGCACT CGGCGTGTTG
GTCCCACTCT ACCTGGGAGT GGTTCTCTTG GGGGCCATCG CCGGGTATCT CGCGTCCCGG
CCGGTGAGTC GGATACCGCC CGCCGCGATC CGATCGAGCG GTGGCGGCGA TCCCACCACC
GGTGGGTGGA TTCGGGAGTG GATTCCAGGC TGGGTCGACC TGACGCTGCT TCGCTTTCGG
GCAGTGATTC CGACCATCGC GACGATCACT GTCTTTCTGA CGCTATTGGT CGTCCTCGTC
TCGACCGGCA CAGCGGTGGC CCCCATGGTC GATTCGGGGG ACGCGACGAT CGTTGAACCG
GGCTCCGTTC ATCCCGTCGC GAGCTCAGTC CCGGAGTCGT TCGCGACCGT CCTCGAGGAC
CGGGGGATCG ATGCGAGTCC GGAGATCTTG CTGTTTCCGA TCGTCGACGG GGAACCGACG
CTCGCCCGTG GCGTCGACTT CTCGTCGTTC GCGAACGTGT CCGGGGCATC GCTGGTGACC
GGCCGCGCCC CGCGGTCGGC GGACGAAGCG GTCGTCGGCG AGTCCCTCGC CGCCCGGCGA
AACCTCTCCG TCGGCGATAC CGTCCTGGTG GGCGGGAGCA CCCGATCGGC GTTCACGCGG
ACGACGATCG TCGGCAGTTA CGACGCGCCG GGGATCTACG AGAGTCATCT GCTTGTCCCG
TTGCCGACGG CCCGCGATCT CAGCACTCGC GCGCCGGGGC AAGTCCACGT CGTCCGAGCG
ACCCGGCTGC CGGCGGCCGG GAGCGGGATC GACGTCGTCG ACGTCTCCGC GCCGGCGACG
GTGGTGCGCA ACCGGTCGTT CCAGACCACC GTCACCGCCG TCAACGTCGG TCGGACGAAT
GCAACCCGAA CCGTGTCCAT CCGGGTCGCG AACACGTCTC GGAACGTGAC CCTGGCGATC
CCGCCGGGCG AACGAACCGA ACGGACGACG ACGCTGTCGG TCGCCCGACC CGGTATCTGG
TCGATTCGGG CCGGATCGGC GACTCAGTCG ATAGCGGTCC GGCAGCCGAA CGCCCTCCAG
GTTCGCTTCC CGTCGGCAGT TCGCGTCGGG GCTTCCCCAC GAGTCGCAGT TTCGACCGCC
GCCGGCGAGC CGGTCGACAA TGCGACGGTG ACACTCGGCA ATCGAACTGT GCAAACCGAT
TCGGCCGGTG TCGCCCGGAT CACGGTTCCG CCCGGCGCGG ACACGCTTAC CGTGACGGCC
GATAGCCGAA CCGTGACCGA GTCTGTCACG GCGATCAGTG ACCGGGTTGA CCGCGACGAT
CGGTCCGGCG GCGACGGTCG ACCGCTGGTT TCGGTCTCGA TCCAGCCCGA ATCCCCCGGA
TTCCGAGTCC AGCCGACCGC CCGGATACAC CTCGAAAACC CCTGGAACCG GACGGTCGCT
CCCGAGTTGA CGATCTCTGG GCCGACGAGC AGCCACGATC GGACAGTGTC ACTCGATCCC
GGGGAAACGA CGACAGTTTC GGCCCAGCTA TCGCGCAATC CACCGGGCGA GTACGACGTG
ACAGTGACAG ACGACACGGA CACTGAACTG GCCCGAACCA CGATGGTCGT GACGGGTAAC
GAGCGCCTCG TCGCTGCGCT TGCAACCCAC GGCGAGCGGG GGAGTACACC GTTCTCCCGC
GCCGTGTCGC TGGTGTTCGG GAACCTCACC CTGCTCGTCG GTGCCGTCGC CGGGCTCGGC
GCACTCATGA CTGTCGGCGG GCTGACTGCC ATCTTCTCCC GGGGTGTCCA CGCCAGACGG
CGGACGATCG GGATCTACCG GGCGACCGGT GCGACCCCGG GCCAGGTTTT CGTCCTCGTG
CTTCGGGACG CCGGCGTGAT CGGCACGGTT TCGTTGCTGG TGGCGTTCCC GCTCACCTAT
CTGCTCCTGG CGTGTCTCTC CTCGGCCGGC GTGCTTTCGG TCTTCGGCGT CGCCATCCAA
CCGGTGTTCG CCCCCTGGAT CGTAGTGCTC GGGACAGCCA TCGTGCTCGC GCTCGTCGGT
CTCGGTGCCG CTCTCGCCAC GGCAACGCTC GTTCGGACCG CCCCGGCGAG GACACTCCTC
GGCGAGCGAA CCGGAGGGAT CGAACGATGA
 
Protein sequence
MVVIGVSIAF LTGSALLLVS GTEQLQTIAA DFDTTGYVTG YRSAELAARA GRDAVFPIAT 
VPIDGTNATV LGVPPGANET IAAATPATGL ASALTNGVAT ADVGATTGVD QVRIAGRQHV
EMIPDSWYLA ERGLVDQLDP DGAVVIDTGT DAIERASDRD AIPLRGAARF FVSGAQEALS
LFGVIVAGVA GLVGVIVFSV TRMSVQDRAK TIRIVRATGA TPRAVRALFT ARATIVTVVG
VAIGYAVGLI AARVAVNASV FLGVPVSLDL SLSRPALGVL VPLYLGVVLL GAIAGYLASR
PVSRIPPAAI RSSGGGDPTT GGWIREWIPG WVDLTLLRFR AVIPTIATIT VFLTLLVVLV
STGTAVAPMV DSGDATIVEP GSVHPVASSV PESFATVLED RGIDASPEIL LFPIVDGEPT
LARGVDFSSF ANVSGASLVT GRAPRSADEA VVGESLAARR NLSVGDTVLV GGSTRSAFTR
TTIVGSYDAP GIYESHLLVP LPTARDLSTR APGQVHVVRA TRLPAAGSGI DVVDVSAPAT
VVRNRSFQTT VTAVNVGRTN ATRTVSIRVA NTSRNVTLAI PPGERTERTT TLSVARPGIW
SIRAGSATQS IAVRQPNALQ VRFPSAVRVG ASPRVAVSTA AGEPVDNATV TLGNRTVQTD
SAGVARITVP PGADTLTVTA DSRTVTESVT AISDRVDRDD RSGGDGRPLV SVSIQPESPG
FRVQPTARIH LENPWNRTVA PELTISGPTS SHDRTVSLDP GETTTVSAQL SRNPPGEYDV
TVTDDTDTEL ARTTMVVTGN ERLVAALATH GERGSTPFSR AVSLVFGNLT LLVGAVAGLG
ALMTVGGLTA IFSRGVHARR RTIGIYRATG ATPGQVFVLV LRDAGVIGTV SLLVAFPLTY
LLLACLSSAG VLSVFGVAIQ PVFAPWIVVL GTAIVLALVG LGAALATATL VRTAPARTLL
GERTGGIER