Gene Huta_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1151 
Symbol 
ID8383426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1122550 
End bp1124589 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content67% 
IMG OID644972210 
Productprotein of unknown function DUF1680 
Protein accessionYP_003130060 
Protein GI257052227 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACC CACGGGAGTC GCCCTCGATC GGTGACGTCA CGATCGACGA CGAGTTCTTC 
CGGCCGCGAC GCGAGGTCAA CCGCGAGGTC ACGATCGAAT ACCAGTACGA GCAACTGGAG
GCGGCGGGCA CCCTCGACAA TTTCCGGCGG GTGCGCGACG GCGAACGCGG CGGGCACAGC
GGCATGTGGT TCCAGGATTC GGATGCCTAC AAGTGGATCG AGGCGGCGAG TTACGTCCTT
GCCTCGCGGG ACGACGACGA CCTGGAAGCC CGCGTCGACG AGGTGATCGA TCTGATCGCC
GATGCCCAGG GGGCCGACGG CTACCTCAAC ACCTACTTCG ACCTGGAGGT ACCCGACAAA
CGCTGGAGCA ACCTCAACAC GATGCACGAA CTGTACTGCG GGGGCCACCT GATCGAGGCC
GCGGTCGCGC ACTACCGCGC GACCGGCAAG GAGACCCTGC TCGACGTGGC GCGGGACTTC
GCCGATCACG TCGACGAGAT GTTCCCCGAG CAGATCGACG GCGCGCCGGG CCACCAGGAG
ATCGAACTCG CGCTGATCAA ACTCGCTGCC GTGACCGACG AGGACCGGTA TCGCGACCTC
GCGGCATACT TCGTCGACGT GCGCGGGACG ACCGATCGCT TCGCGTGGGA GAACGACCAC
CGCGAGGAGA TCGCGACGCT CGAGGAGTGG GAGGGCGACG AGGGCGGTGC GGAGGGTGAC
GGTGAGATCG GGGCGGACGA CGAAGACGCC GACAGCGATG GCGACGCCGA GGAGGGCAAT
GACGACGCTG AGGGAGGTAA TGACGACGCT GCGGAGGAGA ACGACGAGTG GGAGGACGCC
GAGGTCGAAC CGTACGACGC GAGTTACAAC CAGGCCCACG CCCCATTGCG GGAGCAGGAC
GCCGTCGAGG GCCACGCCGT CCGGGCGATG TACTACTTCG CCGGCGCGAC CGACGTCGCC
GCCGAGACGG GCGACGATGG TCTGCTCGCC CACCTCGATG ACCTCTGGGA GAACATGACG
ACGCGACGGA TGTACGTCAC CGGTGGGATC GGTTCCCAGC ACCCCGGCGA GCGCTTCACG
ATCGATTATC ACCTGCCGAA CGAGACGGCC TACGCCGAGA CGTGCGCGGC GATCGGGAGC
ATCTTCTGGA ACCAGCGCCT GTTCGAAGCG ACTGGCGACG CAAAGTACAC TGATCTGATC
GAGTGGACGC TGTATAATGC CGTACTGCCG GGCGTCTCGC TCGACGGGCG GGAGTTCTTC
TACGACAACC CGCTCGCGAG TGACGGTGAC AGCCACCGGG AGGGGTGGTT CGACTGTGCG
TGCTGTCCGC CGAACGTCGC GCGATTGCTC GCCTCGCTCG AACGGTATCT CTACGCCACC
GACGACGAGG CGCTCTACGT CAACCAGTAC GTCGGCGGGA GGGCCGAACT ATCGGTCGCT
GGGACGGCCG TCTCGATCAG CCAGGTTTCC GATCTACCCT GGGAGGGCAG CGTCACGCTC
GACATCGACG CCGCGGAACC AGCCACGTTC GCCCTCCGGC TCCGGGTGCC CGGCTGGGCC
GAGGATGTCT CGATCGCGGT CGACGGCGAG GCGGTCGACA CGGCCGTCGA CGCCGCGGAC
GCGCCGACGT ACGTCACGCT CGACCGGGAG TGGGCCGACG CCGAGATCAG CGTCGAGTTC
GGGATGTCCG TCGAGGTGTT CGAAGCCCAC CCCGACGTCG CGGCCGACGC CGGCCGCGTG
GCACTGACGC GCGGCCCGCT GGTGTACTGT CTGGAGGGCG TCGATCACGA CCGCCCGCTC
CACCAGTACG CGATCGATCC CACGACCGAT TTCGCGGCGA CTCATCGCGA AGACGTGCTT
GACGGAGTTA CCGTCCTCGA CGGCGAGGCC ACGGTGCCGT CACTCGACGG CTGGGACGAT
GAACTCTACC GGCCCGCTGT CGAAACTGCG ACGGAGAACG TGTCGATCAC CGCCGTCCCC
TACTACGCCT GGGACAACCG CGAGCCCGGC GAGATGGCTG TCTGGGTGCG AGAAGCGTAG
 
Protein sequence
MNDPRESPSI GDVTIDDEFF RPRREVNREV TIEYQYEQLE AAGTLDNFRR VRDGERGGHS 
GMWFQDSDAY KWIEAASYVL ASRDDDDLEA RVDEVIDLIA DAQGADGYLN TYFDLEVPDK
RWSNLNTMHE LYCGGHLIEA AVAHYRATGK ETLLDVARDF ADHVDEMFPE QIDGAPGHQE
IELALIKLAA VTDEDRYRDL AAYFVDVRGT TDRFAWENDH REEIATLEEW EGDEGGAEGD
GEIGADDEDA DSDGDAEEGN DDAEGGNDDA AEENDEWEDA EVEPYDASYN QAHAPLREQD
AVEGHAVRAM YYFAGATDVA AETGDDGLLA HLDDLWENMT TRRMYVTGGI GSQHPGERFT
IDYHLPNETA YAETCAAIGS IFWNQRLFEA TGDAKYTDLI EWTLYNAVLP GVSLDGREFF
YDNPLASDGD SHREGWFDCA CCPPNVARLL ASLERYLYAT DDEALYVNQY VGGRAELSVA
GTAVSISQVS DLPWEGSVTL DIDAAEPATF ALRLRVPGWA EDVSIAVDGE AVDTAVDAAD
APTYVTLDRE WADAEISVEF GMSVEVFEAH PDVAADAGRV ALTRGPLVYC LEGVDHDRPL
HQYAIDPTTD FAATHREDVL DGVTVLDGEA TVPSLDGWDD ELYRPAVETA TENVSITAVP
YYAWDNREPG EMAVWVREA