Gene Huta_0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0458 
Symbol 
ID8382725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp452888 
End bp455485 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content66% 
IMG OID644971520 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_003129378 
Protein GI257051545 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCGG CCCTGGGACC GCCGGCGAAG ATGACCGACC GGCGCGAGGA TCTCACGCCG 
ATGTTGCGCC AGTACGTCGA GTTGACCGAG CGCTACGACG ACGCCCTGGT GCTCTTTCAG
TCGGGTGACT TCTACAAGGG ATTTTGCGAG GCTGCCGAGG TGCTTGCACG GATCTGTGAG
GTGACGCTGA CCGAACGCGA GGATTCGACC GGCACCTACG CGATGACGGG TGTGCCGATC
GACAACGCCG AATCCTATAT CGAGAAGTTA CTGGACGCGG GCTACCGCGT GGCGATTGCC
GACCAGGTCG AGGACCCCGA CGAGGTCAGT GGCGTCGTCG AGCGCGCGGT CACGCGGATC
ATCACGCCAG GGACACTCAC CGAGGACGAA CTGCTCGGTG GTGCCGAGAA CAACTACGTC
GCCGCGCTGG CCGCTGACGA TGGCCGATTT GGTGTGGCCG TCCTCGACGT TTCGACGGGG
GACTTCTACG CGACCAGCAC CGACGACCGG GAGACGGTCA GGGACGAACT CGGTCGATTC
TCGCCCGCTG AGGGAATCCT CGGCCCCGAC GTCCCGAACC TCTTCGATGG GGCGTGTACC
GTCAGTCCCG TCGAGGGGAC GTACTTCGCC ACTGACCGGG CAGCCGAGCG CGTCGGCGAG
TACTTCGGGA CGCCTGATCG GCTCCTCGCG ACCGATGCCG AGGTCCGGGC CTGTGGCGCG
TTGCTGGCCT ACGCCGAGTA CGCCCGCGGG GGTGAAGCCG GTCGGCTGGA CTACCTCAAC
CACCTCACAC GATACGATCC GCGGGCGTAC ATGGTACTCG ACGCGGTGGC GCTCGAAAGC
CTAGAGATCT TCGAACGCCG GAGTGTCACC GGCGGCGCGG ACCTGACGCT CGTGGACGTG
ATCGACGAGA CGGCTTCTGC GCTCGGCCGT CGTCGGCTGA CCGAATGGCT TCGCCGTCCG
CTCATCGACC GCGACCGGAT CGAGGCGAGA CACGCGGCAG TGGATGCGCT TGTTTCGGAG
CTCCAAACCC GCGAACGGCT CCACGAGCTA CTATCCGACG TCTACGACCT CGAGCGACTC
ATCTCACGCG TTTCCCGCTC GCGAGCCGAC GCCCGTGACC TCCGCTCGTT GAAAGACACA
CTCGACGTGA TCCCGGAGAT CAAAGCGGCG TTGGACGGCA TCGATGCCCC GCTGTTGACC
GACCTTCGAG ACCGTCTCGA CGAGATGGAC GACGTCCGCG GGTTGATCGA CGACGCGATC
GCGGCGGACC CACCGACCGA AATCACCGAG GGTGGGATCA TCAGTGAGGG CTACGACGAC
CGACTCGACG AGTTGCGCGC GACCGAACGA GAGGGCAAGG AGTGGATCAC CGACTTGGAG
GAAAGCGAAC GCGAGCGGAC CGGCATCGAC TCGTTGAAGG TCGGCCACAA CGCCGTCCAC
GGCTACTACA TCGAAGTGAC CGACGCGAAC GTCGATCGGG TTCCCGAGGA CTACCAGCGG
CGACAGACGC TGAAGAACGC CGAGCGTTAC TACACGCCCG AACTCAAGGA GCGGGAAGAC
GAGATCCTTC GGGCGGAAGG GCAGGCCGAC GATCTGGAGT ACGAGTTGTT CGTGGAAGTG
CGTGACGACG TCGCCGCCGA GTCCGAGCGC GTCCAGGCAG TCGCCGACGC TGTGGCAAAT
CTCGACGTGC TGGTTGGCTT TGCCACCGTC GCGGCCGAGC GGGATTATTG CCGCCCCTCG
GTCGGTGGGG ACGGAATCGA CATCGAGGGT GGTCGCCACC CGGTCGTCGA GCGCACCGAG
GACGCGTTCG TTCCGAACGA CACCCATCTC GACGACGACG CCTGTCTCGC GGTGATCACT
GGGCCGAACA TGAGCGGGAA GTCGACCTAC ATGCGCCAGG TCGCGCTGAT CTCGATTCTC
GCCCAGGTCG GGAGCTTCGT GCCCGCCGAA TCGGCGGACC TGCGGATCGT CGACCGCGTG
TTCACCCGCG TCGGCGCGAG TGACGACATC GCCGGCGGGC GCTCGACGTT CATGGTCGAG
ATGAGCGAAC TCGCGACGAT CCTGGAAGGA GCGACCGCGA ACTCACTCGT CTTGCTCGAC
GAGGTGGGCC GCGGGACCAG CACGACCGAC GGCCTGGCGA TCGCCCAGGC AGTGACGGAG
TTCATCCACG ACGAGGTCGG CGCGACGACG CTGTTTGCGA CCCACCACCA CGAACTGACC
GAGGTCGCGG CCGATCTCAA CGGCGCAGTG AACCGACACT TCCGGACCGA ACAGGCGGGC
GAAGAGGTGT CGTTCCCCTA CGATATCGCC ACCGGGCCCG CCGCGGCATC CTACGGTGTC
GAGGTGGCCG GCGTGGCCGG CGTGCCGGAC ACGGTCGTCG GTCGCTCGCG AGAACTGCTC
GGTGACAGTA CCCCCGACGG ACGGGAACCG GGTCAAGAGC CCGATCGAAC CGCGACGGAA
CGTGGAAGTG AAACGCCCGA GGAACCCGAT CGAGACGATG TCGTCGCCGA ACTCCGATCT
CTTTCCGTCG CTGAGATGAC GCCCATTCAG GCGTTGAACA CGCTGGCCGA CTTACAGCGT
CGGGCCGATC GAGAGTAG
 
Protein sequence
MDAALGPPAK MTDRREDLTP MLRQYVELTE RYDDALVLFQ SGDFYKGFCE AAEVLARICE 
VTLTEREDST GTYAMTGVPI DNAESYIEKL LDAGYRVAIA DQVEDPDEVS GVVERAVTRI
ITPGTLTEDE LLGGAENNYV AALAADDGRF GVAVLDVSTG DFYATSTDDR ETVRDELGRF
SPAEGILGPD VPNLFDGACT VSPVEGTYFA TDRAAERVGE YFGTPDRLLA TDAEVRACGA
LLAYAEYARG GEAGRLDYLN HLTRYDPRAY MVLDAVALES LEIFERRSVT GGADLTLVDV
IDETASALGR RRLTEWLRRP LIDRDRIEAR HAAVDALVSE LQTRERLHEL LSDVYDLERL
ISRVSRSRAD ARDLRSLKDT LDVIPEIKAA LDGIDAPLLT DLRDRLDEMD DVRGLIDDAI
AADPPTEITE GGIISEGYDD RLDELRATER EGKEWITDLE ESERERTGID SLKVGHNAVH
GYYIEVTDAN VDRVPEDYQR RQTLKNAERY YTPELKERED EILRAEGQAD DLEYELFVEV
RDDVAAESER VQAVADAVAN LDVLVGFATV AAERDYCRPS VGGDGIDIEG GRHPVVERTE
DAFVPNDTHL DDDACLAVIT GPNMSGKSTY MRQVALISIL AQVGSFVPAE SADLRIVDRV
FTRVGASDDI AGGRSTFMVE MSELATILEG ATANSLVLLD EVGRGTSTTD GLAIAQAVTE
FIHDEVGATT LFATHHHELT EVAADLNGAV NRHFRTEQAG EEVSFPYDIA TGPAAASYGV
EVAGVAGVPD TVVGRSRELL GDSTPDGREP GQEPDRTATE RGSETPEEPD RDDVVAELRS
LSVAEMTPIQ ALNTLADLQR RADRE