Gene Huta_0551 details

Gene Information

Locus tag: Huta_0551
Symbol:
ID: 8382818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 557406
End bp: 559157
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 69%
IMG OID: 644971613
Product: DNA mismatch repair protein MutS domain protein
Protein accession: YP_003129471
Protein GI: 257051638
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
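
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a CDS that includes its stop codon. Both are assumptions that happen to match the values in this record; the function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and that the stop codon
    is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(557406, 559157)
print(gene_len, protein_len)  # 1752 583
```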


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.27881
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGTAG AGGACTACTG GGGCGTCGGG CCGAAGACCC GTGATCTCCT CGCCGAGTCG 
CTGGGCATCG AAACTGCCAT CGCCGCGATC GAATCCGGCG ATCTTCGGGC GTTGACCGAG
GCCGGATTGA GTCGCGGCCG AGCGACACGG ATCCTTCGCC GGGCCCAGGG CGGGGCGATG
GACGTGCTGG CGACCCGCGA CGCCCGATCC GTGTACAAGT CGGTGCTGGA CCTGGCGAGT
GATTACGCCG TGACCCGCCA CGCCGCCGAC AGCATCCGAC TCCTCACGCC GCTCGATTCC
CGGTCCGCCA TGGAATCACG CCTCGAAACC GTCATGGACG CCGTGGCCGT CTGGAACGCA
CTCGACGAGT CGACTCGGGA GGGCGTCATC GAGGCGTTCG ACGCCTACGA CAGCGTCGAA
GGCGGTGATC TCGCCGGCGT CCGGACCGCA CTGGCATTGC GGGAGACCGG TGTCACCGAC
GGTGTGTTCG CGCCGCTCGC AGACCTCGAG GTCGAACAGC TCGACGCCGC GGCCGACGCG
CTCGCCGCGC TCTCGGCGGA CGGCGTCGCG GCCGGGGCTG ACGACCGTCT CGACGATCTG
AGGACGCAGC TCGGGGCGAT CGAGGATATG GCCGCCGACG CCGAGTCGGT GATCGCCACG
ATCCGCGACG CTGGCGTCCG GGGTGGCGAC GAGTTCCGCG AGCGATTCGT CGACCACGTC
GTCAGCGAAG CCGGCGTCGA TGTCGGGGCC GTCAGGGAGG CGATGGTCAC CGACGCCCCG
GACGTGACCG ACTTCGTTTC GGAGACGCTT CGCGGCCTCG CCGCGGATCG ACGTGACGCC
GTCGAGGAAC GCGAGGCGGA CGTCCGCGAG CGCCTCGAAA CGAGCCTGGA ACTCGCCCGC
GAGGACGTCG ACGCCGCTGT CGACGTCGTC GACGAGATTG CCCGAGACGT CTCGCTGGCC
CGGTTCGCTC GTGCGTTCGA TCTCACCGCG CCGACCTATC GTGAGGGACG GGTGCTAGCC
GTCGAGAACG CTCGCAACCT CGAACTGATG GGTGGAGATG TCGCCGTCCA GCCGGTCACC
TACGGGATCG GCGACCACTC GCTGTCGGTG GCCGGCGCGA ACGAACCGCC ACGGGGCGAC
CGCGTCGCCG TCCTGACAGG AGCCAACAGC GGCGGGAAGA CGACGCTGCT GGAGACGCTC
GCGCAGGTGC AGTTGCTCGC CCAGATGGGG CTGCCCGTGC CAGCGGATGC CGCCGAAGTG
GGGGTCGTCG ACGCCGTGGT CTTCCACCGC CGACACGCGA GTTTCAACGC GGGCGTGCTC
GAATCGACAC TCCGGACAGT CGTCCCGCCC CTGACTGACG AGGGACGAAC CCTGATGCTT
GTCGACGAGT TCGAGGCGAT CACCGAACCC GGCAGCGCCG CCGATCTCCT TCACGGCCTG
GTCACGCTGA CGGTCGACCA GCCGGCGCTT GGCGTGTTCG TCACCCACCT GGCTGACGAC
CTGGAACCGC TCCCATCGGC AGCCCGAACT GACGGCATCT TCGCCGAAGG GTTGACGACG
GATCTCGAAC TCGAAGTCGA CTATCAGCCC CGGTTTGGCA CGGTCGGGCG CTCGACACCG
GAGTTCATCG TCTCGCGACT CGTGGCCGAC GCCGACGATC GACGCGAACG CGGCGGGTTC
CAGACGCTTG CCGCGGCCGT CGGCGAGCAA GCCGTCCAGC GGACACTGTC GGACGCCGAG
TGGTCCGGTT GA
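
A GC percentage like the one in the record can be computed from a sequence printed in this 10-base block layout. This is a minimal sketch: `gc_percent` is a hypothetical helper, and it is an assumption that the record's "GC content" field is the plain G+C fraction of the gene sequence.

```python
def gc_percent(seq_text):
    """Percentage of G and C bases in a sequence.

    Whitespace (the spaces and line breaks of the block layout)
    is removed before counting.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
print(gc_percent("ATGGACGTAG"))  # 50.0
```

Passing the full gene sequence text to `gc_percent` would give the value to compare against the GC content field.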
 
Protein sequence
MDVEDYWGVG PKTRDLLAES LGIETAIAAI ESGDLRALTE AGLSRGRATR ILRRAQGGAM 
DVLATRDARS VYKSVLDLAS DYAVTRHAAD SIRLLTPLDS RSAMESRLET VMDAVAVWNA
LDESTREGVI EAFDAYDSVE GGDLAGVRTA LALRETGVTD GVFAPLADLE VEQLDAAADA
LAALSADGVA AGADDRLDDL RTQLGAIEDM AADAESVIAT IRDAGVRGGD EFRERFVDHV
VSEAGVDVGA VREAMVTDAP DVTDFVSETL RGLAADRRDA VEEREADVRE RLETSLELAR
EDVDAAVDVV DEIARDVSLA RFARAFDLTA PTYREGRVLA VENARNLELM GGDVAVQPVT
YGIGDHSLSV AGANEPPRGD RVAVLTGANS GGKTTLLETL AQVQLLAQMG LPVPADAAEV
GVVDAVVFHR RHASFNAGVL ESTLRTVVPP LTDEGRTLML VDEFEAITEP GSAADLLHGL
VTLTVDQPAL GVFVTHLADD LEPLPSAART DGIFAEGLTT DLELEVDYQP RFGTVGRSTP
EFIVSRLVAD ADDRRERGGF QTLAAAVGEQ AVQRTLSDAE WSG
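
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code; the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq_text):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace from the 10-base block layout is removed first.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last twelve bases of the gene sequence above.
print(translate("ATGGACGTAG AGGACTACTG GGGCGTCGGG"))  # MDVEDYWGVG
print(translate("TGGTCCGGTT GA"))                     # WSG
```

The two outputs match the first ten and last three residues of the protein sequence listed above.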