Gene Huta_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1024 
Symbol 
ID8383297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp989124 
End bp991118 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content69% 
IMG OID644972088 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003129940 
Protein GI257052107 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.318391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTGA CGGCGATCCC CGGCGTCGGC GAGAAGACCG CGGCGTCGCT TGCCGAACTC 
GACGATCCAG CGGCGGCGAT CGAGAACGGC GACGTCGCGG CCGTCGCTCG GGCCCCCGGC
ATCAGTCAGG GTCGTGCCGC CCGGATCGTC CGCGCGGCGA TCCGCGAGCG CCACGGCGAC
GCGGGCGAGT TCCTGGCGAC GCCGCGCGCC CGGGAAGTGT ACCGGGACGT TCTGGAACTG
CTCGAAGCAC GCACCGTCAC CGACTACGCC GCCGCCCGCC TGGAGACGCT GTACCCCAGC
GCCAGCGATT CTCGGATCGC CGAAGTCCGC CAACTCAGCG AGCGGGCGAT CGAACGCGAT
CCCGACGAGA CCGTCCTCGA GGCGCTCGAA GACGTCGAAC CGCTGGAGCG GCCTGGCGAC
GTTCGTGTTC GGGACCGGTG TCTCGCGACG ACCGACGGCG AGACCTACGC CAGCGCCCGT
GAGGCGATCC CCGAGATGAG CGTCGAAGTC GTCGAGGACA GCCGCGATCT GGCGGAACTC
GCCCGCGGAT ACGCGACGGT CGTCGCACTG GACGATTCCT TCGCCGGCGT CGACGTCGAG
GGTGACGTCC GCGTCGAACC CGACGCCCTC GAGGATCCGG CCTCGGTCGT GCCCGAGCGC
CCGATTGCCT TTTTCGCGCA CAACCGCGAC CGCATCCTCG CGGCGATTTC CGTCCACCGG
GCAGCCGACT TCGATCCGCC GTGCGACCTC GACGCGCTCG AAGCCGCGCT GGACCGACTC
GACGCGGAGG GGACGCCGAC CGGTGACGAC GAACTAACGC GATTGCAGGC TGCCGTCAAC
GACCTCGACG CCGCCGTGAG CGAGGCCGAA TCGGTCGCCA ACGACCGCCT CCGGGAAGCG
ATCGAAGCGC AGGATGTCAC CATCGAGGGG GCGGACCTGC TCTCGCTCGT CGAGCGGGGC
GCTGGCGTCG ACGAGGTGCT CTCCCGGGAA CTGGCCGACG AGTACGACGA CGCCGTCGAA
GCGGCCCGCG AGCACGTGAT CGACACACTC GACCTGCGGG ACGTAGCTGA CATCACGAAG
CGGGCGTTCC CGGACGAACC CACGTTTCCC GTCGAGCGCG AGGAGAGCGT CGTCTCCCGA
CTCCGGGAGG AGCTCACGAC GGCCCGGGAC CGCCGGGCCG AACGGCTCAA AACCGAGCTG
GCCGACGAAC TGGCGTCGAT GCGAGAGCCG GCCGAGGATC TCGTCGATAC CGCGCTCGAA
CTGGACGTCG AACTCGCCAT CGCCCGCTTG GCTGCCGATT TCGACGCGAC GATGCCGGCA
CTCGACGGCG ACGGGATCAC GATCGAGGGG GGTCGGTCGC CGCTGCTCGA CGTGGACTTC
GTTGACGTCG AACCAGTCGA CTATGAGGTC AGCGGTGTTC GCCTCCTCTC GGGGGTGAAC
AGCGGCGGGA AGACCTCGAC GCTGGACCTG CTCGCGCTGA TTGTCATCCT CGCGCACATG
GGGCTACCGG TGCCCGCAGA CCGGGCCCGA GTCGGGCGGA TCGACGCGCT GCACTACCAC
GCCAAGACCC AGGGCACGCT GGACGCGGGG GCCTTCGAGA GCACGCTCCG ATCGTTCGGC
GAGTTGGTTA CTGACGCCGC GAACGAGGGT GAGACACTCG TGCTGGTCGA CGAGCTGGAG
AGCATCACCG AACCCGGCGC GAGCGCGAAG ATCATGGCCG GGATTCTGGA GGCGCTGGCC
GAACGCGACC AGACGGCCGT GTTCGTCTCC CACCTCGCCC GGGAGATCCG CGAGACGGCC
GATCAGGACA TCGGTGTCGA CGGCATCCAG GCACTCGGCC TCGAAGACGG CGAGTTACAG
GTCGACCGGA CGCCCCGGAA GGACACGCTG GCGCGCTCGA CGCCCGAGTT GATCGTCGAA
AAACTCGCCG ACGGCGACGA TCGCGAGGAC GGCGAGGGGA ACTTCTACGG ACGATTGCTC
GAGAAGTTCG AGTAG
 
Protein sequence
MDLTAIPGVG EKTAASLAEL DDPAAAIENG DVAAVARAPG ISQGRAARIV RAAIRERHGD 
AGEFLATPRA REVYRDVLEL LEARTVTDYA AARLETLYPS ASDSRIAEVR QLSERAIERD
PDETVLEALE DVEPLERPGD VRVRDRCLAT TDGETYASAR EAIPEMSVEV VEDSRDLAEL
ARGYATVVAL DDSFAGVDVE GDVRVEPDAL EDPASVVPER PIAFFAHNRD RILAAISVHR
AADFDPPCDL DALEAALDRL DAEGTPTGDD ELTRLQAAVN DLDAAVSEAE SVANDRLREA
IEAQDVTIEG ADLLSLVERG AGVDEVLSRE LADEYDDAVE AAREHVIDTL DLRDVADITK
RAFPDEPTFP VEREESVVSR LREELTTARD RRAERLKTEL ADELASMREP AEDLVDTALE
LDVELAIARL AADFDATMPA LDGDGITIEG GRSPLLDVDF VDVEPVDYEV SGVRLLSGVN
SGGKTSTLDL LALIVILAHM GLPVPADRAR VGRIDALHYH AKTQGTLDAG AFESTLRSFG
ELVTDAANEG ETLVLVDELE SITEPGASAK IMAGILEALA ERDQTAVFVS HLAREIRETA
DQDIGVDGIQ ALGLEDGELQ VDRTPRKDTL ARSTPELIVE KLADGDDRED GEGNFYGRLL
EKFE