Gene Hore_05680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_05680 
Symbol 
ID7313540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp618765 
End bp621140 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content39% 
IMG OID643610998 
ProductMutS2 family protein 
Protein accessionYP_002508320 
Protein GI220931412 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGGAAA GCTCCCTTGA AATACTGGAA TTTGATAAAA TAATAGACCG TGTTCAGGAA 
TTTGCAGCAA CCATCATTGG AAAGGAAATT ATATCCAGAC TACAACCGGT CGATAACCTT
AATTATGTGA AAAATAAATT ACGGGAAGTC AGCTCTGCCC GTGAAATTCT TGAAGAGTAT
GGACGGCCTC CCTTTGGCGG TATCAGAGAT TTAAGGGAGA TAATTGAAAA AGCTGATAAG
GGAATAGTTT TAAGTGTAAA AGAAGTCATG GATGTCAGGT CAACCCTGGA AGGGGTCAGG
GAATTAAAAA AATATTCCCG GGAAATAGGA ACAGGAATAG ATGATGAGTT GCAGGATATT
TATAGCATTA TTACTGAAAA ATTTGATAGA CTTACCCCTT TAAAACAATT AGAGAATGAA
ATTAATAGAT GTATAGATGA ACATGGGGAG ATAAAGGATT CTGCCAGTAG AAAGCTCAGA
AGTATAAGGC GGGAGATGGA TAGAATTGAG GGTAAAATAA ATGATAAGCT CAATTCTATA
ATAAATAATA CAAGGTATCA GGAAATGCTT CAGGATAAGC TGGTTACCAT CAGAGGAAAC
AGGTATGTTG TCCCTGTTAA AAGCAGTTAC AAGAACACTT TTTCTGGAAT AGTTCATGAT
CAATCAACAA GTGGACTTAC CTATTTTATG GAGCCAATGG CTATAGTAAA GCTTAATAAC
AGACTCGGGG AATTGAAAAG AGCCGAAGAA CAGGAAATAT ACAGGATATT GAAAAAGTTG
AGTGAAAATA TTAAAGAACA CACCCGGGAT CTGAGTGATA ATCTGGAGAT GGTTTCCCTT
CTTGATGTTG ATTTTGCCAG AGCCAGGTTC AGTATTGAAA TAGAAGGTAT AGAACCTGGA
ATTAATGATA AGGGTTTTAT TAATATAAGA GGTGGACGCC ATCCTTTACT TAAAGTCAAA
CCCGTTCCAA TTGATATAAC TGTTGGCAAT GAGTTTAAAA CCCTGGTTAT TACCGGTCCC
AATACCGGTG GTAAGACCGT AGCACTTAAA ACTGTTGGGT TATTTGTATT AATGGTCCAG
GCAGGCCTTC ATATACCTGC AGAAGAGGAA ACGGTTATCT CTATTTTTAA TGGAGTTTAT
GCCGATATTG GAGATGAACA AAGTATAGAA CAGAATTTGA GTACATTTTC ATCCCATATT
AACCGGATTA AGCGGTTTTT AGGTAAGGCT GATGCCAGAA GCCTGGTTCT TCTGGATGAG
ATCGGGGTTG GTACAGATCC TCGCGAAGGG GCTGCCCTCG GGGTGGCCAT CCTGGAACAT
TTAAGGGAAA GGGGAGTAAC TACTATAGCC ACAACTCATT ACAGTGAAAT AAAGAGTTAT
GCTTACTCCC AGGATGGTGT TGAAAATGCT TCTGTTGAAT TTGATATGGA AACCCTTCAA
CCAACCTACC GGCTCCTTAT GGGGATTCCC GGTGGAAGTA ATGCCTTTGA GATTGCCCTG
AAGCTGGGTT TACCCCATGA TATAATTAAA GATGGTAAGG AGTTAATGAG CGGGGATGAT
ATTAAGGTTG AAAATATTAT TTCTGATTTA AATGAAGAAC GGAAAAAATA TGAACAATTA
AAAATAGAAA TAGAAGAAAG GCTGGAGGCA GTAAAGAAAA AGGAACAGAA GTATGATTCT
TTATTGACCG ATCTTGAGAA GAGAAAAAAG AAGTTAATAA CAGAAGCCCG GGAAGAGGCT
TTACAGATAA TTAAAAAGAC CAGGAAAGAG AGCAAAGAGA TTTTACGCCG GTTAAAGAAT
AAAGAATTTG CTTCCAGGTC TGATATAGAC AGGGTTGAGA ATGAAATCAA TTTGAATCTT
AAGGAAACCG AAAAAGAAAT TAGTGAAAAA AGACAGAACA AAGATGGCCG GACCCGGGTT
AAAGAAATAT CCTGTGGTGA CCAGGTCAGG TTGAAAAAAA CTGGTCAGAA GGGTGAGGTT
ATTTCTGTTG ACCGGGAGAA AGGAGAGGCT GTTATCCAGG CCGGTATTAT GAAAGTAACT
ACAGGGCTGG ATGAAGTAGC GAAAATTGAT ATACCTGATG ATACTAAGGA CGAACTCTTT
CATTCCTATC AGGTCAAGAA AAAAAGCCGG GTTTCTCCTA CCCTTGATCT TAGAGGAGAA
CGTTATGAAA CAGCCCAGCA TAAACTGGAT AAATATCTTG ATGATGTTTT CCTGGCCGGA
TTGAAACAGG TAGAAATAAT TCACGGTAAG GGTACCGGGG CTTTGAGGAA GGCTGTTCAT
ACTGTATTAG AAAAAAACCC CCACATCACC TCTTACCGCC TGGGAAGGCA GGAAGAAGGT
GGGAGTGGGG TGACAATTGC TGACCTCAAA TCCTAA
 
Protein sequence
MEESSLEILE FDKIIDRVQE FAATIIGKEI ISRLQPVDNL NYVKNKLREV SSAREILEEY 
GRPPFGGIRD LREIIEKADK GIVLSVKEVM DVRSTLEGVR ELKKYSREIG TGIDDELQDI
YSIITEKFDR LTPLKQLENE INRCIDEHGE IKDSASRKLR SIRREMDRIE GKINDKLNSI
INNTRYQEML QDKLVTIRGN RYVVPVKSSY KNTFSGIVHD QSTSGLTYFM EPMAIVKLNN
RLGELKRAEE QEIYRILKKL SENIKEHTRD LSDNLEMVSL LDVDFARARF SIEIEGIEPG
INDKGFINIR GGRHPLLKVK PVPIDITVGN EFKTLVITGP NTGGKTVALK TVGLFVLMVQ
AGLHIPAEEE TVISIFNGVY ADIGDEQSIE QNLSTFSSHI NRIKRFLGKA DARSLVLLDE
IGVGTDPREG AALGVAILEH LRERGVTTIA TTHYSEIKSY AYSQDGVENA SVEFDMETLQ
PTYRLLMGIP GGSNAFEIAL KLGLPHDIIK DGKELMSGDD IKVENIISDL NEERKKYEQL
KIEIEERLEA VKKKEQKYDS LLTDLEKRKK KLITEAREEA LQIIKKTRKE SKEILRRLKN
KEFASRSDID RVENEINLNL KETEKEISEK RQNKDGRTRV KEISCGDQVR LKKTGQKGEV
ISVDREKGEA VIQAGIMKVT TGLDEVAKID IPDDTKDELF HSYQVKKKSR VSPTLDLRGE
RYETAQHKLD KYLDDVFLAG LKQVEIIHGK GTGALRKAVH TVLEKNPHIT SYRLGRQEEG
GSGVTIADLK S