Gene HY04AAS1_1152 details


Gene Information

Locus tag: HY04AAS1_1152
Symbol:
ID: 6743969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 1063897
End bp: 1066170
Gene Length: 2274 bp
Protein Length: 757 aa
Translation table: 11
GC content: 34%
IMG OID: 642750962
Product: MutS2 family protein
Protein accession: YP_002121816
Protein GI: 195953526
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
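
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive (this matches the reported gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one amino acid.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1063897, 1066170)
print(length)                     # 2274
print(protein_length_aa(length))  # 757
```

Both values agree with the Gene Length and Protein Length fields of this record.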


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGAAT ATGAGCTTAA GAAGTTAGAG TTTCACAAGA TAAAAGAAAA CCTTAAGATA 
AGGACCCATT CTGTAGCAAG TTTAGAGTAT ATAGAAAATA TAAAACCTAT CCCAAAAGAG
GAACTTTTAC AAGAACAAAA GCTTGTTGAA TGTTTTATGA GGCTTTTAAG ACAAAGAGAA
AACATATTAT ATAGCTTTGA TGATATTTCT AAAAGCCTAA AAAAAGCAAT GATAGAAGAA
AGTGTTTTAG GAATAGACGA AATACTTAGC ATATACAAAG TTTTAAAAAT TATAAAAGAT
GTAAGAAAAT TTTTAGTGGA TGCCTTAGAT AGGTGCGATT TATTCAATAA AATACTAAAA
GATTTAGGTA CGTTTCAAAC CTTAGAAGCT GAAATAGAAA GAGCCATAGA CAATTCTGGT
GTTGTAAAAT CAGAAGCCTC TAGGGATCTT TTTGAGATAA GAAAAGAGAT AAAGCAAGTA
GAAAAAACTA TCACGGAGAA ACTAGAAGCT CTTTTCCAAA GACCCGATAG CGATATTTTA
TTTTCTGAAA AGCTTATCAC AGTAAGACAA AATAGATATG TGGTGCCTGT AAAAACCCAA
AGCGTCAAAA GGATAGTGGG GATAGTGCAC GGTGTTTCTT CTTCTGGTTT TACCACATAT
TTAGAGCCCC AGATAGTGGT GGATTTAAAC AACAAGCTTG CAGTTTTAAG AATAGAGGAA
GAAAAAGAGA TACATCTTGT ATTAAAAAAA CTCACTTCAT TTATAAGAGA AAGGGCAAAT
AGACTTTTAG AATCTTTCAA TACGCTTGTT AAAATAGATA TTCTGATGGC AAAAGCATCC
TTTGGCATAG AATACGAGTG CTCATTACCT TCTATTGGCG ATTGTATAGA GCTTTTAGAA
GCAAGAAATC CAATTATGAC AATTTTATCT CAAAATCCTA TTCCAGTGGA TATAATTTTA
AAAGACAAAA AAGGGCTTGT GCTTACAGGG CCAAACACTG GCGGAAAAAC CGTTTTCCTA
AAAACCCTTG GACTTAGTTA TATCATGTTT TTACACGCTA TACCTATACC TGCCTCACCT
AATAGCAAAC TGCCTATTTT TGACAATATT TTTGTGGATA TAGGAGATGA ACAAGACATA
TCACAGTCTT TATCGACTTT TTCTTCTCAT ATAAAAAATA TCTCGGAGAT TTTACAAAGA
TCCACAGAAA AAACGCTGAT ATTGATAGAC GAATTAGGAG CTGGCACAGA CCCACTGGAG
GGTTCTGCTC TTGGTATAGC AATCCTTGAT TATATAAAGA AGTTAAACGC TTTTGTAGTG
GTTTCCACTC ATCACACACC CATAAAGCTT TGGGCTGTAA ACTCTGATTA TTACGAACCG
GCTACGTTGA TGTTTGATAG GGATACCTTA AGACCTCTTT ACAAAGTGCT TTACGGCACC
ATTGGAGAAA GCATGGGTAT AGAAGTGGCT AAAAGGTTTG GGATACCAAA GGAAGTTATT
TTAGAAGCCC AAAAACTTCT TGGAGAGAAT ACTTTGGAAT ACCAAGGTGT TATGGAAAAC
TTAAATAGAC TTGTCAGAGA ATACCAAGAC AAAATGGAAA TATTGGAAAA ACATAGAGAG
GAGCTTGAAC TTTTAAAAAG AAAATACGAA TCTCTCGTAG AAGAGATGGA AAAAGCCAAA
GAAGATGCAT GGAAAAATGC CGCAAAAGAG GCTCAAAACT ATTTGGAACA ACTTAAGAAA
GAAGCTCAGG AATTTTTGGT TGGTCTTAAA GAAAAAGCTA GTTTAAAAGA TTTTATAAAA
CAAAAGCAAG AAGAGTTAAA AAAGTTTGAA AAAGAAGAAG AGCAGCAAAT AGAAGTAGGA
GATTGGGTAG AGTTTATGGG TGGAAAAGGT AGGGTTTTAG AAATAAGGCA AGACAAAGCT
CAAGTGATGT TTGGGGATAT AAAGGCTTGG ATAAAGCTAA AAGATCTTTC AAAAACCACC
AAAATACCAA GAACTCACAC TACAAATATA AGCATAGAAA GGTTTGAAAA CAAAAAAGCC
GGTATGCCAG AGATAAATTT AACCGGGCTT TCAGTAGAAG AAGCTATAAG TAAACTTGAT
AAATTTTTAG ATAGCGCTTT TGCAAGTGGA GTTAAAATGG CAAAAGTAAT ACACGGTGTT
GGGGTACTTA AGAAAGCTGT ATCAGACTAT CTTTCGTCAT CTTCTTACGT TGTATTCTAC
AGAGATGCCT ATCCAAAAGA AGGTGGACCT GGTACCACCA TAGTATATTT TTAG
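
The gene sequence above is laid out in blocks of 10 bases separated by spaces, so it has to be joined into one string before any analysis. The following Python sketch shows one way to do that and to compute the GC fraction of the result. The helper names are hypothetical, and the example runs only on the first line of the sequence, so its GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGAGAAT ATGAGCTTAA GAAGTTAGAG TTTCACAAGA TAAAAGAAAA CCTTAAGATA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base fragment only
```

Passing the full multi-line sequence block to `clean_sequence` should give a 2274-base string, whose GC fraction can be compared with the GC content field.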
 
Protein sequence
MREYELKKLE FHKIKENLKI RTHSVASLEY IENIKPIPKE ELLQEQKLVE CFMRLLRQRE 
NILYSFDDIS KSLKKAMIEE SVLGIDEILS IYKVLKIIKD VRKFLVDALD RCDLFNKILK
DLGTFQTLEA EIERAIDNSG VVKSEASRDL FEIRKEIKQV EKTITEKLEA LFQRPDSDIL
FSEKLITVRQ NRYVVPVKTQ SVKRIVGIVH GVSSSGFTTY LEPQIVVDLN NKLAVLRIEE
EKEIHLVLKK LTSFIRERAN RLLESFNTLV KIDILMAKAS FGIEYECSLP SIGDCIELLE
ARNPIMTILS QNPIPVDIIL KDKKGLVLTG PNTGGKTVFL KTLGLSYIMF LHAIPIPASP
NSKLPIFDNI FVDIGDEQDI SQSLSTFSSH IKNISEILQR STEKTLILID ELGAGTDPLE
GSALGIAILD YIKKLNAFVV VSTHHTPIKL WAVNSDYYEP ATLMFDRDTL RPLYKVLYGT
IGESMGIEVA KRFGIPKEVI LEAQKLLGEN TLEYQGVMEN LNRLVREYQD KMEILEKHRE
ELELLKRKYE SLVEEMEKAK EDAWKNAAKE AQNYLEQLKK EAQEFLVGLK EKASLKDFIK
QKQEELKKFE KEEEQQIEVG DWVEFMGGKG RVLEIRQDKA QVMFGDIKAW IKLKDLSKTT
KIPRTHTTNI SIERFENKKA GMPEINLTGL SVEEAISKLD KFLDSAFASG VKMAKVIHGV
GVLKKAVSDY LSSSSYVVFY RDAYPKEGGP GTTIVYF