Gene SYO3AOP1_1568 details

Gene Information

Locus tag: SYO3AOP1_1568
Symbol:
ID: 6332100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfurihydrogenibium sp. YO3AOP1
Kingdom: Bacteria
Replicon accession: NC_010730
Strand:
Start bp: 1619751
End bp: 1622042
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 31%
IMG OID: 642657843
Product: MutS2 family protein
Protein accession: YP_001931720
Protein GI: 188997469
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
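
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1619751
END_BP = 1622042


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2292 763
```

Under those assumptions, the result agrees with the listed gene length (2292 bp) and protein length (763 aa).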


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGAAA GAGATTTAGA AAGTTTAGAG TATAGAAAGT TTTTAAACCT GCTTTCAAGC 
TATACCCATA ACGAGATAAC GAAAAATAAG ATAAACAATC TAAAACCTAT TACAAACAGA
GAGTTTTTAG AGAGAGAAAT AGCAAAAGCA TCTGAGTTTG AATCCATCTT TTTAAAAGAA
GGATATTTCC CACTTTCAGA ATTTCCTGAC ATTACCCAAG CAATAAACCT TGCAAAGGTT
GAAGATAGTA TTTTGTCTCC TAAAGAGATT TTTGAAATCG GAGAAATTTT ACGAGTAGTC
AAGAATGTAA AATCTTTTTT ATCAAACCAT ACTCTAAACC ATCTTAAAAA GCTGTTTCAA
AATCTTACTC CACTTAGAGA GCTTGAAAAG TTTATCACAG ACAGCATAGA CTCAGACTTT
GCTGTAAAAG ACAGTGCAAG TAAAGATTTA GCAAGGATTA GAAAAGAGAT TAAAGAAGTT
GAAAAGCTTA TAAACAATCA GCTTGAAAAA ATCTTGAACA ATCCAAACTA CCAAGACGCA
ATTCAAGAAA AGCTTATCAC CTTAAGAAGA GATAGATTTG TCATTCCTGT AAAGTATAAC
TTTTCCCATA GAATAAAGGG CATCATTCAA GACAGGTCTT CTTCAGGAAA TACAGTCTAT
GTAGAACCAT TTGAAGTAGT ACCGCTGAAC AACAAACTCA CAGATTTAAA ACTCCAAGAA
AACTTAGAAA TTAGAAAAAT ACTGAGATTC TTAACAGACA TAATTAGAAC AAAAATTAAC
TTTATTTCAA ACTCTTTTGA TGCTTTGATT GAGTTTGACA TTCTCTACAC AAAAGCTAAA
TTTTCAAAAG CTTTTAACTG CAGATTTCCA CAGATAGGAG AATCATATCA GCTTTATAAT
GCAAAACATC CTATCTTTTT ACTTAAAGAA AAGCCATTTA TTCCAATAGA TATCTTACTT
GACGAAAAGA GAGGATTGGT AATTACAGGA CCAAATACAG GCGGAAAGAC GGTAGCACTA
AAAACAGCAG GACTTTTGTC TTTAATTTTT CAATCAGCCA TAGCCATACC TGTTGATGAA
GGAAGTAAAA TTCCAATCTT TAACGGAATA TTTATAGACA TTGGAGACTA TCAAAGTATA
GAGGAAAATC TATCTACATA CTCAGCACAT ATTAAAAACA TAAGAGAGAT GTTAGATTTA
GCAAATAAAA ACTCTCTTTT GCTATTTGAC GAGCTTATAC CAGGAACAGA CCCAGATTTT
GCATCAGCTA TTGGTATAGC AATTTTAGAC TATGTAAAAG AAAAAAACAT AAGAGTAATA
GCCACTACAC ACTTAAAGAA AATAAAAGCT TACGTATTAA ACAACGATTA CTTTAAGATA
GCAGCAGTTG GATTTGACAA AGAGACCTTA ACACCAACTT ATAAAATTTA TTACAATGCA
GTAGGCGAAA GCATGGCGTT TTACATAGCC CAAAAGCTAA ATTTACAAAA AGAAATCATA
GAAAAAGCTA AATCTTTAAT ATCTAAGGAT TTACTAAATT TTGAAGAATT AGCATCTAAA
TTTTCAGCGT TAATATCAGA ATATGAAGAA AAAATTAAAG AGATAAACCA GTTAAAACAG
CAATTAGAAT TAGAAAAAGC AAAATACGAA AACTTAGCAA AACAGCTTGA AAAAGACAAA
AAAGAAAAAT GGAAAGAGAG CTTAAAAGAA ATTCAAGATT TTGTTGAAAA AATAAGACAG
GAAGGCTATG AAGTCTTAAA AGAAGTAAAA GAAAGGCAAT CAGGAGCACC GTTAGAAAAA
TTTGTCAAAG AAAAGAAAAA CATCAATATA AAAACAGAAG AAGAAATTAA GGCAGAAGAG
ATTAAAGAAG GTGATGTTGT AAGAATAAAA GGAAAAACTC AGGAAGGAAC GGTTATTGCC
ATCAGAGAAG ATAAAGCTAA TGTAAACTTC GGTGGGATAA AAATATGGCT ACCTTTAAAT
CAGCTAGAAA AAAGACAGCC TAAAGAAGAA AAGACAACCT TTAAAATAAC CAAATCAAAA
ACAGATATAA CCCCATCTAT CAATCTAATT GGCAAGACAA AAGAAGAGGC TATAAAAGAG
TTAGAAAAGT ACATCGATAA AGTAATACTT GAAGGTTATA CTACCTTTAA AATAATCCAT
GGCTATGGGG CAGGCGTTTT AAGAAATGCA GTAAGAGAGT ACTTAGACAA ACTTCCTTTT
AAGCTAAAAT ACGAAGATGC ACCATATCAC GAAGGCGGTC TTGGAGTAAC CATAGTTAGA
TTTGAAGAGT AG
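
A GC content figure such as the one listed in the gene information can be recomputed from a sequence printed in this space-grouped layout. The following is a minimal sketch; `gc_content` is a hypothetical helper name, and it assumes the sequence contains only A, C, G, and T plus whitespace.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base group of the gene sequence above: 3 G, 0 C.
print(gc_content("ATGAGAGAAA"))  # 0.3
```

Passing the full gene sequence as one multi-line string gives a value that can be compared with the GC content listed in the record.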
 
Protein sequence
MRERDLESLE YRKFLNLLSS YTHNEITKNK INNLKPITNR EFLEREIAKA SEFESIFLKE 
GYFPLSEFPD ITQAINLAKV EDSILSPKEI FEIGEILRVV KNVKSFLSNH TLNHLKKLFQ
NLTPLRELEK FITDSIDSDF AVKDSASKDL ARIRKEIKEV EKLINNQLEK ILNNPNYQDA
IQEKLITLRR DRFVIPVKYN FSHRIKGIIQ DRSSSGNTVY VEPFEVVPLN NKLTDLKLQE
NLEIRKILRF LTDIIRTKIN FISNSFDALI EFDILYTKAK FSKAFNCRFP QIGESYQLYN
AKHPIFLLKE KPFIPIDILL DEKRGLVITG PNTGGKTVAL KTAGLLSLIF QSAIAIPVDE
GSKIPIFNGI FIDIGDYQSI EENLSTYSAH IKNIREMLDL ANKNSLLLFD ELIPGTDPDF
ASAIGIAILD YVKEKNIRVI ATTHLKKIKA YVLNNDYFKI AAVGFDKETL TPTYKIYYNA
VGESMAFYIA QKLNLQKEII EKAKSLISKD LLNFEELASK FSALISEYEE KIKEINQLKQ
QLELEKAKYE NLAKQLEKDK KEKWKESLKE IQDFVEKIRQ EGYEVLKEVK ERQSGAPLEK
FVKEKKNINI KTEEEIKAEE IKEGDVVRIK GKTQEGTVIA IREDKANVNF GGIKIWLPLN
QLEKRQPKEE KTTFKITKSK TDITPSINLI GKTKEEAIKE LEKYIDKVIL EGYTTFKIIH
GYGAGVLRNA VREYLDKLPF KLKYEDAPYH EGGLGVTIVR FEE
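
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code, and it does not handle alternative start codons. `translate` and the constants are illustrative names, not a library API.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAGAAA GAGATTTAGA AAGTTTAGAG"))  # MRERDLESLE
```

The output matches the first ten residues of the protein sequence. The final codon of the gene, TAG, is a stop codon and is not represented in the protein.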