Gene Information |
Locus tag | Haur_4891 |
Symbol | |
ID | 5736726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6226670 |
End bp | 6229090 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282057 |
Product | MutS2 family protein |
Protein accession | YP_001547649 |
Protein GI | 159901402 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
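
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the Protein Length excludes the stop codon; the record does not state either convention, although both are consistent with its numbers. The function names are illustrative only and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 6226670
end_bp = 6229090
reported_gene_length = 2421      # bp
reported_protein_length = 806    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # lengths agree?
print(computed_protein_length == reported_protein_length)  # lengths agree?
```

Under these assumptions, 6229090 - 6226670 + 1 = 2421 bp, and 2421 / 3 - 1 = 806 aa, matching the reported values.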
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATCAG AAGTTGTACT CCAAACCCTT GAGTTCGATA AAGTTCGTGA TCAGCTTGCA CGTCATGCTG CGTTTAGCGC AAGCCGTGAG TTGGTTGCGC AACTCCATCC ATCAACCGAT GGGCAGTGGA TTCTGCAGGC GCAAATTCGT ACTAGTGCCG CCCGTGCTTT AATTGAATCG TTTGCCGATG TCTCAATTGG CGGTGCTCGT GATGTTCGCC CTGCGGTGGA ACATGCTCGT CGTGGTGGGA TTCTTGAGGC GAGTCGGGTA CAAGAAATTG CCGCTACCTT GGGGGCGATG CGGCGGTTGC GTGGCCAAGT TTTGCGCAAT CATCCCGATT TTGTGCCATT ACACCCTTTA GCCGAGCAAC TGCCAAATTT GGCCACGCTG GAACATGAAA TTGAACGCAC AATCGGCCCT GATGGCGAGG TATTAGATAG TGCTTCGGCT GAATTGGGCC GCTTACGTAG CGCCATTCGG GTGGCGTTTA ATCGGTTGCA AGAGCGTTTG CAAGCAATTA TCAATTCGTC GCAATATGCC GATGTGCTGC AAGAGCCAAT TATCACGGTG CGCGATGGCC GCTATGTCGT GCCAGTCAAA GCCCCACAAC GACGGGCCTT GCGCGGGATT GTCCACGATC AATCATCGTC GGGCGCAACC CTGTATATCG AGCCATTGGC TACGGTTGAG TTAAATAACC AGTGGCGCGA GCTGCAATTG GCCGAACGCG AAGAAATCCA GCGCATTTTG GCGGCGCTCT CGGGCAAAAT TGCCAACGAA GGTATGCCAA TTATTGTTGG GGTTGAGGCT ACCGCTGAAT TAGATTTAGC CTTTGCCAAA GCCAAATATA GCATTAGCTT GCGTGCTAGC CAACCTGCGA TCAATACGCC AGTTCCTGCC GATGATTTGC ACCCCGAATC AACTTTGTCA TTGCTCAAAG CCCGCCATCC CTTGCTCAAC CAAGATCTGG TTGTGCCAAC CGATGTCTGG CTGGGCGGCC CAACGCAGAT GATTATTATC ACTGGCCCGA ATACTGGTGG TAAAACGGTG GCGCTCAAAA CCGTTGGTTT GATGGCATTA ATGGCCCAGG CGGGTCTGCA TATTCCGGCC CATCAAGGCT CGCGCTTACC TATTTTTGGT AAAATTTTTG CTGATATCGG CGATGAACAA AGTATCGAAC AAAGCCTCTC GACCTTCTCC TCCCACATGA CCAACATTAT CCAAATCTTG GATCGGGTAA CCCCCGATTC GTTGGTGTTG TTTGATGAAT TGGGCGCTGG TACTGATCCA GTCGAGGGTG CTGCTTTGGC GCGAGCAATT ATCGAACGCT TGTTGAATGT GGGATGTTTG GCGATGGCAA CCAGCCACTA TGCTGAACTC AAGGCTTTTG CCTACAGCAC TGATGGGGTT GAAAATGCCT CGGTTGAGTT TGATGTTGAA ACCTTATCGC CAACCTATCG ACTTTCAATC GGCTTGCCAG GCCGCTCGAA TGCTTTGGCA ATTGCTGAAC GCTTGGGGCT TAAACGCGAC TTAATCGAAC GTGCTCGGGC AACGATTAGC CGCGATAACG TCCAAGTTGA AGATTTGCTG GCCGCGATTC ATCGCGAACG CACAACCGCC GAAAGTGAGG CTGCCCGCGC CTTGGAATTG CGCGAAGATG CCGAATTGGT GCGCGATCGG CTGAGCCGCG AATTGTATGA GTTTGAACAG GATCGCGAAC AGCAATTAGC CAGTTACCAA CGCCAACTTG ATGATGAATT GCGTGAAGTA CGAGCTGAAT TGCGCCGCCT ACGTGATGAA 
TTTCGCTCAG TTTCGGTTAG CCGCCAATGG ATGGAACAAG CCGAACAACG CCTCAGCCGA GTTGCCGAAC GGGTTCCCCA AACTCCAACT CCCCCCAAAG CCAAAGTTCC AGTTGTACCC AAAGTTGCGC TTGCCCCACT TCCTCGCACA ATTCAAGTTG GCGATCAGGT GTTTGTGAGC AGCGTGAAGC TTTCGGGCGT GGTGCTCGAT TTGGATGAAG AAGCCAACGA GGCCGAGGTT CAATTGGGTG GCTTCCGCTT GCGGGTTGAT TTACGCGAGT TGCGGCTGGA AAAAGCGGGC ACTAGTCCAA CCCAAGCGGT ACAAAAATAT GTACCTGTTC AGCGCATGAT CAATACTCCT CCACCGCCGA ATGTTTCGAT GCAGCTTGAT ATGCGTGGTT GGCGAGCCTC GGATGTGGAA AGTCAGCTCG ATCATTATCT CAACGATGCG TACCTCGCCA ATCTTTCAGA AGTGCGTTTG GTTCATGGCA AGGGTACAGG GGCGCTGCGC CAAGTTGTAC GAACATTGCT CAAACGCCAT CCCTTGGTCG AATCGTACAA TAGCGGTAGC CAAGGTGATG GCGGCGATGG CGTAACAATC GCCAAAATGG TTGCTCGTTG A
|
Protein sequence | MISEVVLQTL EFDKVRDQLA RHAAFSASRE LVAQLHPSTD GQWILQAQIR TSAARALIES FADVSIGGAR DVRPAVEHAR RGGILEASRV QEIAATLGAM RRLRGQVLRN HPDFVPLHPL AEQLPNLATL EHEIERTIGP DGEVLDSASA ELGRLRSAIR VAFNRLQERL QAIINSSQYA DVLQEPIITV RDGRYVVPVK APQRRALRGI VHDQSSSGAT LYIEPLATVE LNNQWRELQL AEREEIQRIL AALSGKIANE GMPIIVGVEA TAELDLAFAK AKYSISLRAS QPAINTPVPA DDLHPESTLS LLKARHPLLN QDLVVPTDVW LGGPTQMIII TGPNTGGKTV ALKTVGLMAL MAQAGLHIPA HQGSRLPIFG KIFADIGDEQ SIEQSLSTFS SHMTNIIQIL DRVTPDSLVL FDELGAGTDP VEGAALARAI IERLLNVGCL AMATSHYAEL KAFAYSTDGV ENASVEFDVE TLSPTYRLSI GLPGRSNALA IAERLGLKRD LIERARATIS RDNVQVEDLL AAIHRERTTA ESEAARALEL REDAELVRDR LSRELYEFEQ DREQQLASYQ RQLDDELREV RAELRRLRDE FRSVSVSRQW MEQAEQRLSR VAERVPQTPT PPKAKVPVVP KVALAPLPRT IQVGDQVFVS SVKLSGVVLD LDEEANEAEV QLGGFRLRVD LRELRLEKAG TSPTQAVQKY VPVQRMINTP PPPNVSMQLD MRGWRASDVE SQLDHYLNDA YLANLSEVRL VHGKGTGALR QVVRTLLKRH PLVESYNSGS QGDGGDGVTI AKMVAR
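
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ only in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and only the first 30 and last 21 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these codon assignments
# with the standard code. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(grouped: str) -> str:
    """Remove the spaces between the 10-character groups used in the record."""
    return "".join(grouped.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence, copied from the record.
head = "ATGATATCAG AAGTTGTACT CCAAACCCTT"
tail = "GCCAAAATGG TTGCTCGTTG A"

print(translate(clean(head)))  # compare with the start of the protein sequence
print(translate(clean(tail)))  # compare with the end of the protein sequence
```

The two fragments translate to MISEVVLQTL and AKMVAR, which are the first ten and last six residues of the protein sequence listed above; the final codon, TGA, is the stop codon and contributes no residue.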