Gene Information |
Locus tag | Hhal_1657 |
Symbol | |
ID | 4709847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1810359 |
End bp | 1812965 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856124 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001003223 |
Protein GI | 121998436 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
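The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1810359
end_bp = 1812965
gene_length_bp = 2607
protein_length_aa = 868


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Both comparisons print True for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1812965 - 1810359 + 1 gives 2607 bp, and 2607 / 3 - 1 gives 868 aa, which agrees with the Gene Length and Protein Length fields.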
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCGA CCCAAGAGGC TGAAACCGCC ACCAATGGCG CCGGGCATAC CCCGATGATG CGGCAGTTCC TGCGCATCAA GGCCGAGTAC CCCGAGACCC TGCTGTTCTA CCGGATGGGT GATTTCTACG AGCTATTCTA TGAGGACGCC GAGCGTGCCG CGAAACTCCT CGACATCACC CTGACCACTC GAGGGGAGTC GGCGGGCGCG CCCATCCCCA TGGCCGGTGT ACCCGTCCAG TCGGTGGAAA GCTATCTGGC GCGGCTGGTC CGCCTCGGCG AATCGGTGGC CATCTGCGAG CAGATCGGTG ACCCCAACAC CACCAAGGGG CCCGTGGAGC GCCAGGTGGT CCGGGTGGTT ACCCCCGGCA CCCTGACCGA GGATGCGCTG CTCGAGGAGC GCAGCGCCAA CCTGCTCACC GCTGTGGCTC CGGGGCCGAA AGGCCGTTTT GGTGTCGCCT CGCTGGAGCT CTCCTCGGGG CGTTTCTCCG TGCTCGAGGC CCCCGATCAG GAGGCCCTCA CCGCAGAGCT CGAACGCCTG CGCCCCGCCG AACTGATCCT CCCGGATGAC GACCAGACGC CCGCACCGGA AGGGGGGTGT GTGGCGCAGC GCCGCCCTTC GTGGCACTTC GAATACGATT CGGCCCGGCG CCTGCTCCTG CGCCAACTCG GCACCCACGA CCTCTCCGGC TTCGGCGCCG AAGAGTTGCA CGCCCCGGTC ACCGCAGCCG GGGCGCTCCT GCAATATCTC AACGAGACCC AGCGGGCCGC GCTGCCGCAT GTTGGCGCAC TGACCGTCGA GTCGCGGGAT GAGGCCATCA CCATCGATGC CGCCAGCCGA CGCAATCTGG AGATTGAACA CAATCTCTCC GGCGGCACCG AACACACCCT GGCCTCGGTG ATCGATACCA GCGTCACCGC CATGGGCGGC CGACTGCTGC GCCGCTGGCT ACAGCGCCCC TTGCGCCGGC GCGAGACCAT CGCCGCCCGC CATGCCGCGG TGGCCGCGCT GGCCGACGGC GCCTTCGCCG ACGTCCGCAG CACCCTGGAA GGCTGCGCCG ATGTCGAACG CATCCTCGCC CGGGTCGCCC TCGGCACCGC CCGCCCGCGG GATCTGACCG GCCTGCGCGA CGCCCTCGAG CGCCTGCCGC AGCTGCAGAT CCTGCTCGGA CAACTCAACA GCCACCGCCT GCAGGCACTG GGCGTCGAAC TCGATGAGCA CCCGCAAACC GTCGACCTGC TGCAGCGGGC GATCATCGAT ACCCCGCCCG CGACCGTGCG CGACGGTGGG GTCATCGCCG ACGGCTTCGA CGGCGAGCTC GACGAACTGC GCTCAATGTC GCGCAACGCC GACGACTACC TGGCCGCCCT GGAGGCCGAG GAGCGCGCAG CGACAGGGAT CCCGACGCTC AAGGTGGGCT TCAACCGGGT CCACGGCTAC TACATTGAAG TCAGCCGCAG CCAGAGCGGC CATATGCCGG AACGCTACAC GCGCCGCCAG ACCCTGAAGG CCGCGGAGCG ATTCATCACC CCGGAGCTCA AACGCTTCGA GGAGCAGGTC CTCTCCGCGC GCGAGCGCGC CCTGGCCCGC GAGAAGGCCC TCTACGAGCA GCTCGTCGCC GACCTCGCCT CCGAACTGAC CCCACTGCAG CGCAGCGCCT CAGCCCTGGC CGAGCTAGAC GCCCTGGCCG CCTTTGCCGA ACGGGCGCGC AGCCTCGACT ACGTCCAGCC CGAGCTGGCC GACACCCCTG GTGTGCGGAT CGAGGGCGGT CGCCACCCCG TGGTCGAGCA GGCCCTGGAC 
GCCCCCTTTG TGCCCAACGA TGTGCGCCTG GACAACCGCC GGCGGATGCT CCTGATCACC GGGCCGAACA TGGGGGGCAA ATCGACGTAC ATGCGCCAGA CCGCTCTGAT TGCCCTCCTC GCCTACGCCG GCGCTTTCGT GCCGGCCCAG CGCGCAGTGC TCGGCCCCAT CGACCGCATC TTCACGCGCA TCGGGGCGGC CGACGACCTC GCCTCCGGCC GCTCCACCTT CATGGTGGAG ATGACCGAAA CGGCCAACAT CCTCCACAAC GCCACCGCCG AGAGCCTGGT ACTGATGGAT GAAATCGGCC GGGGCACCAG CACCTTCGAC GGCCTGGCCC TGGCCTGGGC CACCGCCGAG CGGCTGGCCA CGCGCATCCG GGCCTTCACG CTGTTTGCCA CCCACTACTT TGAGATGACC GCCCTCGAAC AGATCCACCC CGGCGTGGTC AACGTCCACC TCGAGGCGGC GGAACATGGC GAACGCATCG TCTTCCTTCA CGCCGTGCGC GATGGCCCCG CCAACCAGAG CTACGGCCTC CAGGTCGCCG CGCTGGCCGG TGTGCCGCAA GAGGTCCTCA AGGCCGCCCG GGAGAAGCTG CGCAGCCTGG AGTCCGGGGA TGGCGGCGAC ACCGGCTCGG CACAGCTGCC GCTGTTCGGC CCCGAGCCGG TGTTCCCGCC ACCGGCGCAG CCGGAGCCCG AGCCCGACCC GATCCGTGAA GCGGTCGAAA ACCTCGATCC CGATGGGCTG ACCCCGCGGG ACGCCCTCGA GACCATCTAT TGGCTCAAGG CACAGTGCAA GGAGTAG
|
Protein sequence | MAATQEAETA TNGAGHTPMM RQFLRIKAEY PETLLFYRMG DFYELFYEDA ERAAKLLDIT LTTRGESAGA PIPMAGVPVQ SVESYLARLV RLGESVAICE QIGDPNTTKG PVERQVVRVV TPGTLTEDAL LEERSANLLT AVAPGPKGRF GVASLELSSG RFSVLEAPDQ EALTAELERL RPAELILPDD DQTPAPEGGC VAQRRPSWHF EYDSARRLLL RQLGTHDLSG FGAEELHAPV TAAGALLQYL NETQRAALPH VGALTVESRD EAITIDAASR RNLEIEHNLS GGTEHTLASV IDTSVTAMGG RLLRRWLQRP LRRRETIAAR HAAVAALADG AFADVRSTLE GCADVERILA RVALGTARPR DLTGLRDALE RLPQLQILLG QLNSHRLQAL GVELDEHPQT VDLLQRAIID TPPATVRDGG VIADGFDGEL DELRSMSRNA DDYLAALEAE ERAATGIPTL KVGFNRVHGY YIEVSRSQSG HMPERYTRRQ TLKAAERFIT PELKRFEEQV LSARERALAR EKALYEQLVA DLASELTPLQ RSASALAELD ALAAFAERAR SLDYVQPELA DTPGVRIEGG RHPVVEQALD APFVPNDVRL DNRRRMLLIT GPNMGGKSTY MRQTALIALL AYAGAFVPAQ RAVLGPIDRI FTRIGAADDL ASGRSTFMVE MTETANILHN ATAESLVLMD EIGRGTSTFD GLALAWATAE RLATRIRAFT LFATHYFEMT ALEQIHPGVV NVHLEAAEHG ERIVFLHAVR DGPANQSYGL QVAALAGVPQ EVLKAAREKL RSLESGDGGD TGSAQLPLFG PEPVFPPPAQ PEPEPDPIRE AVENLDPDGL TPRDALETIY WLKAQCKE
|
| |