Gene Information |
Locus tag | Hneap_0653 |
Symbol | |
ID | 8533789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 707090 |
End bp | 709735 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646383042 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_003262553 |
Protein GI | 261855270 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
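The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Hneap_0653.
length = gene_length_bp(707090, 709735)
print(length)                              # 2646
print(expected_protein_length_aa(length))  # 881
```

Both results agree with the Gene Length (2646 bp) and Protein Length (881 aa) fields. Because the gene lies on the minus strand, the listed sequence is presumably the reverse complement of the replicon at these coordinates, which does not affect the length calculation.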
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGAAT CAAAAGATCT AAGCCAACAC ACGCCGATGA TGCAGCAATT CTGGACGATG AAACAGGCGC ACCCGGATGT GTTGCTGTTT TATCGTATGG GGGATTTTTA CGAGCTGTTT TACGCCGATG CCGAGCGGGC GGCGCGCATT CTCGATTTGA CACTGACGAC GCGCGGGCAG TCGGCAGGCG AGCCGATTCC GATGGCGGGT GTTCCGGTTC ATGCCTACGA GAGCTATCTG GCGCGGTTGA TTCGCGCGGG CGAATCGGTG GCCATTTGCG AGCAGATCGG TGAAACCAAA ACCAAAGGCC CGATGGAGCG TGCGGTGGTG CGGGTCGTCA CACCCGGAAC GGTCACGGAT GAGGCCTTGC TCGATCAGCG CGAAGGCAAC CGCTTGGCGG CATTGGTGCC GCTGGCAACC ACGCCACCGG AATACGGGTT GGCGCATCTG GATCTGGCGG CAGGCGATTT CGTGCTCATG CGGCTCGATG ATGCGGCGCT GACGGCCGAG CTGGCGCGAA TCGATCCGCG TGAATTGCTG TTGCCGGAAT CGCTGGCCGA GGCCGCCGAC ACGGCGGCGA AGATAGGCGT GGACCCCAAA CGTTGGCGTA CGCGCGCCGA TTGGCAGTTC GATGCCAAAC GCGGGCAGGC GGCCTTGCTC AAACACTGGC AGATTCACGA TCTTAAAAGT TTTGGCGTGA CGGAAATCCA TCAACCGGCG CTGGGTGCGG CCGCCATTTT GCTGACCTAT GTAGCCGAGA CCCAGCGTAG TGCCGTGCCG CATATCGAGC GCCTGCGGGT GGAGCACCTG GGCGATGCCC TGCTGATTGA CCGCAACACC CGTCGCCATC TGGAGCTCTT CACTTCAAAT CAAGAAGGAA GTCACGATGA CGGCCGTTCG GCAGCCACGC TGATCAACCT GCTGGATGAG ACGGTGACCG CGCACGGCTC GCGGCTGCTC AAGCATTGGC TAGGTCGCCC GCTGCGTGAT CAGGCCGTGT TGCGGCATCG GCAGCAGGCG ATTGGCGAAC TGATCGAGCG CGGCAAGATC AATGCGCTGC GCGAATCGTT GCGCGGTATC AACGATATTG AACGCATCAC CACCCGCATC GTGATGGGCA GCGCCCGCCC CCGTGATTTG TCCGGGCTGC GCGATGCCCT TGGTGTATTG CCCGCGCTGA GTGCGCAACT CAACCAACTC GACCTGCCCT TATGGCGCGA TCTGGCCGTT CGGCTGACCG ATCAACCCGC CCCGCGTGAA TTGCTGAACC GCGCACTGGT GACCCAACCA CCCGTGTGGC TGCGCGATGG CGGCGTGATT GCCGCCGGAT TCGATGCCGA ACTCGACGAA TTGCGCCACC TTTCTGAACA CGCGGACGAC GCCCTGAATG CGCTCGAAGC CCAAGCGCGA CTGCAAAGCG GTATTCAGTC CTTGAAGATC GCCTACAACC GTGTGCAGGG GTTCTATTTT GAAGTCAGCC GGTTGCAGGC CGAAAAAATG CCACCGCAGT TTATTCGCCG CCAGACGCTC AAATCGGTGG AGCGCTATAC GACCGAAGAG CTGAAAACCT TCGAAGATCG CGTGTTGTCC GCCCGCGACC GCGCCTTGGC ACGCGAACAA GGGCTCTTCA CCGAATTGTT GCAAACCCTC GCGACGCACC AGAGCGCCCT GCGCCGCATG GCCGAAGCCA TTGCCGAGGT CGATGTGCTG CACAGTTTGG CGCGGGTGGC CGAGTGCCAG CGCTGGGTGG CACCGGAACT CGGCAGTGAA CCGGGCATCC ACATCGAAGC GGGACGACAT 
CCGGTGATTG AAGCCCTGAC CAAACAAACC TTAGGGAATC AGCCCTTCAC ACCGAATGAT TGCGAACTCA CGCCAAACCG GCAACTGTTG ATGATTACCG GCCCGAACAT GGGCGGTAAA TCGACCTATA TGCGGCAAAC GGCGTTGATC GTGCTGCTGG CGCACATTGG CGCGTTCGTC CCTGCTACCC GCGCGCGTAT CGGTCCGATC GATCGCATTT TCACCCGCAT CGGCGCGGGC GATGATCTGG CCTCCGGCCG TTCGACTTTT ATGGTCGAGA TGACCGAAAC GGCAGAAATC CTGCACACGG CGACCGAAAA TTCACTGGTA TTGATCGATG AAATCGGTCG GGGCACGTCG ACCTTCGATG GCCTGGCACT GGCCTGGGCC GTGGCGGAGC ACCTGATTCG CCGCAACCGC GCGCTCACGC TGTTCGCCAC CCATTACTTC GAGCTGACTC AACTGACCGA GCGCTTCGAT ACGGTCCGAA ACGTACACCT CGATGCCGTC ACACACAAGG ACGATTTGAT TTTTCTGCAC AGCGTGAAAG ATGGCCCGGC CAGCCAGAGT TACGGCATCA AGGTCGCTGC GCTGGCCGGT TTGCCCCGGG AGGCTATTCG GCGAGCACAA GCGTTACTAA AACAACTAGA GCAGCAACAC CCCGTGGGAG CGGCCACGCC GCAGCTCGAT TTGTTTGCCG CGCCCGAAGT AACCGATGCA ATTGAGGAAC CTGAGATTGA GCCGCACCCG TTGATTACCG CGCTCGAAAA ACTCGACCCG GACATACTCA CGCCGAAGCA GGCGCTGGAT TTGATTTATG CCTGGCGCAA TGAACTTAAG AAGTAA
|
Protein sequence | MTESKDLSQH TPMMQQFWTM KQAHPDVLLF YRMGDFYELF YADAERAARI LDLTLTTRGQ SAGEPIPMAG VPVHAYESYL ARLIRAGESV AICEQIGETK TKGPMERAVV RVVTPGTVTD EALLDQREGN RLAALVPLAT TPPEYGLAHL DLAAGDFVLM RLDDAALTAE LARIDPRELL LPESLAEAAD TAAKIGVDPK RWRTRADWQF DAKRGQAALL KHWQIHDLKS FGVTEIHQPA LGAAAILLTY VAETQRSAVP HIERLRVEHL GDALLIDRNT RRHLELFTSN QEGSHDDGRS AATLINLLDE TVTAHGSRLL KHWLGRPLRD QAVLRHRQQA IGELIERGKI NALRESLRGI NDIERITTRI VMGSARPRDL SGLRDALGVL PALSAQLNQL DLPLWRDLAV RLTDQPAPRE LLNRALVTQP PVWLRDGGVI AAGFDAELDE LRHLSEHADD ALNALEAQAR LQSGIQSLKI AYNRVQGFYF EVSRLQAEKM PPQFIRRQTL KSVERYTTEE LKTFEDRVLS ARDRALAREQ GLFTELLQTL ATHQSALRRM AEAIAEVDVL HSLARVAECQ RWVAPELGSE PGIHIEAGRH PVIEALTKQT LGNQPFTPND CELTPNRQLL MITGPNMGGK STYMRQTALI VLLAHIGAFV PATRARIGPI DRIFTRIGAG DDLASGRSTF MVEMTETAEI LHTATENSLV LIDEIGRGTS TFDGLALAWA VAEHLIRRNR ALTLFATHYF ELTQLTERFD TVRNVHLDAV THKDDLIFLH SVKDGPASQS YGIKVAALAG LPREAIRRAQ ALLKQLEQQH PVGAATPQLD LFAAPEVTDA IEEPEIEPHP LITALEKLDP DILTPKQALD LIYAWRNELK K
|
| |
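The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example and not the method used by the database. It assumes the standard amino acid assignments shared by translation tables 1 and 11, and it assumes that the initiator codons listed for NCBI table 11 (including GTG, with which this gene begins) are read as methionine when they occur at the first position. The sequence fields are displayed in groups of ten letters, so whitespace is removed before processing. All function and constant names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons of translation table 11 (assumption: taken from the NCBI listing).
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS fragment in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 18 bases of the gene sequence shown in the record.
print(translate("GTGACTGAAT CAAAAGATCT AAGCCAACAC"))  # MTESKDLSQH
print(translate("AA TGAACTTAAG AAGTAA"))              # NELKK
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Applying `gc_fraction` to the complete gene sequence should give a value comparable to the GC content field, although that has not been computed here.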