Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2015 |
Symbol | |
ID | 8535174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2159921 |
End bp | 2161774 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646384397 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_003263884 |
Protein GI | 261856601 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCATA TCGCTCAATT ACCGCCGGAA TTGGTCAACC AGATCGCTGC TGGTGAAGTG GTCGAGCGCC CGGCTTCCGT TGTCAAGGAG TTGGTTGAGA ACGCCATTGA TGCGGGTGCC CGGCGAATCG ACGTCCGTAT CGAAGAAGCC GGTTCTCGGC TGATTGAAGT GCGCGATGAT GGTAGCGGCA TGTCCGAGGA GGACTTGCCC TTGGCCTTCG CCGCCCACGC CACCAGTAAA ATCCGATCGC TCGATGAACT CGAATCGGTG GCGACCATGG GCTTTCGGGG TGAAGCACTG GCGAGTATCG CCTCGATCGC GCAAGTCAGC GTGTCTTCTC GTCGCGAGCG CGATGCACAT GGTTGGATTC TAAGCCCCAA TGAATCGCTG GAATGTCGCC CGACCGCTCA TCCGGTCGGA ACCACGGTCA CAGTGGCTGA TTTGTTCTAC AACACCCCGG CGCGCCGTAA ATTCCTGCGC ACCGAGCGTA CCGAATTTTC CCAGATCGAC CAGCTCATGC GTCGATTCGC GCTGGCCCAT CCCGGAATCG GGTTTTCACT GACGCATCAG GGGCGGCTCG TATTCGAACT CTCGCCACTT CCTCGCGCAC AGTTGGCGGA GCAGATGCCC GCACGTATCG AGGCATTGCT CGGCGCGGAG TTTCTCAACC GAGCTCGACG GATTGATTCC GCGGCATCGG GCTTGAGTCT GAGCGGCTGG GTGGCCGACC CGGCCTATGC CCGTTCGACA ACCGATCAGC AGTTGTTTTT TGTCAATGGC CGTATCGTCC GCGATCGGCT GATTGGCTTC GCCATCCGAC GCGCTTACGC CGACGTGCTC CACCACGCTC GGCAGCCGGC TTATGTGCTG GCGCTCGATC TCGAACCGCG CGCGGTCGAT GTCAATGTTC ATCCGACCAA GGCGGAAGTT CGTTTCCGTG ATGCACGCGC CGTGCAGGAT TTTATTTTCC GCGAAATCCA TCGGGCGCTG GCTCAGGGTG CCGTGGCGGG AGCGCAGCAA GAATCGGGTG TCGAGCAGGC GAGTCGTGGC GCTCCCGCGT TCGATGCGGG AATGGGGCGA GGCGAGCGGG TGGATGGCTG GTCAGGCACG GCATCAATGC CTTCTTCTGC GCCGTCTTCT TCTGTTCAGG GGTATTTGAA CCTGCTCGCC GCGAGCGCCG CAGCGCCGCA ATTGGTTGAT GCGTGCCTGT CATCGGTTCA AGAGCCTTCT CAGGCACAGC AACCCACACA GGACATGCCG CCCTTGGGCT ATGCAGTGGC GCATCTGCAC GGTGTGTACA TTCTGGCGCA GAACGCGCAG GGGCTGGTGA TCGTCGATGC CCATGCCGCG GCGGAGCGCA TCACCTACGA GCGATTGAAA GCGGCCTATG CCGCCCAGCA GATGACCATC CAGCCATTAT TGCTGCCGGT GACATTTGCC GTGACGGAGT CCGAAGCGGA GCAATATGAA GCGCAGGCCG AGGCTTTCGC CCGTTTGGGC GTCGTGCTGG CACGGGTGGC GCCGACCCGC GTGCGCGTGA CCGCCCTGGC TGCCCTGTTG CGTCATGCCG ATGCCGAAGC GCTGGCCCGT GCACTTTTGT CTGCGTTAAG CGAAGAGGAA CCGGACTGGG GGCAACCTCA GGCCGTGCTG ACCGAGCCGA TCAATGCCGT GTTGTCGCGC ATGGCCTGCC ACGGTTCAGT GCGCGCCAAT CGCATCCTGG CCCGCCCGGA AATGGATGCC TTGCTGCGCG ACATGGAGCG CACCGAGCGG GCCGACCAGT GCAATCATGG CCGACCCACT TGGCGACAGC TTTCGATGAC CGATCTCGAT CGACTGTTCA TGCGGGGGCA GTAA
|
Protein sequence | MPHIAQLPPE LVNQIAAGEV VERPASVVKE LVENAIDAGA RRIDVRIEEA GSRLIEVRDD GSGMSEEDLP LAFAAHATSK IRSLDELESV ATMGFRGEAL ASIASIAQVS VSSRRERDAH GWILSPNESL ECRPTAHPVG TTVTVADLFY NTPARRKFLR TERTEFSQID QLMRRFALAH PGIGFSLTHQ GRLVFELSPL PRAQLAEQMP ARIEALLGAE FLNRARRIDS AASGLSLSGW VADPAYARST TDQQLFFVNG RIVRDRLIGF AIRRAYADVL HHARQPAYVL ALDLEPRAVD VNVHPTKAEV RFRDARAVQD FIFREIHRAL AQGAVAGAQQ ESGVEQASRG APAFDAGMGR GERVDGWSGT ASMPSSAPSS SVQGYLNLLA ASAAAPQLVD ACLSSVQEPS QAQQPTQDMP PLGYAVAHLH GVYILAQNAQ GLVIVDAHAA AERITYERLK AAYAAQQMTI QPLLLPVTFA VTESEAEQYE AQAEAFARLG VVLARVAPTR VRVTALAALL RHADAEALAR ALLSALSEEE PDWGQPQAVL TEPINAVLSR MACHGSVRAN RILARPEMDA LLRDMERTER ADQCNHGRPT WRQLSMTDLD RLFMRGQ
|
| |