Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lferr_1972 |
Symbol | |
ID | 6877960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | - |
Start bp | 1970145 |
End bp | 1972469 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642789841 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002220396 |
Protein GI | 198284075 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.656834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTGG CCGGGGTTGC TCCTTGGCAG GACAGCTTCT GGCGGGCTCT GGATGCCAGT GATGTGCAGA GCGCGTGGTC ACAGCATTGT GGCCACATCT ATGGCCGCAG ACATGTTGCG GCGACTACCC CCTGGGTGCC GCTTGCGGAA ATCCATGATC GTTTGAATTG GTTGGCCTGG TTGTCGCAGC GGTTTTCCGC CGGTTTTCTT CTTCCCGTTT CTGATCTCCC CGATATCGAT GCTCTGCTGG TTCTGGCACA GCGCCCTGGT GCCGTGCTCA GCGGGAGAGA TTTGCGCGCG GTGCTCGATG TGCTGACGGC GCAGAAGGGC TATGCCGCAG CTTTGCGCGA GACTCGGGAT GCCTTGACCG GGCTGGCAGG CGAGCTTGAT CCCCCGGTGA CCTTGTTGCG CCGTCTCGGC GTGGCGTTGG ATGAGGAGGG TGAGCTCCTG GATACGGCGA GTGCGGATCT CGCCCTCCTG CGCCAGCGCT TGCGAATCAG CCGCAACGAA TTGCAGCGTT TTCTGCAGGG CTTTCTGCGT AATCGAGACT GGCAGGAATA CTGGCAGGAT CAGGTCATCG TGCAGCGTAA TGAGCGTTAC GTACTCCCGC TCAAAGCTAG CCATAAAGGG CGTATCAAGG CGATCGTCCA TGACCGGTCG GCAAGTGGTG AGACTCTTTT TGTGGAGCCG CTGGCGGCCG TCGATCTGAA TAATCAACTG GTCCAGGACC GGCGTGCCGA AATTCAGGAG CAGGAGCGCA TTCTTCGTGC CCTGAGCGCA GCGGTAGGGC TGGAAGTGCC GGGCATCGTC GCCGCCCTGC GGCATATTGG CCGGCTGGAT GCGGTACGGG CGGGACTGGA ACTGGGGGAT GCCTGTGGCG GGATTTTGCC CAAGGTCGAT GCGGCTGCGG CATTTGATCT GCGGGCCCTG CGTCATCCGC TGCTTTGTCT GCGCCATCCG GGGCAGGTCG TGGGTAACGC CCTGCGTCTC GGGGCGGACG CGCAGCAACT GGTCATTACC GGCCCCAATA CCGGCGGCAA GACGGCTATC CTGAAAGCAC TCGGACTCAA TCATCTGATG GCCTACATGG GATTGCCGGT AACTGCGGAA GGTACATTGG GTTATTTCCC GAAGTGCTTT GCGGTCATCG GCGACGCCCA GGATATCCAC ACGGATCTTT CCACTTTTTC GGCGCAGGTA CAGCGTCTTC GGGAAGTGCT GGAGCATGCC GACGCCCACA GTCTCGTGCT TCTCGACGAA CTGGGCAATG GCACCGACCC GCGGGAAGGG GGCGCGCTGG CTCAGGCGGT CGCAGAGGCC TTGCTGGCGG CCGAATGCTG CACACTGCTC ACGAGCCATC TGGAAGTGAT GAAGCGTTAT GCCCTGAGCC ATGCGGGTGT AGCCCTGGCG GGTATGGGCT TTGATGCAGA GTCTTTGAAA CCCACCTACC GTCTGCTTTG GGGTGTAGGC GGCGCGAGCC AGGGTCTGGT CATCGCCCGG CGCGTGGGGA TGCCTGCACC ACTGATGAAC CGCGCCGAGG CACTCTATGC CGATGACCGC GAGAACTGGG AACGTTGGGA AGCGCAACGG GAAACCCTAT TGCAGGCCGC CCGGCAGGCT ATGGATGAGG CCGTACTGGC GCGTGACGAA GCCACCAGTG TGGCCCGCAG CCTGGAACGC GAACTGGAAG CAGCGAGGCA GGAGCGTGAC AAAGCCGCCG CCGCCGCCCG CGCCGAATGG GAAGATATAC TGGCAACGGC GCGCCAGCAG GTGCGGCAAG CCATTGCCGC GCTCAAGTCC GGGAGGGATA CCCAGGCGGC AACGGCGGCT TTGCAACGTC TGGAAGTCCC CTTCCGGGCG GAGGAGCAGC AAGTGGACAG CCTGCCCGCG GTGGGGACCA GGGGTCTGTT CCTGCCACTC CGGCAGGTGA CCCAAGTGCT GCGTGCGGAT CCCGCACAAC ACAGGGTGCA GATTCAATTA CGGGGCAAGC AGTTGTGGGT GCCCGCCGCA CAATTTGCCG TGGACGCCGC GCTGCAGATC CCAAAAGAAG CAGGCAGCAC CCAGTATGCC ACCCCCGACG ATCATCCCTG GCGTCTGGAT TTGCGCGGAC AGCTTCGGGA AGACGCCCTG GCGGCGCTGC GTCGTCATGT GGATGGCGCC GTAGCCGCCG GTCGTCGACA GGTCCAGATC CTGCATGGTA AGGGCAACGG GGTGCTCGCA GAAATGGTGC GCGAGTTTGC CGGGCAAGAC CCTCGGGTCA GCCAGTGGCG TATGGCGCGG CCAGAGCATG GCGGTGGCGG CGTCAGCGAG TTGGAGTTAC GCTGA
|
Protein sequence | MAVAGVAPWQ DSFWRALDAS DVQSAWSQHC GHIYGRRHVA ATTPWVPLAE IHDRLNWLAW LSQRFSAGFL LPVSDLPDID ALLVLAQRPG AVLSGRDLRA VLDVLTAQKG YAAALRETRD ALTGLAGELD PPVTLLRRLG VALDEEGELL DTASADLALL RQRLRISRNE LQRFLQGFLR NRDWQEYWQD QVIVQRNERY VLPLKASHKG RIKAIVHDRS ASGETLFVEP LAAVDLNNQL VQDRRAEIQE QERILRALSA AVGLEVPGIV AALRHIGRLD AVRAGLELGD ACGGILPKVD AAAAFDLRAL RHPLLCLRHP GQVVGNALRL GADAQQLVIT GPNTGGKTAI LKALGLNHLM AYMGLPVTAE GTLGYFPKCF AVIGDAQDIH TDLSTFSAQV QRLREVLEHA DAHSLVLLDE LGNGTDPREG GALAQAVAEA LLAAECCTLL TSHLEVMKRY ALSHAGVALA GMGFDAESLK PTYRLLWGVG GASQGLVIAR RVGMPAPLMN RAEALYADDR ENWERWEAQR ETLLQAARQA MDEAVLARDE ATSVARSLER ELEAARQERD KAAAAARAEW EDILATARQQ VRQAIAALKS GRDTQAATAA LQRLEVPFRA EEQQVDSLPA VGTRGLFLPL RQVTQVLRAD PAQHRVQIQL RGKQLWVPAA QFAVDAALQI PKEAGSTQYA TPDDHPWRLD LRGQLREDAL AALRRHVDGA VAAGRRQVQI LHGKGNGVLA EMVREFAGQD PRVSQWRMAR PEHGGGGVSE LELR
|
| |