Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2999 |
Symbol | |
ID | 5734871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3787657 |
End bp | 3789552 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280143 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_001545765 |
Protein GI | 159899518 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTC GTGTTTTAGA CCCAACTTTA GCTGCCCAAA TCGCCGCTGG CGAGGTGGTT GAACGCCCAG CCTCGGTCGT CAAAGAATTA ATCGAAAATT CGGTCGATGC TGGCGCTACC GAGATTCGGG TCGAGGCACG CGAGGGCGGC AAACGCGAGT TGCGCATCCA AGATAATGGC TGTGGCATTG CCAGCGATGA GGTTGAAACG GCATTTTTGC GCCATGCCAC CAGTAAAGTA ACCGAAATTG AAGACCTATT TTCGATTCGC ACACTGGGTT TTCGTGGCGA AGCCTTGCCT TCCATTGCCT CAGTTGCTCA AGTAACATGC CTTACCCGCA CTGCCGCCGA TGAAGTTGGT ACTGAATTGC GGATCGCTGG TGGCGAAATT CAGGCTAAAA CGCCGCGTGG CTGCTCGGTC GGCACAACCT TCACGATTCG CAATTTGTTT TATAACACCC CAGCACGGCT CAAATTTATG CGCTCCGATG CCACCGAAAT GAGCCAAATT AGCACAATCG TGACCCAATA TGCCTTGGCC TACCCCAACA TTCGCTGGAC CTTGCTGCTC GATGGCAAGC TGGCCTTACA AACCCCAGGC AATGGGCGTT TGCTCGATGC CTTGATTGAA TTGTATGGGA TTGATGTTGG CCGCGAGATG ATCAGCGTTG ATCGGACCTC GGAAGCCGAA GATGAAACTG TGCGCGTTCA TGGCTTTGTC AGCCAACCTT CGACCTTTCG CGCTGCCCGT TCGTATATGC ACTTATTCGT CAATCAGCGT TGGATCAAGC CGCAAGGCAA TTTGGTCTAT ATGATCGAAG AGGCCTACCA TACCTTATTG ATGAAGGGTC GGCATCCGAT TGTGGCCTTG AATATTGAGC TTGAGCCAGA AGCGGTTGAT GTGAATGTGC ACCCAACCAA GAGCGAGGTC AAATTTCGCA ATCAATCGCA TGTCTATGGC GCATTGACCA AAGCAGTACG CGAAGCCTTG GCTGCTCAAA GCACTATTCG GGCTTGGACA GGCTTTGGAG CCAACGAAAG TGTCAATCGG CGGGTCGAAT TACGCTCGCC CAATGGCGAA CGACGTGGCT CAAGCAATGA TGCACCCTTG TTTGATGATG CACCAGCAGC GCCACGCCCT CAGGTCAATA ATTATCCTGA TGACGATTTT GATTCGACCG TGAATTTGCC GCCAATTGCT AAACAAGCGC GGTTTGAAAC CCCAACCACC AGCCAAACCA GCAGTTTCTT GCCGCCGCAG CAACAAGCTT TTGATCCGGC GTATGCACCG AGCATGCCAG CTCCAGGCGA GGCCAAATTG CCGATGTTAC GGGTGGTTGG CCAAGTTAAC GAAACCTATA TTGTGGCCGA AAGCAGCGAT GGTATGTATT TGGTCGATCA GCATGCGGCC CACGAACGGG TGGTGTATGA GCGCTTGATG GCCGAACATC AGGATGTGCC AATTGAACGC CAAACCCTGA TGTTAGCCCA ACCGATTGAA CTACCACCAG CCGTCACCCG TTTGCTCAGC GCTCACTTGG CCGATTTGGA GCAATGGGGC TTTGAGGCCG AGGAATTTGG CGAAGGCACA TTAATGTTGC GAGCTGTGCC AAGTGGCTTG CACGTTGGCC AAATTGCCAC CGCCTTGATG GAAATCGCTG ATCATTTGAG TTATGAAGGC GGGGCCACTA GCGACGATCG GCGTGAAAAA ATGTTGACCA CGATCGCCTG CCATAGCTCA ATTCGCGCGG GCAAAACCCT GACCCACGAA GAGATGCGCC AGCTTTTGCA ACAACTTGAG CGCTGCGAAA TGCCGCGCAC CTGCCCGCAT GGCCGCCCAA CCATGCTCCA AATTACGCAA GGCCAAATCG AACGCCAATT TGGGCGCAAA GGCTAG
|
Protein sequence | MPIRVLDPTL AAQIAAGEVV ERPASVVKEL IENSVDAGAT EIRVEAREGG KRELRIQDNG CGIASDEVET AFLRHATSKV TEIEDLFSIR TLGFRGEALP SIASVAQVTC LTRTAADEVG TELRIAGGEI QAKTPRGCSV GTTFTIRNLF YNTPARLKFM RSDATEMSQI STIVTQYALA YPNIRWTLLL DGKLALQTPG NGRLLDALIE LYGIDVGREM ISVDRTSEAE DETVRVHGFV SQPSTFRAAR SYMHLFVNQR WIKPQGNLVY MIEEAYHTLL MKGRHPIVAL NIELEPEAVD VNVHPTKSEV KFRNQSHVYG ALTKAVREAL AAQSTIRAWT GFGANESVNR RVELRSPNGE RRGSSNDAPL FDDAPAAPRP QVNNYPDDDF DSTVNLPPIA KQARFETPTT SQTSSFLPPQ QQAFDPAYAP SMPAPGEAKL PMLRVVGQVN ETYIVAESSD GMYLVDQHAA HERVVYERLM AEHQDVPIER QTLMLAQPIE LPPAVTRLLS AHLADLEQWG FEAEEFGEGT LMLRAVPSGL HVGQIATALM EIADHLSYEG GATSDDRREK MLTTIACHSS IRAGKTLTHE EMRQLLQQLE RCEMPRTCPH GRPTMLQITQ GQIERQFGRK G
|
| |