Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3182 |
Symbol | |
ID | 5735057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4025436 |
End bp | 4027292 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280328 |
Product | DNA mismatch repair protein MutS domain-containing protein |
Protein accession | YP_001545947 |
Protein GI | 159899700 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTCC CAAATCAAGC CACCCCAATC GAGACCTATC GCCAACGCAG TGCTAGCTAT CAACTTGAAT TAGAGCAGCT TCGCCAACGC TCAGATCGCT TTGCCAATCT CCGTTTGGTG TTGTTTGGCG CAGCGCTGTT TGCGGTGATT TTTGGCTTTG CCAATGCACC ATGGTGGTTT GCAGTCGCAG TTGGCTTGCT AATTGGCTTT GTTTGGGTCT TGATTCAGCA TAATACGGTT GAAGAACAGC GCCAACGCAC TCAAGCGCTG TATCAATTAA ATCACTATGG CGTGCAACGC TTGGAGCGCG ATTGGGTCAA TTTGCCCTTG CGTGTGCCAA GCCTCGACAT TCCTGCCGAT TCGTATGCTA ACGATTTGGA TTTGGTGGGC CGCGCCTCGT TGTTGCACTT ACTTGGCCAT TTACCAACCG CCTTGGGCTT GGATACGCTG CTGAGTTGGC TTTTGGCTCC AGCAAATCCC AGCACAATTA CATCGCGTCA ACAGGCGATC GCCGAACTAG CCCAAAACGT GCAATTGCGC GAAGAGTTGC ATGTGGCAGC CCATCATGTT ACCGCTGAAC GTGCTGATTT CGAGCAATTT CTGGTTTGGA CAGAACAGCC AGCGCAGGTA CTCAACCGGA TGTGGGTGGT TTGGTTAGCG CGGTTGTTGC CAATCGGGCC AATCGCGGGC TTAGTGGCCT ATTTGGCTGG CTGGACGAAT GTGCCTTATT GGTTGGTGTT GTTTATTCTC AATATCATCG TCAGCAATGG CTTAGGCTTA TGGGCCAATC AGGTGATTAT GGCGGTCAGT GCACGGCGCA AAGGCTTTGC TGGCTTTGGC GCATTGTTTC GGGCGATCAA CGAGCCGCAA TTTCAAGCCG CTTTGTTGCG CCAAGCCCAA AGCCAATTAA CTGCCAATGG GGTTCGCGCC GATCATCAGA TGGCGCGTTT GAGTCGCTTG TTGAGCCTTG CTGATCAGCG ATTTAGCTAT TTTTATATTG TGCTCCAAGG CTTTAGTTTG TGGAATATTC ACATTGGATG GTTGCTTGAG CGTTGGCAGC GCGATGTGCG TGGTCAAGCC CGCGCTTGGC TCACAACCTT GGGCGAAATT GAAGCGCTTG CCGCCCTCGC CACCTTACAA GCCGACCACC CAAATTGGAC TCAAGCCCAA CTGCATACCA CTGATCAGGT GATTGCTAGC GGTTTGGCCC ATCCCTTGTT GCGGCCAGAT AAAGCGGTGG CCAACGATTT GAGCATCGGG CCAGCAGGCA CACTCTTTTT GATTACTGGC TCGAATATGG CAGGCAAAAG CACCTTGATG CGAGCCATCG GCCTCAATAT TGTGCTCGCC CAAACTGGCA GTGTGGTGTG TGCCAGCAAT TTGCAATTGC CAGTGCTGCG GCTTGCCACC AGCATGCGCA TCAACGATTC GCTAGAACAA GGCGTTTCGT ACTACATGGC CGAATTGTTG CGGCTCAAAA TGGTGTTGGG CGAGGTTGAG CAAGCGCGGG CAGCAGGCCA ACCAGCCTTA TTCTTGTTGG ATGAGATTTT GCATGGCACC AACACGCGTG AACGTACCGT TGCTGCACGC CATATCATTG CCCGCTTAAT CGGTTTAGGT GCGATCGGCG CAGTTTCAAC TCACGATTTG TCGCTGGCAA CCGCGCCCGA TATTGCCGCC ATCAGCCAAC CATGGTATTT GACCGAACAC TTCGAGCGCG GCGATAATGG CCCAAGCATG ACCTTCGATT ATCAATTGCG AGCTGGTTTA GCGCCGAGTA CTAACGCCCT CAAGCTGATG GAAATCGTTG GTCTAGTCGA TCAGGATGTG CCGTTGAGCG TTGCCGCCAA TAGTTAG
|
Protein sequence | MSLPNQATPI ETYRQRSASY QLELEQLRQR SDRFANLRLV LFGAALFAVI FGFANAPWWF AVAVGLLIGF VWVLIQHNTV EEQRQRTQAL YQLNHYGVQR LERDWVNLPL RVPSLDIPAD SYANDLDLVG RASLLHLLGH LPTALGLDTL LSWLLAPANP STITSRQQAI AELAQNVQLR EELHVAAHHV TAERADFEQF LVWTEQPAQV LNRMWVVWLA RLLPIGPIAG LVAYLAGWTN VPYWLVLFIL NIIVSNGLGL WANQVIMAVS ARRKGFAGFG ALFRAINEPQ FQAALLRQAQ SQLTANGVRA DHQMARLSRL LSLADQRFSY FYIVLQGFSL WNIHIGWLLE RWQRDVRGQA RAWLTTLGEI EALAALATLQ ADHPNWTQAQ LHTTDQVIAS GLAHPLLRPD KAVANDLSIG PAGTLFLITG SNMAGKSTLM RAIGLNIVLA QTGSVVCASN LQLPVLRLAT SMRINDSLEQ GVSYYMAELL RLKMVLGEVE QARAAGQPAL FLLDEILHGT NTRERTVAAR HIIARLIGLG AIGAVSTHDL SLATAPDIAA ISQPWYLTEH FERGDNGPSM TFDYQLRAGL APSTNALKLM EIVGLVDQDV PLSVAANS
|
| |