Gene Haur_3182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3182 
Symbol 
ID5735057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4025436 
End bp4027292 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content53% 
IMG OID641280328 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_001545947 
Protein GI159899700 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTCC CAAATCAAGC CACCCCAATC GAGACCTATC GCCAACGCAG TGCTAGCTAT 
CAACTTGAAT TAGAGCAGCT TCGCCAACGC TCAGATCGCT TTGCCAATCT CCGTTTGGTG
TTGTTTGGCG CAGCGCTGTT TGCGGTGATT TTTGGCTTTG CCAATGCACC ATGGTGGTTT
GCAGTCGCAG TTGGCTTGCT AATTGGCTTT GTTTGGGTCT TGATTCAGCA TAATACGGTT
GAAGAACAGC GCCAACGCAC TCAAGCGCTG TATCAATTAA ATCACTATGG CGTGCAACGC
TTGGAGCGCG ATTGGGTCAA TTTGCCCTTG CGTGTGCCAA GCCTCGACAT TCCTGCCGAT
TCGTATGCTA ACGATTTGGA TTTGGTGGGC CGCGCCTCGT TGTTGCACTT ACTTGGCCAT
TTACCAACCG CCTTGGGCTT GGATACGCTG CTGAGTTGGC TTTTGGCTCC AGCAAATCCC
AGCACAATTA CATCGCGTCA ACAGGCGATC GCCGAACTAG CCCAAAACGT GCAATTGCGC
GAAGAGTTGC ATGTGGCAGC CCATCATGTT ACCGCTGAAC GTGCTGATTT CGAGCAATTT
CTGGTTTGGA CAGAACAGCC AGCGCAGGTA CTCAACCGGA TGTGGGTGGT TTGGTTAGCG
CGGTTGTTGC CAATCGGGCC AATCGCGGGC TTAGTGGCCT ATTTGGCTGG CTGGACGAAT
GTGCCTTATT GGTTGGTGTT GTTTATTCTC AATATCATCG TCAGCAATGG CTTAGGCTTA
TGGGCCAATC AGGTGATTAT GGCGGTCAGT GCACGGCGCA AAGGCTTTGC TGGCTTTGGC
GCATTGTTTC GGGCGATCAA CGAGCCGCAA TTTCAAGCCG CTTTGTTGCG CCAAGCCCAA
AGCCAATTAA CTGCCAATGG GGTTCGCGCC GATCATCAGA TGGCGCGTTT GAGTCGCTTG
TTGAGCCTTG CTGATCAGCG ATTTAGCTAT TTTTATATTG TGCTCCAAGG CTTTAGTTTG
TGGAATATTC ACATTGGATG GTTGCTTGAG CGTTGGCAGC GCGATGTGCG TGGTCAAGCC
CGCGCTTGGC TCACAACCTT GGGCGAAATT GAAGCGCTTG CCGCCCTCGC CACCTTACAA
GCCGACCACC CAAATTGGAC TCAAGCCCAA CTGCATACCA CTGATCAGGT GATTGCTAGC
GGTTTGGCCC ATCCCTTGTT GCGGCCAGAT AAAGCGGTGG CCAACGATTT GAGCATCGGG
CCAGCAGGCA CACTCTTTTT GATTACTGGC TCGAATATGG CAGGCAAAAG CACCTTGATG
CGAGCCATCG GCCTCAATAT TGTGCTCGCC CAAACTGGCA GTGTGGTGTG TGCCAGCAAT
TTGCAATTGC CAGTGCTGCG GCTTGCCACC AGCATGCGCA TCAACGATTC GCTAGAACAA
GGCGTTTCGT ACTACATGGC CGAATTGTTG CGGCTCAAAA TGGTGTTGGG CGAGGTTGAG
CAAGCGCGGG CAGCAGGCCA ACCAGCCTTA TTCTTGTTGG ATGAGATTTT GCATGGCACC
AACACGCGTG AACGTACCGT TGCTGCACGC CATATCATTG CCCGCTTAAT CGGTTTAGGT
GCGATCGGCG CAGTTTCAAC TCACGATTTG TCGCTGGCAA CCGCGCCCGA TATTGCCGCC
ATCAGCCAAC CATGGTATTT GACCGAACAC TTCGAGCGCG GCGATAATGG CCCAAGCATG
ACCTTCGATT ATCAATTGCG AGCTGGTTTA GCGCCGAGTA CTAACGCCCT CAAGCTGATG
GAAATCGTTG GTCTAGTCGA TCAGGATGTG CCGTTGAGCG TTGCCGCCAA TAGTTAG
 
Protein sequence
MSLPNQATPI ETYRQRSASY QLELEQLRQR SDRFANLRLV LFGAALFAVI FGFANAPWWF 
AVAVGLLIGF VWVLIQHNTV EEQRQRTQAL YQLNHYGVQR LERDWVNLPL RVPSLDIPAD
SYANDLDLVG RASLLHLLGH LPTALGLDTL LSWLLAPANP STITSRQQAI AELAQNVQLR
EELHVAAHHV TAERADFEQF LVWTEQPAQV LNRMWVVWLA RLLPIGPIAG LVAYLAGWTN
VPYWLVLFIL NIIVSNGLGL WANQVIMAVS ARRKGFAGFG ALFRAINEPQ FQAALLRQAQ
SQLTANGVRA DHQMARLSRL LSLADQRFSY FYIVLQGFSL WNIHIGWLLE RWQRDVRGQA
RAWLTTLGEI EALAALATLQ ADHPNWTQAQ LHTTDQVIAS GLAHPLLRPD KAVANDLSIG
PAGTLFLITG SNMAGKSTLM RAIGLNIVLA QTGSVVCASN LQLPVLRLAT SMRINDSLEQ
GVSYYMAELL RLKMVLGEVE QARAAGQPAL FLLDEILHGT NTRERTVAAR HIIARLIGLG
AIGAVSTHDL SLATAPDIAA ISQPWYLTEH FERGDNGPSM TFDYQLRAGL APSTNALKLM
EIVGLVDQDV PLSVAANS