Gene Haur_4891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4891 
Symbol 
ID5736726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6226670 
End bp6229090 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content52% 
IMG OID641282057 
ProductMutS2 family protein 
Protein accessionYP_001547649 
Protein GI159901402 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATCAG AAGTTGTACT CCAAACCCTT GAGTTCGATA AAGTTCGTGA TCAGCTTGCA 
CGTCATGCTG CGTTTAGCGC AAGCCGTGAG TTGGTTGCGC AACTCCATCC ATCAACCGAT
GGGCAGTGGA TTCTGCAGGC GCAAATTCGT ACTAGTGCCG CCCGTGCTTT AATTGAATCG
TTTGCCGATG TCTCAATTGG CGGTGCTCGT GATGTTCGCC CTGCGGTGGA ACATGCTCGT
CGTGGTGGGA TTCTTGAGGC GAGTCGGGTA CAAGAAATTG CCGCTACCTT GGGGGCGATG
CGGCGGTTGC GTGGCCAAGT TTTGCGCAAT CATCCCGATT TTGTGCCATT ACACCCTTTA
GCCGAGCAAC TGCCAAATTT GGCCACGCTG GAACATGAAA TTGAACGCAC AATCGGCCCT
GATGGCGAGG TATTAGATAG TGCTTCGGCT GAATTGGGCC GCTTACGTAG CGCCATTCGG
GTGGCGTTTA ATCGGTTGCA AGAGCGTTTG CAAGCAATTA TCAATTCGTC GCAATATGCC
GATGTGCTGC AAGAGCCAAT TATCACGGTG CGCGATGGCC GCTATGTCGT GCCAGTCAAA
GCCCCACAAC GACGGGCCTT GCGCGGGATT GTCCACGATC AATCATCGTC GGGCGCAACC
CTGTATATCG AGCCATTGGC TACGGTTGAG TTAAATAACC AGTGGCGCGA GCTGCAATTG
GCCGAACGCG AAGAAATCCA GCGCATTTTG GCGGCGCTCT CGGGCAAAAT TGCCAACGAA
GGTATGCCAA TTATTGTTGG GGTTGAGGCT ACCGCTGAAT TAGATTTAGC CTTTGCCAAA
GCCAAATATA GCATTAGCTT GCGTGCTAGC CAACCTGCGA TCAATACGCC AGTTCCTGCC
GATGATTTGC ACCCCGAATC AACTTTGTCA TTGCTCAAAG CCCGCCATCC CTTGCTCAAC
CAAGATCTGG TTGTGCCAAC CGATGTCTGG CTGGGCGGCC CAACGCAGAT GATTATTATC
ACTGGCCCGA ATACTGGTGG TAAAACGGTG GCGCTCAAAA CCGTTGGTTT GATGGCATTA
ATGGCCCAGG CGGGTCTGCA TATTCCGGCC CATCAAGGCT CGCGCTTACC TATTTTTGGT
AAAATTTTTG CTGATATCGG CGATGAACAA AGTATCGAAC AAAGCCTCTC GACCTTCTCC
TCCCACATGA CCAACATTAT CCAAATCTTG GATCGGGTAA CCCCCGATTC GTTGGTGTTG
TTTGATGAAT TGGGCGCTGG TACTGATCCA GTCGAGGGTG CTGCTTTGGC GCGAGCAATT
ATCGAACGCT TGTTGAATGT GGGATGTTTG GCGATGGCAA CCAGCCACTA TGCTGAACTC
AAGGCTTTTG CCTACAGCAC TGATGGGGTT GAAAATGCCT CGGTTGAGTT TGATGTTGAA
ACCTTATCGC CAACCTATCG ACTTTCAATC GGCTTGCCAG GCCGCTCGAA TGCTTTGGCA
ATTGCTGAAC GCTTGGGGCT TAAACGCGAC TTAATCGAAC GTGCTCGGGC AACGATTAGC
CGCGATAACG TCCAAGTTGA AGATTTGCTG GCCGCGATTC ATCGCGAACG CACAACCGCC
GAAAGTGAGG CTGCCCGCGC CTTGGAATTG CGCGAAGATG CCGAATTGGT GCGCGATCGG
CTGAGCCGCG AATTGTATGA GTTTGAACAG GATCGCGAAC AGCAATTAGC CAGTTACCAA
CGCCAACTTG ATGATGAATT GCGTGAAGTA CGAGCTGAAT TGCGCCGCCT ACGTGATGAA
TTTCGCTCAG TTTCGGTTAG CCGCCAATGG ATGGAACAAG CCGAACAACG CCTCAGCCGA
GTTGCCGAAC GGGTTCCCCA AACTCCAACT CCCCCCAAAG CCAAAGTTCC AGTTGTACCC
AAAGTTGCGC TTGCCCCACT TCCTCGCACA ATTCAAGTTG GCGATCAGGT GTTTGTGAGC
AGCGTGAAGC TTTCGGGCGT GGTGCTCGAT TTGGATGAAG AAGCCAACGA GGCCGAGGTT
CAATTGGGTG GCTTCCGCTT GCGGGTTGAT TTACGCGAGT TGCGGCTGGA AAAAGCGGGC
ACTAGTCCAA CCCAAGCGGT ACAAAAATAT GTACCTGTTC AGCGCATGAT CAATACTCCT
CCACCGCCGA ATGTTTCGAT GCAGCTTGAT ATGCGTGGTT GGCGAGCCTC GGATGTGGAA
AGTCAGCTCG ATCATTATCT CAACGATGCG TACCTCGCCA ATCTTTCAGA AGTGCGTTTG
GTTCATGGCA AGGGTACAGG GGCGCTGCGC CAAGTTGTAC GAACATTGCT CAAACGCCAT
CCCTTGGTCG AATCGTACAA TAGCGGTAGC CAAGGTGATG GCGGCGATGG CGTAACAATC
GCCAAAATGG TTGCTCGTTG A
 
Protein sequence
MISEVVLQTL EFDKVRDQLA RHAAFSASRE LVAQLHPSTD GQWILQAQIR TSAARALIES 
FADVSIGGAR DVRPAVEHAR RGGILEASRV QEIAATLGAM RRLRGQVLRN HPDFVPLHPL
AEQLPNLATL EHEIERTIGP DGEVLDSASA ELGRLRSAIR VAFNRLQERL QAIINSSQYA
DVLQEPIITV RDGRYVVPVK APQRRALRGI VHDQSSSGAT LYIEPLATVE LNNQWRELQL
AEREEIQRIL AALSGKIANE GMPIIVGVEA TAELDLAFAK AKYSISLRAS QPAINTPVPA
DDLHPESTLS LLKARHPLLN QDLVVPTDVW LGGPTQMIII TGPNTGGKTV ALKTVGLMAL
MAQAGLHIPA HQGSRLPIFG KIFADIGDEQ SIEQSLSTFS SHMTNIIQIL DRVTPDSLVL
FDELGAGTDP VEGAALARAI IERLLNVGCL AMATSHYAEL KAFAYSTDGV ENASVEFDVE
TLSPTYRLSI GLPGRSNALA IAERLGLKRD LIERARATIS RDNVQVEDLL AAIHRERTTA
ESEAARALEL REDAELVRDR LSRELYEFEQ DREQQLASYQ RQLDDELREV RAELRRLRDE
FRSVSVSRQW MEQAEQRLSR VAERVPQTPT PPKAKVPVVP KVALAPLPRT IQVGDQVFVS
SVKLSGVVLD LDEEANEAEV QLGGFRLRVD LRELRLEKAG TSPTQAVQKY VPVQRMINTP
PPPNVSMQLD MRGWRASDVE SQLDHYLNDA YLANLSEVRL VHGKGTGALR QVVRTLLKRH
PLVESYNSGS QGDGGDGVTI AKMVAR