Gene Haur_0529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0529 
Symbol 
ID5732446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp615246 
End bp617075 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content34% 
IMG OID641277656 
Producthypothetical protein 
Protein accessionYP_001543305 
Protein GI159897058 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.320462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA GCGGGTACGA ATTATCATTT AGTCTTAATA CATTGAATCA CTTAAGCGAA 
GGTCTTTATA GTAATATTCC TGCGGTTTTA TCTGAATTAG TTGCTAATGC TTGGGATGCC
GATGCAACTG AGGTATCTAT TAATATACGA CAAGATGAAA TAGTCATTCA AGATAATGGT
ATAGGAATGT CAATTGAGGA TGCTAATACT AAGTTCTTAC GAATTGGGCA TCAAAAGAGA
GAAGATTCAG CCAACACTAT AAGTGGACGG CATGTTATGG GCAGGAAAGG GATCGGTATA
CTGGCAATAT TTGGTATAGC CAATATTGCT GAGGTTTATT CTTGTAAGGA TGGAGTACCC
CATGGATTTA TAATTCGTAA AGGCGATATA GAAAGAGGGA TAAGTAGCGA TGTAACGTTG
TATAGGCCAT CACCGGTTCC TCAAGATGAT TTATCTATTG AATCAGGAAC AAAGATTATA
TTACGAGAAA TAAAATCTTC TATTGTAAAT GCGGAGAAAA CTCTAAGAAC TGATCTTGCC
CGGAGATTTA CAATTATTAA TAATAATTGT AATTTTTCTG TAATTATTAA TAATATACCA
ATTAGTGATA ATGATCGTGA TTATTTAAAT AAAGTACAAT TTCTTTGGTA TCTTGGTGAA
GAAAGTAGTA AATATGCTGA TTTTTTTACT AAGCTAAAGA AATCATTTGA AATTACGAAT
CTAGTAGATG GAATGTCAGG TATAACTGTT AAGGGTTGGA TTGGTACTGT ATATCGTCCA
AGTAATATTC CAACGAGCCA CCGTACCATC TCTATTTTTG CTCATGGTAA AATGATTCAA
GAAGATATTT TAATAGATAT AACTGATGCG GGTGTATATC GACAATATAT CATTGGTGAA
ATTGAAGCAG ATTTTATGGA TAGCGATGAT GAAGATGACA TTATTACCAG TAATAGACAA
AGAATAAAGC AAACTGATCC TAGATATCTC AAGCTACTAC AGTATGTCAA GGCAGATATT
ATGCGAGTTA TTGCATCAAT GTGGACTAAC TTGCGAAAGG AATACCCATC TAAGCCAAAA
AAAGAGGAAG TTAATGATAG TTCATCATCA AAAGATGCCA ATTCTTCTGA ACAAGAGAAT
ACAAATGCTA GTAGTGATTC ATCAAACACT ACCGATGCTA GTAGTGATTC ATCAAACACT
ACCGATGCTA GTAGTGAGAC GAATGATGGT GATGTGGAAG ATAATTCTTT TTTTGATGAT
GATATTCCTG AACCTAGCCC TCCACCTAAA CAAGAAATTA CTACTGCATT TAGAGAGATG
AAGAATCTTG TTAAGAATAG TAATATTCCC GATCAAATGA AAAATATTAT TTTATATGAT
ATTCAACAAG CAGCCTATGC TTATAAAGGA ACATCATTTA AAGCTTGTAT TGTAATGTTG
GGAGCTATTC TAGAAGGTGT TATGCTTGGA ACAATCCAGA GGACGGATGT ACTAGAATAC
TTGATTACTT TACAGACAGT ACCAAAGCCA TTAAGTGATT TAGGCCCTAG AAATCCTAAA
TTTGCTGATC GTACAGTGCT AGCCCAGTAT ATAGGGACTA CCTTTTCATT TCAGGACTGT
AAGGAAATAA TAGAGCTATG TGTACAAGGT ACTAATAAAC TAGGTGTCGA TATACTTCAA
ACGGTTAGAA ATTCTATACA TCCAGGTTCA GTATTAAAAG ATATGAAACA ACTAGCAAGG
TTCAATCATC AAAGCGCTGT TGGTTACATT GCCAAACTAC ATGAAATTAT TAATTTAGTG
ATTCTATGGA ATCCTCCATC TATTCCATAG
 
Protein sequence
MSDSGYELSF SLNTLNHLSE GLYSNIPAVL SELVANAWDA DATEVSINIR QDEIVIQDNG 
IGMSIEDANT KFLRIGHQKR EDSANTISGR HVMGRKGIGI LAIFGIANIA EVYSCKDGVP
HGFIIRKGDI ERGISSDVTL YRPSPVPQDD LSIESGTKII LREIKSSIVN AEKTLRTDLA
RRFTIINNNC NFSVIINNIP ISDNDRDYLN KVQFLWYLGE ESSKYADFFT KLKKSFEITN
LVDGMSGITV KGWIGTVYRP SNIPTSHRTI SIFAHGKMIQ EDILIDITDA GVYRQYIIGE
IEADFMDSDD EDDIITSNRQ RIKQTDPRYL KLLQYVKADI MRVIASMWTN LRKEYPSKPK
KEEVNDSSSS KDANSSEQEN TNASSDSSNT TDASSDSSNT TDASSETNDG DVEDNSFFDD
DIPEPSPPPK QEITTAFREM KNLVKNSNIP DQMKNIILYD IQQAAYAYKG TSFKACIVML
GAILEGVMLG TIQRTDVLEY LITLQTVPKP LSDLGPRNPK FADRTVLAQY IGTTFSFQDC
KEIIELCVQG TNKLGVDILQ TVRNSIHPGS VLKDMKQLAR FNHQSAVGYI AKLHEIINLV
ILWNPPSIP