Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0529 |
Symbol | |
ID | 5732446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 615246 |
End bp | 617075 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641277656 |
Product | hypothetical protein |
Protein accession | YP_001543305 |
Protein GI | 159897058 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.320462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA GCGGGTACGA ATTATCATTT AGTCTTAATA CATTGAATCA CTTAAGCGAA GGTCTTTATA GTAATATTCC TGCGGTTTTA TCTGAATTAG TTGCTAATGC TTGGGATGCC GATGCAACTG AGGTATCTAT TAATATACGA CAAGATGAAA TAGTCATTCA AGATAATGGT ATAGGAATGT CAATTGAGGA TGCTAATACT AAGTTCTTAC GAATTGGGCA TCAAAAGAGA GAAGATTCAG CCAACACTAT AAGTGGACGG CATGTTATGG GCAGGAAAGG GATCGGTATA CTGGCAATAT TTGGTATAGC CAATATTGCT GAGGTTTATT CTTGTAAGGA TGGAGTACCC CATGGATTTA TAATTCGTAA AGGCGATATA GAAAGAGGGA TAAGTAGCGA TGTAACGTTG TATAGGCCAT CACCGGTTCC TCAAGATGAT TTATCTATTG AATCAGGAAC AAAGATTATA TTACGAGAAA TAAAATCTTC TATTGTAAAT GCGGAGAAAA CTCTAAGAAC TGATCTTGCC CGGAGATTTA CAATTATTAA TAATAATTGT AATTTTTCTG TAATTATTAA TAATATACCA ATTAGTGATA ATGATCGTGA TTATTTAAAT AAAGTACAAT TTCTTTGGTA TCTTGGTGAA GAAAGTAGTA AATATGCTGA TTTTTTTACT AAGCTAAAGA AATCATTTGA AATTACGAAT CTAGTAGATG GAATGTCAGG TATAACTGTT AAGGGTTGGA TTGGTACTGT ATATCGTCCA AGTAATATTC CAACGAGCCA CCGTACCATC TCTATTTTTG CTCATGGTAA AATGATTCAA GAAGATATTT TAATAGATAT AACTGATGCG GGTGTATATC GACAATATAT CATTGGTGAA ATTGAAGCAG ATTTTATGGA TAGCGATGAT GAAGATGACA TTATTACCAG TAATAGACAA AGAATAAAGC AAACTGATCC TAGATATCTC AAGCTACTAC AGTATGTCAA GGCAGATATT ATGCGAGTTA TTGCATCAAT GTGGACTAAC TTGCGAAAGG AATACCCATC TAAGCCAAAA AAAGAGGAAG TTAATGATAG TTCATCATCA AAAGATGCCA ATTCTTCTGA ACAAGAGAAT ACAAATGCTA GTAGTGATTC ATCAAACACT ACCGATGCTA GTAGTGATTC ATCAAACACT ACCGATGCTA GTAGTGAGAC GAATGATGGT GATGTGGAAG ATAATTCTTT TTTTGATGAT GATATTCCTG AACCTAGCCC TCCACCTAAA CAAGAAATTA CTACTGCATT TAGAGAGATG AAGAATCTTG TTAAGAATAG TAATATTCCC GATCAAATGA AAAATATTAT TTTATATGAT ATTCAACAAG CAGCCTATGC TTATAAAGGA ACATCATTTA AAGCTTGTAT TGTAATGTTG GGAGCTATTC TAGAAGGTGT TATGCTTGGA ACAATCCAGA GGACGGATGT ACTAGAATAC TTGATTACTT TACAGACAGT ACCAAAGCCA TTAAGTGATT TAGGCCCTAG AAATCCTAAA TTTGCTGATC GTACAGTGCT AGCCCAGTAT ATAGGGACTA CCTTTTCATT TCAGGACTGT AAGGAAATAA TAGAGCTATG TGTACAAGGT ACTAATAAAC TAGGTGTCGA TATACTTCAA ACGGTTAGAA ATTCTATACA TCCAGGTTCA GTATTAAAAG ATATGAAACA ACTAGCAAGG TTCAATCATC AAAGCGCTGT TGGTTACATT GCCAAACTAC ATGAAATTAT TAATTTAGTG ATTCTATGGA ATCCTCCATC TATTCCATAG
|
Protein sequence | MSDSGYELSF SLNTLNHLSE GLYSNIPAVL SELVANAWDA DATEVSINIR QDEIVIQDNG IGMSIEDANT KFLRIGHQKR EDSANTISGR HVMGRKGIGI LAIFGIANIA EVYSCKDGVP HGFIIRKGDI ERGISSDVTL YRPSPVPQDD LSIESGTKII LREIKSSIVN AEKTLRTDLA RRFTIINNNC NFSVIINNIP ISDNDRDYLN KVQFLWYLGE ESSKYADFFT KLKKSFEITN LVDGMSGITV KGWIGTVYRP SNIPTSHRTI SIFAHGKMIQ EDILIDITDA GVYRQYIIGE IEADFMDSDD EDDIITSNRQ RIKQTDPRYL KLLQYVKADI MRVIASMWTN LRKEYPSKPK KEEVNDSSSS KDANSSEQEN TNASSDSSNT TDASSDSSNT TDASSETNDG DVEDNSFFDD DIPEPSPPPK QEITTAFREM KNLVKNSNIP DQMKNIILYD IQQAAYAYKG TSFKACIVML GAILEGVMLG TIQRTDVLEY LITLQTVPKP LSDLGPRNPK FADRTVLAQY IGTTFSFQDC KEIIELCVQG TNKLGVDILQ TVRNSIHPGS VLKDMKQLAR FNHQSAVGYI AKLHEIINLV ILWNPPSIP
|
| |