Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4398 |
Symbol | |
ID | 5736248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5619026 |
End bp | 5622307 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641281560 |
Product | M protein-like MukB domain-containing protein |
Protein accession | YP_001547158 |
Protein GI | 159900911 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAGC TTAGTCGCAT TTTTCTGTAT CATTGGCATC GCTTCGATTG GCATGTGTTG GATGTGCAGG ATAGTTTGTA TTTGGCGGGC CACAACGGTT CAGGCAAATC GTCGATTCTT GATGCCTTGC AATTGGTGCT GGTGGCCGAT TTGGGTCGCG TGCGTTTCAA CAGCGCGGCC CAAGATCGCT CGCAACGCTC GTTGGATAGC TATGTGCGCG GCAAAATTGG CGAGCAACAT TGGCTGCGGC CTGGCGACAC GATTGGTTAT GTGGTGCTGG AGTGGACTGA AGATCTGAAG CATCAAGCCT TTGTTAGCGG GGTTTGCTTG GAAGCTCGTG CCGCGACCCA AAGCGTTGAA AAAACCTTTT TTATCTTTGA TGGCCACTTA AATCCTGATG TATTTATTGA GCAAGGCAAA CCACGCACCC GCCGCGAAAT CAAAGCGATG GCGCGTAATC GCTCAACTGC CGAAAGTTTC GATCAAACTG GCGAATATCA AGCGGCGTTG CTCAATCGGT TGGGCGGCTT GAATCAACGC TTTTTCGATT TATTTCTGCG TGCCTTGACC TTCCAGCCAA TTCGTAATAT TCGTGAGTTT GTTGAGCAAT GGCTGCTGGA GCCAAATTCG TTGGATGTTC ACACCTTGCA AAATGTGGTT GAACGCTTAC GCGATTTGGA GCGAAAAGCT AAAGATGTTG AAGAACAATT AAAAAGCCTC GGTGCTATTA TCAAAACCCA AGAAGAAGCC AAGCGCTTGC GTGGCCTGCA TGGGAAATTT GAGATTCTGG CGGCTCACTA TAGCGTGGCG GCAATTCAGC AACAAATTAA TCAACAGCAA CAACGATTAA TCGATACTCG CCGCGATTAC GAATTGAATC AGACTGATTT GGAACAACAA CTTGGCTTAC AGCAAAATGC CCAAGCTGCC TTAGTTGAAG CCCAAGTTCA ATTAAGCCAA TCTGATGTTG TGCGGCGTAA AAACCAACTT GCCGCTGAAA TTCAGCAATT GCAGCAACAC ATTCAAAAAA TTAACCAACG TTGGCGACAA CTACAAGCCA AATTAATCCA AATTGCGGCA TTATTGAATA AATTACAGCC AATTGCTGAG CAAGATTTGG TTAATTTATT GCAGCAGCCT GAGCACATTC CCCAATCACA GCAAATTGTG CCTTTATTAC AACAACAGAG CACGGCGGCT GGCTCTGCTT TGAAATTGCA ACTCCAGGCG ATGACCAGAA CCGACGATAA AATCTCGCGC TTATTTGAAC AAGAAAGCCG CCTAAAACAA GAAATTACCC AGCTTGAGCA ATCGAATCGC CAAAATCATT ATCCCCAAAA TGTGGAAAAT GTGCGTAAGT ACTTAAAATC AGCATTAAAT ATTGAACTCT TTTTGCTTTG TGAGCTGTTA GAAATTCCTG ATGAACATTG GCAAAATGCG GTTGAGGCAA TGCTTGGTCA ACGCCGTTTC AATATTATTG TCGAACCTAA ATATTATAGC CAAGCCCTTG ATTTGCTTGA TCAGTTACGC GAAAAAGAGC GGATCTATGA TGTTGGTTTG GTTGATTTAC AACAAGCTAG TGCAGAAGCT CGCCCAGCAC GACCATTATC ATTGGCGACC AAAGTCAAAG CCAAAACTGG CTTGATTGAT AAATATATTG CGGCAATTTT AGGTGATATT ATTACTTGTC AGCATGTGCA AGATTTGCGT CAACATCGCC GTGCGATTAC CGCTGAAGTG ATGGTTTATC AAGAATGGAC AGCGCGTTCG ATTGATCCAC GGAAATTTCA GCCTTGGTTT ATCGGTGAAC GCGCCCAACG CTCGCAAATT GAAAGCCGCC GCCAGCAATT AACCGAAATC CAAGCCGAAT TGGCGATATT ACGCCCAGAA CAACAACGCC AGCGCCAATA TTATGATCAA TTGGAAAAAT TACAACAACT TTTATTAGGC TTAGATTCGT ATTTCGAGGA AGAACTTGAT TCAAGCGAAT TACAAACGAC ACTTGCTGAA TTGCAACGCG AACACGATAG CATTGATACT AGCGGCGTGG CGGCACTTGA GGCCGAAGTT AAACGTTTAC ACATAATTCA TGAAGAATAT GCCTTGAATA TTCGCCGTTT AGAGCGGATT ATTGGCACAC TGAATAATCA AATTACTCAG ATTGAACAAC AACTTCAAGA AAACCAACAT CAATTGTTGA ATGCTCAAAG CCTGTTTGAG CATACGCAGC AACGCTATCC CGACCAAATT GACGATGCAC TCAACGATTT TCAGCGCGAG AACAATGAGC AAGCGCAATT TAGCCGTTTG CAAGAAACCG CCGAGCAACA AGGTCGTGGC TATAATACCC GTTTTAACAA TACCCAAGAA CAATTACGCG AACAAGTTAT ACTCTATAAT CGCGATTTTC GAGCGCGTGA ACTAGCCAAT ATCGAGCAAA CGAGCTTCCA AGAGAAACGC GCTGAGCTTG AGGCTACTGA TTTACCGCAT TTTGTACAGA AAATTGCTGA AGCCAAACAT GAAACTGAAA CCGAGCTACG CGAACATGTT CTGCATAAAT TGCGTGAAAA TATTGGTCGC GCCAAAGCAG AGCTTGATCG GATTAATGAT GCTTTGCAAG GCTTAGATTT CAATGGCCAG CGTTATCATT TTCGCTATGA AATTGCCGAT TCATTGCGTG ATTTTTATCA ATTGATTGAG CAATCAGCCA ATGTTACTAG CGAGATGATT CAGGATAGTG AATTTTATCA GCAGCACAAA GCCACGTTTG ATCGTTTCTT CCAATTGTTA ATTCAGCCTG GCCTAACCGA CCAAGAAAAG CTTGAGCAGG CTCAAATTAC CGATTATCGG CGCTACTTGA GCTATGATAT TGATGTGATT GAGCGTGATG GTACTCGTTC ACGCTTGAGC AAAATTATGG GTCAAACATC TGGCGGCGAA ACTCAAACTC CGTTTTATGT GACGATTGCG GCCTCGTTTG TGCAGCTTTA TAAAATTAAT GAACGTAGCA AACGCTCAAC CATTCGGATT GTGGCCTTCG ATGAAGCCTT TTCCAAAATG GATCAAGATC GGATTGGCTC GACGCTTGAT CTATTTCACC ATTTTGGTTT ACAAATTATC ACGGCCACGC CGATTGAGCG TTGCGAATAT TTAGTGCCCA AAATGTGTAC AACCTTGGTT TTGACTGGCT TGAAAGATCG CCAGCAGCGG GTTTTGGTCG AGCCATATCG TAATTATGCA GCTCGTCTAG AGGCGCTGAA TGCTGAATCT GAACAAAACT AG
|
Protein sequence | MIKLSRIFLY HWHRFDWHVL DVQDSLYLAG HNGSGKSSIL DALQLVLVAD LGRVRFNSAA QDRSQRSLDS YVRGKIGEQH WLRPGDTIGY VVLEWTEDLK HQAFVSGVCL EARAATQSVE KTFFIFDGHL NPDVFIEQGK PRTRREIKAM ARNRSTAESF DQTGEYQAAL LNRLGGLNQR FFDLFLRALT FQPIRNIREF VEQWLLEPNS LDVHTLQNVV ERLRDLERKA KDVEEQLKSL GAIIKTQEEA KRLRGLHGKF EILAAHYSVA AIQQQINQQQ QRLIDTRRDY ELNQTDLEQQ LGLQQNAQAA LVEAQVQLSQ SDVVRRKNQL AAEIQQLQQH IQKINQRWRQ LQAKLIQIAA LLNKLQPIAE QDLVNLLQQP EHIPQSQQIV PLLQQQSTAA GSALKLQLQA MTRTDDKISR LFEQESRLKQ EITQLEQSNR QNHYPQNVEN VRKYLKSALN IELFLLCELL EIPDEHWQNA VEAMLGQRRF NIIVEPKYYS QALDLLDQLR EKERIYDVGL VDLQQASAEA RPARPLSLAT KVKAKTGLID KYIAAILGDI ITCQHVQDLR QHRRAITAEV MVYQEWTARS IDPRKFQPWF IGERAQRSQI ESRRQQLTEI QAELAILRPE QQRQRQYYDQ LEKLQQLLLG LDSYFEEELD SSELQTTLAE LQREHDSIDT SGVAALEAEV KRLHIIHEEY ALNIRRLERI IGTLNNQITQ IEQQLQENQH QLLNAQSLFE HTQQRYPDQI DDALNDFQRE NNEQAQFSRL QETAEQQGRG YNTRFNNTQE QLREQVILYN RDFRARELAN IEQTSFQEKR AELEATDLPH FVQKIAEAKH ETETELREHV LHKLRENIGR AKAELDRIND ALQGLDFNGQ RYHFRYEIAD SLRDFYQLIE QSANVTSEMI QDSEFYQQHK ATFDRFFQLL IQPGLTDQEK LEQAQITDYR RYLSYDIDVI ERDGTRSRLS KIMGQTSGGE TQTPFYVTIA ASFVQLYKIN ERSKRSTIRI VAFDEAFSKM DQDRIGSTLD LFHHFGLQII TATPIERCEY LVPKMCTTLV LTGLKDRQQR VLVEPYRNYA ARLEALNAES EQN
|
| |