Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2132 |
Symbol | |
ID | 5734020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2676715 |
End bp | 2681007 |
Gene Length | 4293 bp |
Protein Length | 1430 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279273 |
Product | hypothetical protein |
Protein accession | YP_001544900 |
Protein GI | 159898653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.285692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAATC AGATTTTGCA GGAATCCGCT AACCAGCGGG AAAGTGCTCG TTTGTTGCAG TTGCGGCTCA CGGGCTTCGT TGGGCGTGAA GCCGAACAGG CCGCGATTCG CGGATTAATC GACCAAACCC GCCCAAGCGG TGGCTACGTG TTGGTGACGG GCGAGGCTGG AGCGGGTAAA AGTAGCCTGA TCGCCCAATT AATCGTGAGC GCTGGGCTGG CCCAAACCCC GCAGCATTTT ATTGCGCTGA CCCCAGGCCG TGCCTATCAA CTCGACCTGC TGCGTAGCAT CGTTGCCCAG CTCATGCTCA AACACGATCT GGTAATCAAC TATTTTCCTG CCGATAGCTA CCCCGCTTTA CGCCTTGAAT TTGGCCAGTT GTTACAAACG CTCTCAGCGC GTGGCATCAG CGAAACGATC TATCTCGATG GCTTGGATCA ATTGCAGCCT GAGGTGGATG GAACCCGCGA TCTCAGTTTT TTGCCCTTAC AGCTGCCGTC TGGCATCGTG ATGGTGCTTG GCTCACGACC TAACGAGACG ATCGCTAGTT TAGCGCTTGA ACACGGGGTC GTTTATCAGG TTCCACCGTT GCGCGAGCAG GATGCGGTTG GGCGTTGGCA GCAGGTCCAG CCGACGTTGG AGCCAGTGAG GTTGCATGGT TTAGCGCAAG CCGTCAAGGG CAATGCCTTG TTGGTCGAAC TAGCGGCCAA TGTACTGCGC CACACCACAT CAGCGGAATT GCTGCCATTG CTCGACCACG CCAGCGCTGA CGCGACCAAT CTCTTCCGGC TGAGCCTTGG ACGGATCGAA CAAGCAGCAC CGCACCACTG GCAATCGCTG ATTCGCCCAT TGTTGGCAGT GTTATTGGTG ACCCAAGAAT CGCTTGAACT AGCGGTGCTG GCGGCGATTC TCGAACAACC AACTGCTACA GTGGGCGAGG CGCTGGCCCT GATGAGCGAT TGGGTGAGTG TTGCCGCCGA TCAGCGCGTG GCTTTGCGCC ATTTGTTGTT TCACGATTTT CTGATCGACC ACGAATTTGC GGCAGCGGAA CGCCGTGGGT GGCATGGACG GATGGCTGCG TGGTGTGGCG CAGCGCTTGA CCAGATTTGG CACGATAGTA CAGAATCCGT TGAACAGGCA CGGCGCTGGT ATGCGCGACA GCATTACATC ACCCATTTGG ATTGCGCAGA ACAATGGGAA GCATTGTGGC AAGTGATCGA TGCGGGCGAG TATGGCGAGC AGAAAGTGCG CTTTGAGCCA AGCACCCGCT TGTATGGCTT GGATTTGGAT CGTGCTCGCG AGAGCGTGAT TGCTGCTGGC CAAAGCCTTG AGCAGCAGCT TGAACTCTTG CCGCGCTTGT GGCGCTATAG TCTACTACGC ACCAGCCTCA CGGCCCATGC CGATCGTTGG CGTGACTATC ATTTTGTCAT GTTGGCGATG CTTGGGCGGG TATCTGAGGC GCTAGCCCAG CTCGATATTT GTTCGAATCA AGTATCCCAA GTGCGAATCT GGTCGCAACT ATTGCCCTAT CTAGAATCTG ATGTACGCTG GCGAATCTTC CAGCGCATGG AGCAAACAGC CCGTAGCATC CCCGATCCGC GTCGTCGCGA TTATGTGTTG CATCTCGTTG CTGTGGCCTA TGCCGACTAT GACTTGCTGA AAATGGCCTA TCCAATTGCG ATTAGCCTTG GTGATAGTCG TGATGAAACG CTGGCACATT TGATTGACGT GGCGATTAAA CAGCATGATT TAGCCTATGC TCAAGCCATA ATCGGGTATG TTCAAGCACC GGCAGCACAG ATTAAGCATG CCTTGAATGT AACCAATGCC TTGATTGAAG GCGCAGAATT TGAGGCAGCC CGACACGTAT TGGCCGACAT TATGCCCTTG GCCCAAGCGG AGCATATTGT TGAAATTAAT AGTTTGATTG CGTTGATTGA ATGGCGATTG GGTAACCAGC AACAATCCCA AACATTGCTG GATGAAGCCC AAACCATGAG CGCACACTTA AACCCCGATC TGCGATCTGA TGCGTGGTTG GCGGTGATTA AGAGTTATGT ACATCAAGGC GATTTGCTCA AGGCTGCAAG TTTGCATCGA AAAATTCAAT CAGTACAAGT CTGGTGTGAT CTGATCATGT TTTATCGTGA TCGCGCTGAA GTAACAATAG CGACTGAGTT GGCTATTGCT ATGACCAATG CCCACTTTCG TGATTCGGCC TGTTTTGCGC TGGTTGAATG GTATTGTACC CATGCTGAAT TTGCGGCTGC CCAACGCCTG ATTGAATTAA TCGGTTTCGA TTGGGACAAA GTTAAAGCGT ATTGTTTGCT GGCAAGTAGT TATGCTGAAA ATTTGCAATT TGAACAAATG CTGGCAGTAA TGCAGCTGGC ACACCATACC GTCCCGCGCG AGCAGCAACA TTCGATTCCA AGTTTGTTAG TCATTGCCGA TACCTATGCC CGCCACAATC TGCATGAGCA AGCCCGTAGT GTGTTTGAAC AAATATTAGC ACTCTTTTTG AGCGGCCAAA GGTATAATCG GGATGAACAT AACCTACACT TTGTGCAAAG TACCCAGCGC TATGGCTATC TCGATCTTTC CGAGCCATTG ATTCAACGCT TATTTACGCT AGGAGATTAT GGGTTTAATA ATGATCTCGT TGAGCGGATT GCGAAAGTCT ATGCTGAACA AGGCGAACTT GCCCAAGCAA CCCAGATTAT TCAATCAATT GGTCAAGATT ATCAGTTTAT CCAAGCGGCT CAAGGGCTGG TGTTGAGCAC AACCAACCAG CAATCTGCTG CTAGCGAATC GTTATTGCTT GCGGCGCGGC AACGGGTTGC CCAAATTAAC TCCGATGGGC CGTCTGCCCT AAAAACCCTT TGCGAGCTAG CTGATACAGC GTTGCAGCTT GGATTAACCG CCATCGCCCA AACGTTGCTG AGCGACGTTC ATCAATCTCT ACTACGCAAG CCACATCTCC TACATCATCC CTTGCTCCAA TATGAAGCGT GGCTGCTCAA AAGCTATCAA GCCCAACATA AACTAGCCGA TCTGATAGAG CTTGCACGGT TAATCGACGA TCCGCAAGCC CATGATCGGT GGATTGGCGC AATTCTTGAA GCCTATCTCC AGGCTGATGA TGTTGGACAA GCCTATCAGC TGCTTCGGTT ATTTAATGAT TTTGCGGATG TCTATGCTAA AAGTGCTTGT AAACTTGCGA TCAAAGCCAG CCAACTAGGG TTAAATGAGC TTGCAACCCA AGTGCATCCT GAAGCACTCT CAGCCTGTGA GACCGTTCGA GAGTCGCGTT ATCGCATTGA GTATCTCAGA GATCTTGCGG TTGCCCAAAT TAACTATGGT GATGCAGGCT GTTTAGCCCG CTTATTGCAG ATTTTTCGCG AGCAGGAAGC AGTATTCGGC CAGATCGATT GGTATATTGA AGCGCTTTGC GCAATTGCGG TGGCCTTTGC CGAACAAGGC GATGCTGTAG CGTTTGCTGA TTGGCTAAAC TATGCACATA AGCGGGCTAC GGCGTTTCCC ACTGGCTACC AAGCGCTTGC TGAAACGTAT TTTGGCTATA CTCCAGATTC GGCTATCAAG GCATTTTTGG ATTCTATTGA GCAGCTGGTG CAAATAAGCC TAGATCAGGG CTATGCAGAT ACTGCTCTTG AGGCGTTGGC TAAAATATAT ACAAGCTATG CAGCCCATGG CCATGCCGAG TTTTTGGTCA AAGCCCATCA AACCGCAATT AGTATTCCTA ATCGTGACTA TCAATATGCT GCACTTGTAA CCGTTGCCCA AGCGTATCTG AAGATCAAAG TTATGCCAGA GCTACAAGTG ATTATTAGTG AACTGAGCCA ACTTGGCTTT GATTTTTGGA TATTTCAGGA TCTTACTGCA AGTTGCATTG AAGAAGGCGA GCTTCATTTG GCATACCAGT TGATTCTGTT CGATGAGCGT CATCCAGTCA AGGATGAGGT CATTTGTAAC TTGATTGCCC GCTTAATCCA AGCCGATCAG CCGATAATTG CTTATCAGTT AACAAGCGAG ATCTACGAGG CTGATAATCG GGCTGGCAGT TTGCAACAAA TCATTCATTA TTATCTTGAG CGTCAGCAAA TCACTGATGT TATCAAGATT ATTCAAACCA CATGGCGCGA CTGCCAAAGG AGTTACGAAT TATGGCAATT AAGTACAATC ATTGTGCCGT TGATTCCCCA CTACCCATGG CTTGGCACTG CCGTGCTCGA TAGCGTGCCC TGGCTTGAAC AGCAATTAAC TCGCTTGAAT TAA
|
Protein sequence | MHNQILQESA NQRESARLLQ LRLTGFVGRE AEQAAIRGLI DQTRPSGGYV LVTGEAGAGK SSLIAQLIVS AGLAQTPQHF IALTPGRAYQ LDLLRSIVAQ LMLKHDLVIN YFPADSYPAL RLEFGQLLQT LSARGISETI YLDGLDQLQP EVDGTRDLSF LPLQLPSGIV MVLGSRPNET IASLALEHGV VYQVPPLREQ DAVGRWQQVQ PTLEPVRLHG LAQAVKGNAL LVELAANVLR HTTSAELLPL LDHASADATN LFRLSLGRIE QAAPHHWQSL IRPLLAVLLV TQESLELAVL AAILEQPTAT VGEALALMSD WVSVAADQRV ALRHLLFHDF LIDHEFAAAE RRGWHGRMAA WCGAALDQIW HDSTESVEQA RRWYARQHYI THLDCAEQWE ALWQVIDAGE YGEQKVRFEP STRLYGLDLD RARESVIAAG QSLEQQLELL PRLWRYSLLR TSLTAHADRW RDYHFVMLAM LGRVSEALAQ LDICSNQVSQ VRIWSQLLPY LESDVRWRIF QRMEQTARSI PDPRRRDYVL HLVAVAYADY DLLKMAYPIA ISLGDSRDET LAHLIDVAIK QHDLAYAQAI IGYVQAPAAQ IKHALNVTNA LIEGAEFEAA RHVLADIMPL AQAEHIVEIN SLIALIEWRL GNQQQSQTLL DEAQTMSAHL NPDLRSDAWL AVIKSYVHQG DLLKAASLHR KIQSVQVWCD LIMFYRDRAE VTIATELAIA MTNAHFRDSA CFALVEWYCT HAEFAAAQRL IELIGFDWDK VKAYCLLASS YAENLQFEQM LAVMQLAHHT VPREQQHSIP SLLVIADTYA RHNLHEQARS VFEQILALFL SGQRYNRDEH NLHFVQSTQR YGYLDLSEPL IQRLFTLGDY GFNNDLVERI AKVYAEQGEL AQATQIIQSI GQDYQFIQAA QGLVLSTTNQ QSAASESLLL AARQRVAQIN SDGPSALKTL CELADTALQL GLTAIAQTLL SDVHQSLLRK PHLLHHPLLQ YEAWLLKSYQ AQHKLADLIE LARLIDDPQA HDRWIGAILE AYLQADDVGQ AYQLLRLFND FADVYAKSAC KLAIKASQLG LNELATQVHP EALSACETVR ESRYRIEYLR DLAVAQINYG DAGCLARLLQ IFREQEAVFG QIDWYIEALC AIAVAFAEQG DAVAFADWLN YAHKRATAFP TGYQALAETY FGYTPDSAIK AFLDSIEQLV QISLDQGYAD TALEALAKIY TSYAAHGHAE FLVKAHQTAI SIPNRDYQYA ALVTVAQAYL KIKVMPELQV IISELSQLGF DFWIFQDLTA SCIEEGELHL AYQLILFDER HPVKDEVICN LIARLIQADQ PIIAYQLTSE IYEADNRAGS LQQIIHYYLE RQQITDVIKI IQTTWRDCQR SYELWQLSTI IVPLIPHYPW LGTAVLDSVP WLEQQLTRLN
|
| |