Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0549 |
Symbol | |
ID | 5732283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 641054 |
End bp | 645097 |
Gene Length | 4044 bp |
Protein Length | 1347 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641277676 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001543325 |
Protein GI | 159897078 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.321201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATG AAGAATTAGG GCGAGCAATC ATTAAAATCG GCGCGGATGA TGCCGAGCTT GATCGCTCGA TCAGCAACGC CGAAACCAAA GCCAAGGGCT TCGCGCAGTC GTTAGCCGAA AGTCTGAGTA CCGCACTGAC CACCGCCATT GCGGGCGCGG TGGCTGCCGC AGGCGCAGCC TTAGCAGGAC TGGCGGTCAC GGGGGTCAAC GCGTTCGCGG GCTTCCAACA GCAGATGAAT AGCGTGTTCA CGTTGCTGCC GGGTATTTCC CAACAGGCCA TGCAGCAGAT GGGTGAGCAG GTCAAAGACT TTGCGGTGGA ATTTGGAGCG TTGCCCGAGA AAGTGATCCC CGCGCTCTAT GAAGCCTTGT CCTCGGGCGT GCCCAAAGAC AATGTGTTTT CCTTCCTCGA AACCGCGCAG AAAGCGGCTA TCGGCGGCGT AACCGATACC CAAACCACGG TGGACGGCCT GACCAGTGTA GTCAATGCCT ACGGCGCGGA CGTGCTGAGC GTGCAAGATG CCAGCGACCA GATGTTTACC GCCGTGGCCT ATGGCAAAAC CACGTTTGCC GAACTGGCGA GCACGCTCTA CAACGTCAAC CCGATTGCCG CGAGCTTGGG CGTATCGTTC AGCGATGTCT CTGCCGCGAT TGCCGCTATG ACCTCGCAAG GTGTGCCTAC CGCCCAAACC ACCACGATGT TGCGCCAGTT GTTTGTGGAG TTATCGCAAT CAGGTGGCGA AGTCTCTGAG CTATTCCAGC AACTTTCTGG CACCTCGTTT AAATCCTTTA TTGAGAGCGG TGGCAACGTT CAGCAAGCGC TGCAACTGCT GGAGCAGCAC GCCAATGCCA GTGGCGTGGG CATCAATGAC TTATTTAGCT CTGTCGAAGC GGGTAGCGCG GCGCTGACCC TGACCGGACG CGGCACCGAG ATGTTCACGG GGGCGCTCGA TGCGATGCAA AACAGCGCCG GAGCCACCGA GGCCGCCTAT AACCAAATGG ATCAAGGCTT AGTGCGAGCC TTTGACGGGA TTCGTGCCCA ATTTGCCGTG TTGCAAACGA ATATTGGGAA TGCATTAGCA CCCACCGTCC AAGCCTTTGC CGATTGGTTG GCCGAAGCCA TGCCCCGCAT CAGCGCCGTG GTTGTGGGAG CGTTTGAGCG GATTGGTACA GCCGTCCAGA TGGTTGGCCC GTATTTCCTT GACTTCGCCA GCACGGCGAG CGAAGCCTTT GGTCTGTTCA TTGACCTTGC CAGCAGCGCG GTGCAATGGG GTAGCAACAT TGCCACCCAG CTAGCCAACG GGATTATGGG TGGTGCTGGT GCGGTCGTCG ATAGCCTTGG CTACATTGGC GACCTCGTGA CCTATTGGCT TGAGCCGCAT TCTCCACCCA AGCTGCTTCC CAATATCGAC ACCTGGGGCC GCGATACGGC GCAAGTCTGG ATGGATGGTT GGAATGATGC GCGGCTGCCA ACCGATGGGC TACTGGCCTA TCTCGAAACC GAACTGCAAT CGATTGAAGA TGCCCAGCAG CGGGTCAAGG AAGCAGCCCA AGAAAAGCAG CTCTTGGCGC TGATTAATAG TACGGGTGGC AAGGATAGTG ACCGCGAGCA GGCCAAACTG GAGCTAGCCG CCCTGAAGCT GCGCCAGAAA ATCCGCGAGG AACAAGCCAA AGCCGCCGAG CAGGAAGTCA AAAAGCCCAA AGCCACGGGC GGCGGGGGTG GCGGCGGCAA ATCGCCCATG GACGATGCCG CCAAGAAAGC TGAAGCGGCG GCCCGTGCGC AGTGGGAATA CAGCTTCAGT ATTGCCGGAA CTGCCGATCG CTTACAGATG CTCAAGGACA AGCAGGCCGC CTATGCCACA ACCGATGCCG AGTATTGGCG ACTGCAAGGG CAGATTAATC AGGAAGAGCA GAAACGTCAG GCGGAACTCA ATAAACTGGC CAAAGAGCAA CGCGACTATG AGTTAAGCCT CATGTCCACC GAGGATCAGT TAGCTCGCTT GCGCCAAGAG CAGGGCCAAT ACGCCGAGGG CAGCGCCGAA TACAACGATA TTCAACAGGA GATCAATAAG GCCGAGAAAC AGCGCCAGCG CGAGCTGGAG GAGGTCAAGC GGAAACAGGA CGAAGCCGCC AAAGCCGAAC GCGATTATCA GTATGCCACA GCGGATACTG CGGGCAAGCT GGCAATCCTC CAAGGTGAAT TGGCCAATAC CAACGCTGAC CAATCCGAAT ATTGGCGCAT CAAAACTCAA ATCTCGCAGC TCGAAGCCCA GCAACAAAAA GAGTTAGAGG CTTCAGCCGA GAAAATGAAA GGTGTGGGCG GCGCGGCCAA GGGTGCGGCC AAAGGTGTCG GTGCGCTCGT TCCACCATTC ACCAAGGTTA AGGATGGCGC GGACGAAACC AATCAATCCA TGCAGGATGC GGCAACCGGA GCCGAGGTAG CCGCCGATCG CTATGCTGAT TTGAAGGATC GCCAAGTCGA AATGGCCAAT GCCACCGCGC CTGTGCCATC CTTGATCGAC CGGATTCGCA ACGCTTTCAG CCAAGCTGCA CCGTTTATTG AATCGGTCAA AGGCGCATTC ATGGGCATTG CCACCCTCTT CACGGGTATG GGCCTGGTGT CACTGTTCAC GGGCTTGGCG GGAGCGGTCG CCGCACTCGT GTCACCGATG GGCTTACTGG TAGCCGGAGC CGCCGCGCTG GGATTGGCGT GGCAAACCAA TTTCGCCAAT ATCCGCACCA TCACCGGAGA GGTTTTCAAT GCGGTACAGA CCACCGTACA AACCGCCTTT GGGCTAATTA CCAGTTGGTT CCGCGAGAAC GGCGAGCAGA TCGTGACCTT CCTGCGTGAT GCGTGGGGCC GGATTGAGGG CATTGTGAGC ACGGTGCTGG GCGCGGTTGG CACGGTGATT CAAACCGCAC TCGACGCAAT CCGAAGCTTC CTTGAGCAGC ACGGCGAAGA AATCAAAGGT GTGCTGACGA TGGCGTGGGA AACTATTCAG GTCGTCATTG ATGGCGCACT GGATGTGATT GATGGTGCGA TTATTCCATT CTTCCACGGC ATCGCCACCT TCCTCAGCGA CCACAAAGAC CAGATCACGG GCCTGCTCAG TGGGGCATGG AGCATCATCA AGGGCATCGT CGAAGCCGCC ATGAGCGTGA TTCAAGGCGT GATCAAAACC GTCACTGCCG TGATTCACGG GGATTGGTCA GGCGCATGGA CGGCAATCAA GGATGTCTTT GCCGGACTCT GGAATGGCAT CATCGACATC GTGAAGGGGG CGCTGGAGGT TGTCTGGAAC CTCTTGGTGG TCGCGTGGGA TCTCATTGGC ACGGGCATTC GCACCGCGTG GGATGGCATC AGCGACTACT TCCGCACGAT GTTTGGCGGT GTGCTCGACA ACCTTGACGG CTTTTTGGAG CAATTCAAAA ACGGGGTTGC GGTGGCGTGG CAATGGATTA AGGATGCCAG TGCTGCGGTG TGGGATGAAA TTACCAACGG GATTGTGGAA GCCTTTCGCG GCATCCTCAG TGGCATTAAA TCTCCGCTCA ACCTGATGAT CACCGCCGTC AACAAACTCA TTGAAGGCGC GAATACCGTT GGCTCCGCCT TGGGCTTCGG TGGCATTCCA CTGATTCCCT ACTTAGCCGA CGGGGTGAAG AATTGGGCTG GCGGTTTGGC CTTTGCCTCC GAGCCGTGGA AGGGCATCGA AGCCATGCGC ACACGCGGCG GCGACTTTGC GCTGTTGCCG CCAGGCATTA GCAACATTCC ACGCGGCGCG GAGGTCTTCA CCGCCGAAGA AACCAAGGGC ATGACGGCTC GCATGGTCGT ACCCCAAGGC GGTATGGTCG GCGCACGCGG CATGGAGGGA GCGAGTGGGA CAACCATTGT CAACTATTAC ACCTATGAGA TTGCCGTTGA TGCCCGTGAA GCCATGAATC CCGCAGCGGT CGAAGCGGCG GCCATTCGTG GCGCTGAAAA GGCCATCAAG AACTATGTCG AAAAGGCAAA TATTCAGAAG AAAACCAATC CCTTTGGCAA ATAA
|
Protein sequence | MADEELGRAI IKIGADDAEL DRSISNAETK AKGFAQSLAE SLSTALTTAI AGAVAAAGAA LAGLAVTGVN AFAGFQQQMN SVFTLLPGIS QQAMQQMGEQ VKDFAVEFGA LPEKVIPALY EALSSGVPKD NVFSFLETAQ KAAIGGVTDT QTTVDGLTSV VNAYGADVLS VQDASDQMFT AVAYGKTTFA ELASTLYNVN PIAASLGVSF SDVSAAIAAM TSQGVPTAQT TTMLRQLFVE LSQSGGEVSE LFQQLSGTSF KSFIESGGNV QQALQLLEQH ANASGVGIND LFSSVEAGSA ALTLTGRGTE MFTGALDAMQ NSAGATEAAY NQMDQGLVRA FDGIRAQFAV LQTNIGNALA PTVQAFADWL AEAMPRISAV VVGAFERIGT AVQMVGPYFL DFASTASEAF GLFIDLASSA VQWGSNIATQ LANGIMGGAG AVVDSLGYIG DLVTYWLEPH SPPKLLPNID TWGRDTAQVW MDGWNDARLP TDGLLAYLET ELQSIEDAQQ RVKEAAQEKQ LLALINSTGG KDSDREQAKL ELAALKLRQK IREEQAKAAE QEVKKPKATG GGGGGGKSPM DDAAKKAEAA ARAQWEYSFS IAGTADRLQM LKDKQAAYAT TDAEYWRLQG QINQEEQKRQ AELNKLAKEQ RDYELSLMST EDQLARLRQE QGQYAEGSAE YNDIQQEINK AEKQRQRELE EVKRKQDEAA KAERDYQYAT ADTAGKLAIL QGELANTNAD QSEYWRIKTQ ISQLEAQQQK ELEASAEKMK GVGGAAKGAA KGVGALVPPF TKVKDGADET NQSMQDAATG AEVAADRYAD LKDRQVEMAN ATAPVPSLID RIRNAFSQAA PFIESVKGAF MGIATLFTGM GLVSLFTGLA GAVAALVSPM GLLVAGAAAL GLAWQTNFAN IRTITGEVFN AVQTTVQTAF GLITSWFREN GEQIVTFLRD AWGRIEGIVS TVLGAVGTVI QTALDAIRSF LEQHGEEIKG VLTMAWETIQ VVIDGALDVI DGAIIPFFHG IATFLSDHKD QITGLLSGAW SIIKGIVEAA MSVIQGVIKT VTAVIHGDWS GAWTAIKDVF AGLWNGIIDI VKGALEVVWN LLVVAWDLIG TGIRTAWDGI SDYFRTMFGG VLDNLDGFLE QFKNGVAVAW QWIKDASAAV WDEITNGIVE AFRGILSGIK SPLNLMITAV NKLIEGANTV GSALGFGGIP LIPYLADGVK NWAGGLAFAS EPWKGIEAMR TRGGDFALLP PGISNIPRGA EVFTAEETKG MTARMVVPQG GMVGARGMEG ASGTTIVNYY TYEIAVDARE AMNPAAVEAA AIRGAEKAIK NYVEKANIQK KTNPFGK
|
| |