Gene Information |
Locus tag | Haur_1369 |
Symbol | |
ID | 5733261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1585164 |
End bp | 1589207 |
Gene Length | 4044 bp |
Protein Length | 1347 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641278507 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001544142 |
Protein GI | 159897895 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
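The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length of an unspliced CDS counts the terminal stop codon. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Haur_1369 record above.
start_bp, end_bp = 1585164, 1589207

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 4044, matches "Gene Length"
print(protein_length(length_bp))  # 1347, matches "Protein Length"
```

Both results agree with the listed values, which is consistent with the assumptions stated above. The minus strand does not affect the length calculation, since it only changes which end of the interval holds the start codon.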
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.275102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATG AAGAATTAGG GCGAGCCATT ATTAAAATCG GCGCGGATGA TGCCGAGCTT GATCGTTCGA TCAGCAACGC CGAAACCAAA GCCAAAGGCT TCGCGCAATC CTTAGCCGAA AGTCTGAGTA CCGCACTGAC CACCGCCATT GCAGGCGCGG TGGCTGCCGC AGGCGCAGCC TTGGCAGGAC TGGCGGTCAC GGGGGTCAAC GCGTTTGCGG GCTTCCAAGA GCAGATGAAC AGCGTGTTCA CTCTCTTGCC TGGTATCTCC CAACAGGCCA TGCAGCAGAT GGGTGAGCAG GTCAAAGACT TTGCGGTGGA GTTTGGAGCG CTGCCTGAAA AGGTGATCCC CGCGCTCTAT GAAGCCCTCT CGTCCGGCGT GCCCAAAGAC AATGTATTTT CCTTCCTTGA AACCGCACAG AAAGCGGCTA TCGGCGGCGT AACGGATACC CAAACCACCG TGGACGGTTT GACCAGCGTG GTCAATGCCT ACGGCGCGGA CGTGCTGAGC GTGCAGGACG CGAGTGATCA GATGTTCACC GCCGTGGCCT ATGGCAAAAC CACCTTTGCC GAACTGGCGA GCACGCTCTA CAACGTCAAC CCGATTGCCG CCAGCTTGGG CGTATCCTTC AGCGACGTGT CGGCAGCCAT CGCCGCCATG ACTTCCCAAG GTGTGCCCAC GGCCCAAACC ACCACGATGT TGCGCCAGTT GTTTGTGGAG TTATCGCAAT CCGGTGGCGA GGTTTCTGAC CTCTTCCAAC AGTTATCGGG CACGTCGTTC AAGAGCTTTA TTGAAAGTGG TGGCAATGTT CAACAAGCGC TGCAACTGCT GGAGCAGCAC GCCAATGCCA GCGGCGTGGG CATCAATGAC TTATTTAGTA GTGTTGAAGC GGGTAGTGCA GCACTGACCC TGACAGGCCG TGGCACCGAG ATGTTTACGG GGGCGCTCGA TGCGATGCAG AACAGCGCCG GAGCCACCGA GGCCGCCTAC ACCCAAATGG ATCAGGGGTT AGCGCGAGCC TTTGACGGGA TTCGTGCCCA GTTTGCCGTG CTGCAAACGA ATATTGGCAA TGCGTTAGCC CCCACCGTGC AAGCCTTTGC CGATTGGTTG GCCGAAGCCA TGCCCCGCAT CAGCGCCGTG GTCGTGGGAG CGTTTGAGCG GATTGGCGCA GCGGTTCAGA TGGTTGGCCC GTATTTCCTT GACTTTGCCA GCACCGCGAG CGAAGCCTTT GGTCTGTTCA TTGACCTCGC ATCAAGCGCC GTGCAATGGG GCAGCAACAT TGCCACCCAG TTAGCCAACG GGATTATGGG TGGTGCTGGT GCGGTCGTCG ATAGCTTGGG ATACATTGGC GACCTTATCA CCTACTGGCT TGAACCGCAC TCCCCACCAA AGTTATTACC CAACATCGAC ACATGGGGTC GCGATACCGC CCAAGTCTGG ATGGATGGCT GGAATGATGC GCGGCTGCCC ACCGATGGCT TGCTGGCCTA TCTCGAAACC GAACTGAAAT CCATTGAAGA TGCGCAGCAG CGGGTCAAGG AAGCAGCCCA AGAACAGCAG CTCTTGGCGC TGATCAACAG CACGGGTGGT AAAGACAGCG ACCGCGAACA GGCCAAGCTC GAATTGGCCG CGCTGAAGCT GCGCCAAAAG ATTCGCGAGG AGCAGGCTAA AGCTGCCGAG CAAGAAACCA AAAAGCCCAA AGCGAGTGGT GGCGGAGGTG GCGGTGGCAA ATCCCCGATG GACGATGCCG CCAAGAACGC TGAAGCGGCG GCCCGTGCGC AGTGGGAATA CACATTCAGT 
ATTGCCGGAA CTGTCGATCG CTTGCAGATG CTCAAGGACA AGCAAGCCGC CTATGCTACG ACCGATGCCG AGTATTGGCG ACTGCAAGGG CAGATTAATC AGGAAGAGCA GAAACGTCAG GCGGAACTCA ATAAACTGGC CAAGGAGCAA CGCGACTATG AGTTAAGCCT CATGTCCACT GAGGATCAGT TAGCTCGCTT GCGCCAAGAA CAGGGTCAAT ATGCCGAGGG CAGCGCCGAA TACAACGACA TTCAGCAGGA GATTAACAAA GCCGAGAAAC AACGCCAGCG CGAGCTGGAG GAAGTCAAAC GCAAACAGGA CGAAGCGGCC AAAGCCGAGC GGGATTATCA GTATGCCACC GCTGACACCG CTGGCAAACT AGCAATCCTA CAAGGTGAAT TGGCCAATAC CAACGCCGAT CAATCCGAAT ACTGGCGCAT CAAAACCCAG ATTTTGCAGC TCGAAGCCCA GCAACAAAAA GAGCTGGAAG CGTCAGCCGA GAAGATGAAG GGTGTAGGCG GAGCCGCCAA GGGAGCAGCC AAAGGCGTGG GTGCGCTCGT TCCACCATTC ACCAAGGTCA AGGATGGGGC GGACGAAACC AATCAATCCA TGCAGGATGC AGCCACCGGA GCCGAGGCAG CCGCCGATCG CTATGCTGAT TTGAAGGATC GCCAAGTCGA AATGGCGACT GCCACTGCGC CTGTACCGTC CTTGATCGAC CGGATTCGCA ACGCCTTCAG CCAAGCCGCC CCGTTTATTG AATCGGTTAA AGGCGCATTC ATGGGCATTG CGGCACTCTT CACGGGTATG GGACTGGTGT CCCTGTTCTC CGGTTTGGCG GGAGCGGTAG CCGCACTCGT TTCACCGATG GGCTTGCTGG TAGCTGGAGC CGCAGCGCTG GGATTGGCGT GGCAAACCAA TTTCGCCACT ATTCGCACCA TCACGACCGA GGTCTTTACA TCGATTCAGT CCATGGTACA AACCGCGTGG GGCATCATCA CCAGCTGGTT CCGCGAGAAC GGCGAGCAGA TCGTGACGTT CCTCCGCAAT GCGTGGGGCC GGATTGAGGG CATTGTCAGC ACGGTGCTGG GCGCGGTTGG CACGGTGATT CAAACGGCAC TCGGCGCAAT CCGAACCTTC CTCGAGCAGC ACGGCGAAGA AATCAAAGGC GTGCTGACGA TGGCGTGGGA AACCATTCAG GTCGTCATTG ATGGCGCACT GGATGTGATC AATGGTGCGA TCATTCCATT CTTCCACGGC ATCGCCACCT TCCTCAGCGA CCACAAAGAC CAGATCACGG GCCTGCTCAG TGGCGCATGG AGCATCATCA AGGGCATCGT TGAAGCCGCC ATGAGCGTGA TTCAAGGCGT GATCAAAACC GTCAGCGCCG TGATTCAGGG CGATTGGTCA GGCGCATGGA CGGCAATCAA GGATGTCTTT GCTGGACTCT GGAATGGCAT CATCGACATC GTGAAGGGAG CACTGGAGGT CGTCTGGAAC CTCTTGGTGG TGGCGTGGGA TCTCATTGGC ACGGGCATCA GCACCGCGTG GGATGGCATC AGCGACTACT TCCGCACGAT GTTTGGCGGT GTGCTCGACA ACCTTGATGG CTTTTTGGAG CAATTCAAAA ACGGGGTTGC GGTGGCATGG CAATGGATTA AGGATGCCAG TGCTGCGGTG TGGGATGAAA TTACCAACGG GATTGTAGAA GCCTTTCGCG GCATCCTCAG TGGCATTAAA TCCCCACTCA ACCTGATGAT CACCGCCGTC AACAAACTCA TTGAAGGCGC GAATACCGTT GGCTCCGCCT 
TGGGCTTCGG TGGTATTCCA CTGATTCCCT ACTTAGCAGA CGGCGTGAAA AACTGGGCGG GCGGTTTGGC CTTTGCGTCC GAGCCGTGGA AAGGCATCGA AGCCATGCGC ACGCGGGGTG GTGACTTTGC CCTGCTGCCA CCAGGTATTA GCAATATTCC ACGCGGCGCG GAGGTGTTCA CCGCCGAAGA AACCAAGGGC ATGGCTGCGC GGATGGTCGT CCCGCAAGGC GGCATGGTGG GCACGCTGTC CAATGCGTTG CCAACGAATC AACCATCTGC GCCGTCGATC ATCAATCAGA TCACCATCGA TGCGCGACAG GCCACCAATC CGGCAGCCAT CAAAGCCGCC GTTGAAGAAG TCTGGAACAA GAAGATGAAA GATATTTTGG GTAGCGCCAA TATCCTGCTC AAAACCAATC CATATGGCAA ATAA
|
Protein sequence | MADEELGRAI IKIGADDAEL DRSISNAETK AKGFAQSLAE SLSTALTTAI AGAVAAAGAA LAGLAVTGVN AFAGFQEQMN SVFTLLPGIS QQAMQQMGEQ VKDFAVEFGA LPEKVIPALY EALSSGVPKD NVFSFLETAQ KAAIGGVTDT QTTVDGLTSV VNAYGADVLS VQDASDQMFT AVAYGKTTFA ELASTLYNVN PIAASLGVSF SDVSAAIAAM TSQGVPTAQT TTMLRQLFVE LSQSGGEVSD LFQQLSGTSF KSFIESGGNV QQALQLLEQH ANASGVGIND LFSSVEAGSA ALTLTGRGTE MFTGALDAMQ NSAGATEAAY TQMDQGLARA FDGIRAQFAV LQTNIGNALA PTVQAFADWL AEAMPRISAV VVGAFERIGA AVQMVGPYFL DFASTASEAF GLFIDLASSA VQWGSNIATQ LANGIMGGAG AVVDSLGYIG DLITYWLEPH SPPKLLPNID TWGRDTAQVW MDGWNDARLP TDGLLAYLET ELKSIEDAQQ RVKEAAQEQQ LLALINSTGG KDSDREQAKL ELAALKLRQK IREEQAKAAE QETKKPKASG GGGGGGKSPM DDAAKNAEAA ARAQWEYTFS IAGTVDRLQM LKDKQAAYAT TDAEYWRLQG QINQEEQKRQ AELNKLAKEQ RDYELSLMST EDQLARLRQE QGQYAEGSAE YNDIQQEINK AEKQRQRELE EVKRKQDEAA KAERDYQYAT ADTAGKLAIL QGELANTNAD QSEYWRIKTQ ILQLEAQQQK ELEASAEKMK GVGGAAKGAA KGVGALVPPF TKVKDGADET NQSMQDAATG AEAAADRYAD LKDRQVEMAT ATAPVPSLID RIRNAFSQAA PFIESVKGAF MGIAALFTGM GLVSLFSGLA GAVAALVSPM GLLVAGAAAL GLAWQTNFAT IRTITTEVFT SIQSMVQTAW GIITSWFREN GEQIVTFLRN AWGRIEGIVS TVLGAVGTVI QTALGAIRTF LEQHGEEIKG VLTMAWETIQ VVIDGALDVI NGAIIPFFHG IATFLSDHKD QITGLLSGAW SIIKGIVEAA MSVIQGVIKT VSAVIQGDWS GAWTAIKDVF AGLWNGIIDI VKGALEVVWN LLVVAWDLIG TGISTAWDGI SDYFRTMFGG VLDNLDGFLE QFKNGVAVAW QWIKDASAAV WDEITNGIVE AFRGILSGIK SPLNLMITAV NKLIEGANTV GSALGFGGIP LIPYLADGVK NWAGGLAFAS EPWKGIEAMR TRGGDFALLP PGISNIPRGA EVFTAEETKG MAARMVVPQG GMVGTLSNAL PTNQPSAPSI INQITIDARQ ATNPAAIKAA VEEVWNKKMK DILGSANILL KTNPYGK
|
| |