Gene Haur_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0549 
Symbol 
ID5732283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp641054 
End bp645097 
Gene Length4044 bp 
Protein Length1347 aa 
Translation table11 
GC content57% 
IMG OID641277676 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001543325 
Protein GI159897078 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.321201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATG AAGAATTAGG GCGAGCAATC ATTAAAATCG GCGCGGATGA TGCCGAGCTT 
GATCGCTCGA TCAGCAACGC CGAAACCAAA GCCAAGGGCT TCGCGCAGTC GTTAGCCGAA
AGTCTGAGTA CCGCACTGAC CACCGCCATT GCGGGCGCGG TGGCTGCCGC AGGCGCAGCC
TTAGCAGGAC TGGCGGTCAC GGGGGTCAAC GCGTTCGCGG GCTTCCAACA GCAGATGAAT
AGCGTGTTCA CGTTGCTGCC GGGTATTTCC CAACAGGCCA TGCAGCAGAT GGGTGAGCAG
GTCAAAGACT TTGCGGTGGA ATTTGGAGCG TTGCCCGAGA AAGTGATCCC CGCGCTCTAT
GAAGCCTTGT CCTCGGGCGT GCCCAAAGAC AATGTGTTTT CCTTCCTCGA AACCGCGCAG
AAAGCGGCTA TCGGCGGCGT AACCGATACC CAAACCACGG TGGACGGCCT GACCAGTGTA
GTCAATGCCT ACGGCGCGGA CGTGCTGAGC GTGCAAGATG CCAGCGACCA GATGTTTACC
GCCGTGGCCT ATGGCAAAAC CACGTTTGCC GAACTGGCGA GCACGCTCTA CAACGTCAAC
CCGATTGCCG CGAGCTTGGG CGTATCGTTC AGCGATGTCT CTGCCGCGAT TGCCGCTATG
ACCTCGCAAG GTGTGCCTAC CGCCCAAACC ACCACGATGT TGCGCCAGTT GTTTGTGGAG
TTATCGCAAT CAGGTGGCGA AGTCTCTGAG CTATTCCAGC AACTTTCTGG CACCTCGTTT
AAATCCTTTA TTGAGAGCGG TGGCAACGTT CAGCAAGCGC TGCAACTGCT GGAGCAGCAC
GCCAATGCCA GTGGCGTGGG CATCAATGAC TTATTTAGCT CTGTCGAAGC GGGTAGCGCG
GCGCTGACCC TGACCGGACG CGGCACCGAG ATGTTCACGG GGGCGCTCGA TGCGATGCAA
AACAGCGCCG GAGCCACCGA GGCCGCCTAT AACCAAATGG ATCAAGGCTT AGTGCGAGCC
TTTGACGGGA TTCGTGCCCA ATTTGCCGTG TTGCAAACGA ATATTGGGAA TGCATTAGCA
CCCACCGTCC AAGCCTTTGC CGATTGGTTG GCCGAAGCCA TGCCCCGCAT CAGCGCCGTG
GTTGTGGGAG CGTTTGAGCG GATTGGTACA GCCGTCCAGA TGGTTGGCCC GTATTTCCTT
GACTTCGCCA GCACGGCGAG CGAAGCCTTT GGTCTGTTCA TTGACCTTGC CAGCAGCGCG
GTGCAATGGG GTAGCAACAT TGCCACCCAG CTAGCCAACG GGATTATGGG TGGTGCTGGT
GCGGTCGTCG ATAGCCTTGG CTACATTGGC GACCTCGTGA CCTATTGGCT TGAGCCGCAT
TCTCCACCCA AGCTGCTTCC CAATATCGAC ACCTGGGGCC GCGATACGGC GCAAGTCTGG
ATGGATGGTT GGAATGATGC GCGGCTGCCA ACCGATGGGC TACTGGCCTA TCTCGAAACC
GAACTGCAAT CGATTGAAGA TGCCCAGCAG CGGGTCAAGG AAGCAGCCCA AGAAAAGCAG
CTCTTGGCGC TGATTAATAG TACGGGTGGC AAGGATAGTG ACCGCGAGCA GGCCAAACTG
GAGCTAGCCG CCCTGAAGCT GCGCCAGAAA ATCCGCGAGG AACAAGCCAA AGCCGCCGAG
CAGGAAGTCA AAAAGCCCAA AGCCACGGGC GGCGGGGGTG GCGGCGGCAA ATCGCCCATG
GACGATGCCG CCAAGAAAGC TGAAGCGGCG GCCCGTGCGC AGTGGGAATA CAGCTTCAGT
ATTGCCGGAA CTGCCGATCG CTTACAGATG CTCAAGGACA AGCAGGCCGC CTATGCCACA
ACCGATGCCG AGTATTGGCG ACTGCAAGGG CAGATTAATC AGGAAGAGCA GAAACGTCAG
GCGGAACTCA ATAAACTGGC CAAAGAGCAA CGCGACTATG AGTTAAGCCT CATGTCCACC
GAGGATCAGT TAGCTCGCTT GCGCCAAGAG CAGGGCCAAT ACGCCGAGGG CAGCGCCGAA
TACAACGATA TTCAACAGGA GATCAATAAG GCCGAGAAAC AGCGCCAGCG CGAGCTGGAG
GAGGTCAAGC GGAAACAGGA CGAAGCCGCC AAAGCCGAAC GCGATTATCA GTATGCCACA
GCGGATACTG CGGGCAAGCT GGCAATCCTC CAAGGTGAAT TGGCCAATAC CAACGCTGAC
CAATCCGAAT ATTGGCGCAT CAAAACTCAA ATCTCGCAGC TCGAAGCCCA GCAACAAAAA
GAGTTAGAGG CTTCAGCCGA GAAAATGAAA GGTGTGGGCG GCGCGGCCAA GGGTGCGGCC
AAAGGTGTCG GTGCGCTCGT TCCACCATTC ACCAAGGTTA AGGATGGCGC GGACGAAACC
AATCAATCCA TGCAGGATGC GGCAACCGGA GCCGAGGTAG CCGCCGATCG CTATGCTGAT
TTGAAGGATC GCCAAGTCGA AATGGCCAAT GCCACCGCGC CTGTGCCATC CTTGATCGAC
CGGATTCGCA ACGCTTTCAG CCAAGCTGCA CCGTTTATTG AATCGGTCAA AGGCGCATTC
ATGGGCATTG CCACCCTCTT CACGGGTATG GGCCTGGTGT CACTGTTCAC GGGCTTGGCG
GGAGCGGTCG CCGCACTCGT GTCACCGATG GGCTTACTGG TAGCCGGAGC CGCCGCGCTG
GGATTGGCGT GGCAAACCAA TTTCGCCAAT ATCCGCACCA TCACCGGAGA GGTTTTCAAT
GCGGTACAGA CCACCGTACA AACCGCCTTT GGGCTAATTA CCAGTTGGTT CCGCGAGAAC
GGCGAGCAGA TCGTGACCTT CCTGCGTGAT GCGTGGGGCC GGATTGAGGG CATTGTGAGC
ACGGTGCTGG GCGCGGTTGG CACGGTGATT CAAACCGCAC TCGACGCAAT CCGAAGCTTC
CTTGAGCAGC ACGGCGAAGA AATCAAAGGT GTGCTGACGA TGGCGTGGGA AACTATTCAG
GTCGTCATTG ATGGCGCACT GGATGTGATT GATGGTGCGA TTATTCCATT CTTCCACGGC
ATCGCCACCT TCCTCAGCGA CCACAAAGAC CAGATCACGG GCCTGCTCAG TGGGGCATGG
AGCATCATCA AGGGCATCGT CGAAGCCGCC ATGAGCGTGA TTCAAGGCGT GATCAAAACC
GTCACTGCCG TGATTCACGG GGATTGGTCA GGCGCATGGA CGGCAATCAA GGATGTCTTT
GCCGGACTCT GGAATGGCAT CATCGACATC GTGAAGGGGG CGCTGGAGGT TGTCTGGAAC
CTCTTGGTGG TCGCGTGGGA TCTCATTGGC ACGGGCATTC GCACCGCGTG GGATGGCATC
AGCGACTACT TCCGCACGAT GTTTGGCGGT GTGCTCGACA ACCTTGACGG CTTTTTGGAG
CAATTCAAAA ACGGGGTTGC GGTGGCGTGG CAATGGATTA AGGATGCCAG TGCTGCGGTG
TGGGATGAAA TTACCAACGG GATTGTGGAA GCCTTTCGCG GCATCCTCAG TGGCATTAAA
TCTCCGCTCA ACCTGATGAT CACCGCCGTC AACAAACTCA TTGAAGGCGC GAATACCGTT
GGCTCCGCCT TGGGCTTCGG TGGCATTCCA CTGATTCCCT ACTTAGCCGA CGGGGTGAAG
AATTGGGCTG GCGGTTTGGC CTTTGCCTCC GAGCCGTGGA AGGGCATCGA AGCCATGCGC
ACACGCGGCG GCGACTTTGC GCTGTTGCCG CCAGGCATTA GCAACATTCC ACGCGGCGCG
GAGGTCTTCA CCGCCGAAGA AACCAAGGGC ATGACGGCTC GCATGGTCGT ACCCCAAGGC
GGTATGGTCG GCGCACGCGG CATGGAGGGA GCGAGTGGGA CAACCATTGT CAACTATTAC
ACCTATGAGA TTGCCGTTGA TGCCCGTGAA GCCATGAATC CCGCAGCGGT CGAAGCGGCG
GCCATTCGTG GCGCTGAAAA GGCCATCAAG AACTATGTCG AAAAGGCAAA TATTCAGAAG
AAAACCAATC CCTTTGGCAA ATAA
 
Protein sequence
MADEELGRAI IKIGADDAEL DRSISNAETK AKGFAQSLAE SLSTALTTAI AGAVAAAGAA 
LAGLAVTGVN AFAGFQQQMN SVFTLLPGIS QQAMQQMGEQ VKDFAVEFGA LPEKVIPALY
EALSSGVPKD NVFSFLETAQ KAAIGGVTDT QTTVDGLTSV VNAYGADVLS VQDASDQMFT
AVAYGKTTFA ELASTLYNVN PIAASLGVSF SDVSAAIAAM TSQGVPTAQT TTMLRQLFVE
LSQSGGEVSE LFQQLSGTSF KSFIESGGNV QQALQLLEQH ANASGVGIND LFSSVEAGSA
ALTLTGRGTE MFTGALDAMQ NSAGATEAAY NQMDQGLVRA FDGIRAQFAV LQTNIGNALA
PTVQAFADWL AEAMPRISAV VVGAFERIGT AVQMVGPYFL DFASTASEAF GLFIDLASSA
VQWGSNIATQ LANGIMGGAG AVVDSLGYIG DLVTYWLEPH SPPKLLPNID TWGRDTAQVW
MDGWNDARLP TDGLLAYLET ELQSIEDAQQ RVKEAAQEKQ LLALINSTGG KDSDREQAKL
ELAALKLRQK IREEQAKAAE QEVKKPKATG GGGGGGKSPM DDAAKKAEAA ARAQWEYSFS
IAGTADRLQM LKDKQAAYAT TDAEYWRLQG QINQEEQKRQ AELNKLAKEQ RDYELSLMST
EDQLARLRQE QGQYAEGSAE YNDIQQEINK AEKQRQRELE EVKRKQDEAA KAERDYQYAT
ADTAGKLAIL QGELANTNAD QSEYWRIKTQ ISQLEAQQQK ELEASAEKMK GVGGAAKGAA
KGVGALVPPF TKVKDGADET NQSMQDAATG AEVAADRYAD LKDRQVEMAN ATAPVPSLID
RIRNAFSQAA PFIESVKGAF MGIATLFTGM GLVSLFTGLA GAVAALVSPM GLLVAGAAAL
GLAWQTNFAN IRTITGEVFN AVQTTVQTAF GLITSWFREN GEQIVTFLRD AWGRIEGIVS
TVLGAVGTVI QTALDAIRSF LEQHGEEIKG VLTMAWETIQ VVIDGALDVI DGAIIPFFHG
IATFLSDHKD QITGLLSGAW SIIKGIVEAA MSVIQGVIKT VTAVIHGDWS GAWTAIKDVF
AGLWNGIIDI VKGALEVVWN LLVVAWDLIG TGIRTAWDGI SDYFRTMFGG VLDNLDGFLE
QFKNGVAVAW QWIKDASAAV WDEITNGIVE AFRGILSGIK SPLNLMITAV NKLIEGANTV
GSALGFGGIP LIPYLADGVK NWAGGLAFAS EPWKGIEAMR TRGGDFALLP PGISNIPRGA
EVFTAEETKG MTARMVVPQG GMVGARGMEG ASGTTIVNYY TYEIAVDARE AMNPAAVEAA
AIRGAEKAIK NYVEKANIQK KTNPFGK