Gene Haur_1369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1369 
Symbol 
ID5733261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1585164 
End bp1589207 
Gene Length4044 bp 
Protein Length1347 aa 
Translation table11 
GC content57% 
IMG OID641278507 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001544142 
Protein GI159897895 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.275102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATG AAGAATTAGG GCGAGCCATT ATTAAAATCG GCGCGGATGA TGCCGAGCTT 
GATCGTTCGA TCAGCAACGC CGAAACCAAA GCCAAAGGCT TCGCGCAATC CTTAGCCGAA
AGTCTGAGTA CCGCACTGAC CACCGCCATT GCAGGCGCGG TGGCTGCCGC AGGCGCAGCC
TTGGCAGGAC TGGCGGTCAC GGGGGTCAAC GCGTTTGCGG GCTTCCAAGA GCAGATGAAC
AGCGTGTTCA CTCTCTTGCC TGGTATCTCC CAACAGGCCA TGCAGCAGAT GGGTGAGCAG
GTCAAAGACT TTGCGGTGGA GTTTGGAGCG CTGCCTGAAA AGGTGATCCC CGCGCTCTAT
GAAGCCCTCT CGTCCGGCGT GCCCAAAGAC AATGTATTTT CCTTCCTTGA AACCGCACAG
AAAGCGGCTA TCGGCGGCGT AACGGATACC CAAACCACCG TGGACGGTTT GACCAGCGTG
GTCAATGCCT ACGGCGCGGA CGTGCTGAGC GTGCAGGACG CGAGTGATCA GATGTTCACC
GCCGTGGCCT ATGGCAAAAC CACCTTTGCC GAACTGGCGA GCACGCTCTA CAACGTCAAC
CCGATTGCCG CCAGCTTGGG CGTATCCTTC AGCGACGTGT CGGCAGCCAT CGCCGCCATG
ACTTCCCAAG GTGTGCCCAC GGCCCAAACC ACCACGATGT TGCGCCAGTT GTTTGTGGAG
TTATCGCAAT CCGGTGGCGA GGTTTCTGAC CTCTTCCAAC AGTTATCGGG CACGTCGTTC
AAGAGCTTTA TTGAAAGTGG TGGCAATGTT CAACAAGCGC TGCAACTGCT GGAGCAGCAC
GCCAATGCCA GCGGCGTGGG CATCAATGAC TTATTTAGTA GTGTTGAAGC GGGTAGTGCA
GCACTGACCC TGACAGGCCG TGGCACCGAG ATGTTTACGG GGGCGCTCGA TGCGATGCAG
AACAGCGCCG GAGCCACCGA GGCCGCCTAC ACCCAAATGG ATCAGGGGTT AGCGCGAGCC
TTTGACGGGA TTCGTGCCCA GTTTGCCGTG CTGCAAACGA ATATTGGCAA TGCGTTAGCC
CCCACCGTGC AAGCCTTTGC CGATTGGTTG GCCGAAGCCA TGCCCCGCAT CAGCGCCGTG
GTCGTGGGAG CGTTTGAGCG GATTGGCGCA GCGGTTCAGA TGGTTGGCCC GTATTTCCTT
GACTTTGCCA GCACCGCGAG CGAAGCCTTT GGTCTGTTCA TTGACCTCGC ATCAAGCGCC
GTGCAATGGG GCAGCAACAT TGCCACCCAG TTAGCCAACG GGATTATGGG TGGTGCTGGT
GCGGTCGTCG ATAGCTTGGG ATACATTGGC GACCTTATCA CCTACTGGCT TGAACCGCAC
TCCCCACCAA AGTTATTACC CAACATCGAC ACATGGGGTC GCGATACCGC CCAAGTCTGG
ATGGATGGCT GGAATGATGC GCGGCTGCCC ACCGATGGCT TGCTGGCCTA TCTCGAAACC
GAACTGAAAT CCATTGAAGA TGCGCAGCAG CGGGTCAAGG AAGCAGCCCA AGAACAGCAG
CTCTTGGCGC TGATCAACAG CACGGGTGGT AAAGACAGCG ACCGCGAACA GGCCAAGCTC
GAATTGGCCG CGCTGAAGCT GCGCCAAAAG ATTCGCGAGG AGCAGGCTAA AGCTGCCGAG
CAAGAAACCA AAAAGCCCAA AGCGAGTGGT GGCGGAGGTG GCGGTGGCAA ATCCCCGATG
GACGATGCCG CCAAGAACGC TGAAGCGGCG GCCCGTGCGC AGTGGGAATA CACATTCAGT
ATTGCCGGAA CTGTCGATCG CTTGCAGATG CTCAAGGACA AGCAAGCCGC CTATGCTACG
ACCGATGCCG AGTATTGGCG ACTGCAAGGG CAGATTAATC AGGAAGAGCA GAAACGTCAG
GCGGAACTCA ATAAACTGGC CAAGGAGCAA CGCGACTATG AGTTAAGCCT CATGTCCACT
GAGGATCAGT TAGCTCGCTT GCGCCAAGAA CAGGGTCAAT ATGCCGAGGG CAGCGCCGAA
TACAACGACA TTCAGCAGGA GATTAACAAA GCCGAGAAAC AACGCCAGCG CGAGCTGGAG
GAAGTCAAAC GCAAACAGGA CGAAGCGGCC AAAGCCGAGC GGGATTATCA GTATGCCACC
GCTGACACCG CTGGCAAACT AGCAATCCTA CAAGGTGAAT TGGCCAATAC CAACGCCGAT
CAATCCGAAT ACTGGCGCAT CAAAACCCAG ATTTTGCAGC TCGAAGCCCA GCAACAAAAA
GAGCTGGAAG CGTCAGCCGA GAAGATGAAG GGTGTAGGCG GAGCCGCCAA GGGAGCAGCC
AAAGGCGTGG GTGCGCTCGT TCCACCATTC ACCAAGGTCA AGGATGGGGC GGACGAAACC
AATCAATCCA TGCAGGATGC AGCCACCGGA GCCGAGGCAG CCGCCGATCG CTATGCTGAT
TTGAAGGATC GCCAAGTCGA AATGGCGACT GCCACTGCGC CTGTACCGTC CTTGATCGAC
CGGATTCGCA ACGCCTTCAG CCAAGCCGCC CCGTTTATTG AATCGGTTAA AGGCGCATTC
ATGGGCATTG CGGCACTCTT CACGGGTATG GGACTGGTGT CCCTGTTCTC CGGTTTGGCG
GGAGCGGTAG CCGCACTCGT TTCACCGATG GGCTTGCTGG TAGCTGGAGC CGCAGCGCTG
GGATTGGCGT GGCAAACCAA TTTCGCCACT ATTCGCACCA TCACGACCGA GGTCTTTACA
TCGATTCAGT CCATGGTACA AACCGCGTGG GGCATCATCA CCAGCTGGTT CCGCGAGAAC
GGCGAGCAGA TCGTGACGTT CCTCCGCAAT GCGTGGGGCC GGATTGAGGG CATTGTCAGC
ACGGTGCTGG GCGCGGTTGG CACGGTGATT CAAACGGCAC TCGGCGCAAT CCGAACCTTC
CTCGAGCAGC ACGGCGAAGA AATCAAAGGC GTGCTGACGA TGGCGTGGGA AACCATTCAG
GTCGTCATTG ATGGCGCACT GGATGTGATC AATGGTGCGA TCATTCCATT CTTCCACGGC
ATCGCCACCT TCCTCAGCGA CCACAAAGAC CAGATCACGG GCCTGCTCAG TGGCGCATGG
AGCATCATCA AGGGCATCGT TGAAGCCGCC ATGAGCGTGA TTCAAGGCGT GATCAAAACC
GTCAGCGCCG TGATTCAGGG CGATTGGTCA GGCGCATGGA CGGCAATCAA GGATGTCTTT
GCTGGACTCT GGAATGGCAT CATCGACATC GTGAAGGGAG CACTGGAGGT CGTCTGGAAC
CTCTTGGTGG TGGCGTGGGA TCTCATTGGC ACGGGCATCA GCACCGCGTG GGATGGCATC
AGCGACTACT TCCGCACGAT GTTTGGCGGT GTGCTCGACA ACCTTGATGG CTTTTTGGAG
CAATTCAAAA ACGGGGTTGC GGTGGCATGG CAATGGATTA AGGATGCCAG TGCTGCGGTG
TGGGATGAAA TTACCAACGG GATTGTAGAA GCCTTTCGCG GCATCCTCAG TGGCATTAAA
TCCCCACTCA ACCTGATGAT CACCGCCGTC AACAAACTCA TTGAAGGCGC GAATACCGTT
GGCTCCGCCT TGGGCTTCGG TGGTATTCCA CTGATTCCCT ACTTAGCAGA CGGCGTGAAA
AACTGGGCGG GCGGTTTGGC CTTTGCGTCC GAGCCGTGGA AAGGCATCGA AGCCATGCGC
ACGCGGGGTG GTGACTTTGC CCTGCTGCCA CCAGGTATTA GCAATATTCC ACGCGGCGCG
GAGGTGTTCA CCGCCGAAGA AACCAAGGGC ATGGCTGCGC GGATGGTCGT CCCGCAAGGC
GGCATGGTGG GCACGCTGTC CAATGCGTTG CCAACGAATC AACCATCTGC GCCGTCGATC
ATCAATCAGA TCACCATCGA TGCGCGACAG GCCACCAATC CGGCAGCCAT CAAAGCCGCC
GTTGAAGAAG TCTGGAACAA GAAGATGAAA GATATTTTGG GTAGCGCCAA TATCCTGCTC
AAAACCAATC CATATGGCAA ATAA
 
Protein sequence
MADEELGRAI IKIGADDAEL DRSISNAETK AKGFAQSLAE SLSTALTTAI AGAVAAAGAA 
LAGLAVTGVN AFAGFQEQMN SVFTLLPGIS QQAMQQMGEQ VKDFAVEFGA LPEKVIPALY
EALSSGVPKD NVFSFLETAQ KAAIGGVTDT QTTVDGLTSV VNAYGADVLS VQDASDQMFT
AVAYGKTTFA ELASTLYNVN PIAASLGVSF SDVSAAIAAM TSQGVPTAQT TTMLRQLFVE
LSQSGGEVSD LFQQLSGTSF KSFIESGGNV QQALQLLEQH ANASGVGIND LFSSVEAGSA
ALTLTGRGTE MFTGALDAMQ NSAGATEAAY TQMDQGLARA FDGIRAQFAV LQTNIGNALA
PTVQAFADWL AEAMPRISAV VVGAFERIGA AVQMVGPYFL DFASTASEAF GLFIDLASSA
VQWGSNIATQ LANGIMGGAG AVVDSLGYIG DLITYWLEPH SPPKLLPNID TWGRDTAQVW
MDGWNDARLP TDGLLAYLET ELKSIEDAQQ RVKEAAQEQQ LLALINSTGG KDSDREQAKL
ELAALKLRQK IREEQAKAAE QETKKPKASG GGGGGGKSPM DDAAKNAEAA ARAQWEYTFS
IAGTVDRLQM LKDKQAAYAT TDAEYWRLQG QINQEEQKRQ AELNKLAKEQ RDYELSLMST
EDQLARLRQE QGQYAEGSAE YNDIQQEINK AEKQRQRELE EVKRKQDEAA KAERDYQYAT
ADTAGKLAIL QGELANTNAD QSEYWRIKTQ ILQLEAQQQK ELEASAEKMK GVGGAAKGAA
KGVGALVPPF TKVKDGADET NQSMQDAATG AEAAADRYAD LKDRQVEMAT ATAPVPSLID
RIRNAFSQAA PFIESVKGAF MGIAALFTGM GLVSLFSGLA GAVAALVSPM GLLVAGAAAL
GLAWQTNFAT IRTITTEVFT SIQSMVQTAW GIITSWFREN GEQIVTFLRN AWGRIEGIVS
TVLGAVGTVI QTALGAIRTF LEQHGEEIKG VLTMAWETIQ VVIDGALDVI NGAIIPFFHG
IATFLSDHKD QITGLLSGAW SIIKGIVEAA MSVIQGVIKT VSAVIQGDWS GAWTAIKDVF
AGLWNGIIDI VKGALEVVWN LLVVAWDLIG TGISTAWDGI SDYFRTMFGG VLDNLDGFLE
QFKNGVAVAW QWIKDASAAV WDEITNGIVE AFRGILSGIK SPLNLMITAV NKLIEGANTV
GSALGFGGIP LIPYLADGVK NWAGGLAFAS EPWKGIEAMR TRGGDFALLP PGISNIPRGA
EVFTAEETKG MAARMVVPQG GMVGTLSNAL PTNQPSAPSI INQITIDARQ ATNPAAIKAA
VEEVWNKKMK DILGSANILL KTNPYGK