Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0946 |
Symbol | |
ID | 5592651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 943233 |
End bp | 946310 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640920116 |
Product | phage tail tape measure protein |
Protein accession | YP_001457683 |
Protein GI | 157160365 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA ACAACCTGCG TCTGCAGGTC ATTCTTAATG CGGTTGACAA GCTCACCCGC CCATTTCGAT CTGCGCAGGC CAGTTCAAGA GAACTGGCTG CTGCTGTCAA AAAATCCCGC GATGCAATAA AGCAGCTTGA TCAGGCCGGG AGCAGTCTGG ACAGCTTCCG AAAGCTGCAG GCAGAAAATC AGAAATTAGG CGACAGGCTG AACTATGCCC GCCAGCGTGC AAATTTGCTC AGTCAGGAAC TGGGAGCGAT GGGGCCGCCT TCGCAACGTC AGGTTGTTGC TCTGGGCCGT CAACGGCTGG CTGTTCAGCG CCTGGAAGAA CGCCAGAAAA AGCTGCAGCA GCAGACGGCG CTTGTGCGTG CTGAACTGTA CCGGGCGGGA ATTTCTGCGA AAGACGATGC GGGAGCAACT GCCCGTTTAG CCCGTGAAAC ATCACGTTAT AACCAGGAAC TTTCGAAACA GGAGGCGCGG CTGAAGCGAC TGGGGGAAGC TCAGCGCAGG ATGAATGCAG CGCGTGCCAG TTATACCCGT TCGCTGGAGG TGCGTGATCG TATTGCAGGT GCCGGAGCCA CCACCACGGC TGCAGGGCTG GCAATGGGTG CGCCAGTGAT GGCGGCAGTA AAAAGCTATA CCAGCATGGA AGATGCCATG AAAGGTGTGG CAAAGCAGGT CAATGGTCTG CGTGACGATA ATGGCAACCG CACTGCACGT TTTTATGAAA TGCAGGATGC CATCAAGGCT GCCAGCGAAC AGTTGCCGAT GGAAAACGGT GCGGTGGACT TCGCTGCACT GGTTGAAGGT GGTGCGCGCA TGAACGTCGC AAACCCTGAC GACAGCTGGG AAGATCAGAA ACGTGACCTG CTGGCCTTCG CCAGTACGGC AGCAAAGGCG GCAACAGCCT TTGAGCTGCC AGCGGATGAA CTGTCAGAAA GTCTGGGGAA AATCGCCCAG CTCTACAAAA TCCCTACCCG CAATATTGAA CAGCTCGGTG ATGCGCTGAA TTATCTGGAT GATAACGCCA TGTCGAAAGG GGCAGACATC ATTGATGTGA TGCAACGTCT GGGCGGTGTG GCTGACCGTC TGGATTATCG TAAAGCGGCG GCGCTGGGTT CCACCTTCCT GACACTGGGC GCTGCGCCGG AGGTTGCAGC CAGTGCAGCA AACGCGATGG TGCGTGAATT GTCCATTGCC ACCATGCAAA GCAAGAGTTT TTTTGAAGGG ATGAATCTGC TGAAACTCAA TCCTGAAGTG ATTGAAAAGC AGATGACGAA GGATGCGATG GGAACCATCC AGCGCGTGCT GGAGAAGGTG AACGCACTGC CGCAGGATAA GCGCCTGTCT GCCATGACCA TGTTGTTTGG TAAAGAGTTT GGCGATGATG CGGCGAAACT GGCAAACAAC CTGCCGGAAC TGCAGCGCCA GCTAAAACTG ACAGCGGGCA ATGATGCGCT CGGTTCGATG CAGAAAGAAT CCGACATTAA CAAGGACTCA CTTTCTGCGC AGTGGTTGCT GGTTAAAACC GGAGCGCAGA ACACCTTCAG CAGCCTGGGC GAAACGCTGC GCCAGCCGCT GATGGATATT CTGTACACGG TGAAAAGCAT CACGGGGGCG TTGCGCCGCT GGGTGGAAGC TAACCCGGAA CTGACAGGCA CACTGATGAA AGTAGCGGCT GTTGTGGCTG CGGTTACCGT AGGCCTCGGC ACCTTAGCGG TGGCGCTGGC TGCAGTTCTG GGGCCGCTGG CAGTGATCCG TCTGGGATTC TCTGTGCTGG GTATCAAAAC GTTACCTTCC GTTACGGCAG CAGTAACTCG AACCAGCAGC GCGTTGTCCT GGCTGGCTGG CGCACCACTG GCACTGCTGC GACGCGGGCT TGCTTCATCG GGCAACGCCG CAGGTTTACT TACTGCGCCG TTGTCGTCTT TGCGCCGCAC GGCATCACTG ACGGGAAATG TCCTGAAAAC TGTAGCAGGT GCGCCGGTTG CACTGTTGCG GTCTGGATTA TCCGGTTTAC GTGCGGTTGC TGTGATGTTT ATGAATCCAC TGGCAGCAAT ACGCGGCGGG CTGGCTGCCG CAGGCACGGT GCTGCGAGTA CTGGCATCTG GTCCACTGGC GATGTTGCGC GTTGCCCTGT ATGCCGTATC TGGTCTGTTA GGTGCTCTGC TCAGTCCGAT AGGTCTTGTG GTTACTGCAC TGGCGGGTGT GGCACTGGTT GTCTGGAAAT ACTGGCAACC CATCACCGCA TTTCTCGGTG GCGTGGTGGA AGGATTCAAA GCGGCGGCAG GTCCCATCAG TGCAGCGTTC GAACCGCTTA AGCCTGTGTT CCAGTGGATT GGTGACAAAG TGCAGGCGCT GTGGGGCTGG TTTACTGATC TGCTGACGCC CGTTACGTCG ACCTCTGCCG AACTGCAGAG CGCAGCGGCA ATGGGGCGAC AATTCGGGGA GGCACTGGCG GAAGGGCTGA ATAAGGTTAT GCATCCGCTG GACTCCCTGA AATCCGGCGT TTCCTGGTTG CTGGAGAAAC TTGGCATTGT CAGTAAAGAG GCCGCAAAGG CAAAACTGCC GGAAAGCGTG ACGCGTCAGC AACCTGCGAC GGTGAATGCA GACGGTAAAG TGATGATGCC ATCGGGTGGT TTTCCGTCAT GGGGATATGG CTTTGCGGGG ATGTATGACA GCGGCGGGTA TATCCCGCGC GGGCAGTTTG GCATCGTCGG TGAAAACGGG CCGGAAATTG TTAACGGCCC GGCAAATGTG ACCAGCCGGA GAAATACAGC TGCACTGGCT GCCGTTGTTG CCGGAATGAT GGGCGTTGCT GCCGCGCCAG CAGAGCTTCC ACCGTTGCAC CCTTTGGCAC TTCCCGCGAA AGGTGGAGAA GCAATTGTGA GTCGCGCAGC CACTGTGCCG CCCGTTCAAC GGATTGAGGC ACCGACGCAG ATCATCATTC AGACGCAGCC AGGACAAAGT GCGCAGGATA TTGCGCGGGA GGTGGCACGC CAGCTTGATG AACGTGAACG CAGGCTGAAG GCAAAAGCCA GGAGTAACTA CAGCGATCAG GGGGGATACG ACGCATGA
|
Protein sequence | MSDNNLRLQV ILNAVDKLTR PFRSAQASSR ELAAAVKKSR DAIKQLDQAG SSLDSFRKLQ AENQKLGDRL NYARQRANLL SQELGAMGPP SQRQVVALGR QRLAVQRLEE RQKKLQQQTA LVRAELYRAG ISAKDDAGAT ARLARETSRY NQELSKQEAR LKRLGEAQRR MNAARASYTR SLEVRDRIAG AGATTTAAGL AMGAPVMAAV KSYTSMEDAM KGVAKQVNGL RDDNGNRTAR FYEMQDAIKA ASEQLPMENG AVDFAALVEG GARMNVANPD DSWEDQKRDL LAFASTAAKA ATAFELPADE LSESLGKIAQ LYKIPTRNIE QLGDALNYLD DNAMSKGADI IDVMQRLGGV ADRLDYRKAA ALGSTFLTLG AAPEVAASAA NAMVRELSIA TMQSKSFFEG MNLLKLNPEV IEKQMTKDAM GTIQRVLEKV NALPQDKRLS AMTMLFGKEF GDDAAKLANN LPELQRQLKL TAGNDALGSM QKESDINKDS LSAQWLLVKT GAQNTFSSLG ETLRQPLMDI LYTVKSITGA LRRWVEANPE LTGTLMKVAA VVAAVTVGLG TLAVALAAVL GPLAVIRLGF SVLGIKTLPS VTAAVTRTSS ALSWLAGAPL ALLRRGLASS GNAAGLLTAP LSSLRRTASL TGNVLKTVAG APVALLRSGL SGLRAVAVMF MNPLAAIRGG LAAAGTVLRV LASGPLAMLR VALYAVSGLL GALLSPIGLV VTALAGVALV VWKYWQPITA FLGGVVEGFK AAAGPISAAF EPLKPVFQWI GDKVQALWGW FTDLLTPVTS TSAELQSAAA MGRQFGEALA EGLNKVMHPL DSLKSGVSWL LEKLGIVSKE AAKAKLPESV TRQQPATVNA DGKVMMPSGG FPSWGYGFAG MYDSGGYIPR GQFGIVGENG PEIVNGPANV TSRRNTAALA AVVAGMMGVA AAPAELPPLH PLALPAKGGE AIVSRAATVP PVQRIEAPTQ IIIQTQPGQS AQDIAREVAR QLDERERRLK AKARSNYSDQ GGYDA
|
| |