Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2753 |
Symbol | |
ID | 6066221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3025168 |
End bp | 3028245 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602159 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001725708 |
Protein GI | 170020754 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000729903 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGATA ATAACCTGCG CCTGCAGGTC ATTCTTAATG CGGTTGACAA ACTCACCCGC CCATTCCGTG CTGCACAGGC CAGTTCGAAA GAGCTGGCTG GCGCAATCAG AAACTCCCGT GACGCATTAA AGCAACTCAA TCAGGCGGGT AACAGCCTGG AAAAATTTCG CAAGCTGCAG GCCGATAACA AAAAGTTAGG CGACAGGCTG AACTATGCCA GACAGAAGGC AAATTTGCTT AGTTCTGAGC TGGAAGCGAT GGAACAACCA TCACAAAGGC ATCTTGTGGC TTTAGGTCGG CAAACGCTGG CAGTCCAACG CCTGGAAGAA CAACAAAAAT ATTTGCAGAA GCAAACGGCG CTTGTGCGTG CAGAACTGTA CCGGGCGGGA ATTTCTGCGA AAGATGATGC GGGAGCAACT GCCCGTTTAG CCCGTGAAAC ATCACGTTAT AACCAGGAAC TTTCGAAACA AGAGGCGCGG CTGAAGCGAC TGGGGGAAGC TCAGCGCAGG ATGAATGCGG CGCGTGCCAG TTATGCCCGT TCGCTGGAGG TGCGTGATCG TATTGCAGGT GCCGGAGCCA CCACCACGGC TGCAGGGCTG GCAATGGGCG CACCAGTGAT GGCGGCAGTA AAAAGCTATA CCAGCATGGA AGATGCCATG AAAGGTGTGG CAAAGCAGGT CAATGGTCTG CGTGACGATA ATGGCAACCG CACCGCGCGT TTTTACGAAA TGCAGGATGT CATCAAGGCT GCCAGCGAAC AGTTGCCGAT GGAAAACGGT GCTGTGGACT TTGCCGCACT GGTTGAAGGT GGTGCGCGCA TGAACGTCGC AAATCCTGAC GACAGCTGGG AAGACCAGAA ACGTGACCTG CTGGCCTTCG CCAGCACGGC AGCAAAGGCG GCAACAGCCT TTGAGCTGCC AGCGGATGAA CTGTCAGAAA GTCTGGGGAA AATCGCCCAG CTCTACAAAA TCCCTACCCG CAATATTGAA CAGCTCGGTG ATGCGCTGAA CTATCTGGAT GATAACGCCA TGTCGAAAGG GGCAGACATC ATTGATGTCA TGCAACGCCT GGGCGGTGTG GCTGATCGTC TGGATTATCG TAAAGCGGCG GCGCTGGGTT CCACCTTTCT GACACTGGGC GCTGCGCCAG AGGTTGCAGC CAGTGCAGCA AACGCGATGG TGCGTGAATT GTCCATTGCC ACCATGCAAA GCAAGAGTTT CTTTGAAGGG ATGAATCTGC TGAAACTCAA TCCTGAAGTG ATTGAAAAGC AGATGACGAA GGATGCGATG GGAACTATCC AGCGTGTGCT GGAGAAGGTG AACGCACTGC CGCAGGACAA GCGTCTGTCT GCCATGACCA TGTTGTTTGG TAAAGAGTTT GGCGATGACG CGGCGAAACT GGCAAACAAC CTGCCGGAAC TACAGCGCCA GCTAAAACTG ACAGCGGGCA ATGATGCGCT CGGCTCCATG CAGAAAGAAT CCGACATCAA CAAGGACTCA CTTTCTGCTC AGTGGTTGCT GGTTAAAACC GGAGCGCAGA ATACCTTCAG CAGCCTGGGC GAAACGCTGC GCCAGCCGCT GATGGATATT CTGTACACGG TGAAAAGCGT TACGGGAGCG TTGCGTCGCT GGGTCGAAGC TAACCCGGAA CTGACGGGCA CACTGATGAA AGTAGCCGCT ATTGTGGCTG CGGTTACCGT AGGCCTCGGC ACCTTGGCTG TGGCGCTGGC TGCAGTGCTG GGGCCGCTGG CAGTCATCCG TCTGGGATTC TCTGTGCTGG GCATAAAAAC GTTACCTTCC GTTACGGCAG CAGTAACACG AACCAGCAGC GCGTTGTCCT GGCTGGCTGG CGCTCCACTG GCAGTGCTGC GACGCGGGCT TGCTTCATCG GGTAACGCAG CGGGTTTACT TACTGCGCCG TTGTCGTCTT TGCGCCGTAC GGCATCACTG ACGGGAAATG TCCTGAAAAC TGTAGCAGGT GCGCCGGTTG CACTATTGCG GTCTGGATTA TCCGCTTTAC GTGCTGTTGC TGTGATGTTT ATGAATCCTC TGGCGGTACT GCGCGGTGGA CTGGCTGCCG CAGGCGCGGT GCTTCGTGTG CTTGCATCTG GTCCGCTGGC GATGCTGCGC GTTGCCCTGT ATGCCATATC TGGTCTGTTA GGTGCTCTGC TCAGTCCGAT AGGTCTTGTG GTTACTGCAC TGGCGGGTGT GGCGCTGGTT GTCTGGAAAT ACTGGCAACC CATCACCGCA TTTCTCGGTG GCGTGGTGGA AGGATTCAAA GCGGCGGCAG GTCCCATCAG TGCTGCATTC GAACCACTTA AGCCTGTGTT TCAGTGGATT GGCGACAAAG TACAGGCGCT GTGGGGCTGG TTTACTGATC TGCTGACGCC CGTTAAGTCG ACCTCTGCCG AACTGCAGAG CGCAGCGGCA ATGGGGCGAC GATTCGGGGA GGCACTGGCG GAAGGGCTGA ATATGGTCAT GCATCCGCTG GACTCCCTGA AATCCGGTGT TTCCTGGTTG CTGGAGAAGC TCGGCATTGT CAGTAAAGAG GCTGCAAAGG CGAAACTGCC GGAAAGCGTG ACGCGTCAGC AACCTGCGAC GGTGAATGCA GACGGTAAAG TGATGATGCC ATCGGGTGGT TTTCCGTCAT GGGGATATGG CTTTGCGGGG ATGTATGACA GCGGCGGCTA TATCCCGCGC GGGCAGTTTG GCATCGTCGG TGAAAACGGG CCGGAAATTG TCAACGGCCC GGCAAACGTG ACCAGCCGGA GAAATACCGC TGCACTGGCT GCGGTTGTTG CCGGAATGAT GGGCGTTGCT GCCGCGCCTA CAGAGCTTCC ACCGTTACAT CCTTTGGCAC TTCCCGCGAA AGGCGGCGAA GCGATGGTGA GTCGTGCAGC CACTGTGCCG CCCGTTCAAC GGATTGAGGC ACCGACGCAG ATCATCATTC AGACGCAGCC AGGACAAAGT GCGCAGGATA TTGCGCGGGA GGTGGCCCGC CAGCTTGATG AACGTGAACG CAGGCTGAAG GCAAAAGCCA GGAGTAACTA CAGCGATCAG GGGGGATACG ACGCATGA
|
Protein sequence | MSDNNLRLQV ILNAVDKLTR PFRAAQASSK ELAGAIRNSR DALKQLNQAG NSLEKFRKLQ ADNKKLGDRL NYARQKANLL SSELEAMEQP SQRHLVALGR QTLAVQRLEE QQKYLQKQTA LVRAELYRAG ISAKDDAGAT ARLARETSRY NQELSKQEAR LKRLGEAQRR MNAARASYAR SLEVRDRIAG AGATTTAAGL AMGAPVMAAV KSYTSMEDAM KGVAKQVNGL RDDNGNRTAR FYEMQDVIKA ASEQLPMENG AVDFAALVEG GARMNVANPD DSWEDQKRDL LAFASTAAKA ATAFELPADE LSESLGKIAQ LYKIPTRNIE QLGDALNYLD DNAMSKGADI IDVMQRLGGV ADRLDYRKAA ALGSTFLTLG AAPEVAASAA NAMVRELSIA TMQSKSFFEG MNLLKLNPEV IEKQMTKDAM GTIQRVLEKV NALPQDKRLS AMTMLFGKEF GDDAAKLANN LPELQRQLKL TAGNDALGSM QKESDINKDS LSAQWLLVKT GAQNTFSSLG ETLRQPLMDI LYTVKSVTGA LRRWVEANPE LTGTLMKVAA IVAAVTVGLG TLAVALAAVL GPLAVIRLGF SVLGIKTLPS VTAAVTRTSS ALSWLAGAPL AVLRRGLASS GNAAGLLTAP LSSLRRTASL TGNVLKTVAG APVALLRSGL SALRAVAVMF MNPLAVLRGG LAAAGAVLRV LASGPLAMLR VALYAISGLL GALLSPIGLV VTALAGVALV VWKYWQPITA FLGGVVEGFK AAAGPISAAF EPLKPVFQWI GDKVQALWGW FTDLLTPVKS TSAELQSAAA MGRRFGEALA EGLNMVMHPL DSLKSGVSWL LEKLGIVSKE AAKAKLPESV TRQQPATVNA DGKVMMPSGG FPSWGYGFAG MYDSGGYIPR GQFGIVGENG PEIVNGPANV TSRRNTAALA AVVAGMMGVA AAPTELPPLH PLALPAKGGE AMVSRAATVP PVQRIEAPTQ IIIQTQPGQS AQDIAREVAR QLDERERRLK AKARSNYSDQ GGYDA
|
| |