Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1549 |
Symbol | |
ID | 6971405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1513050 |
End bp | 1516292 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643385517 |
Product | tail length tape measure protein |
Protein accession | YP_002270011 |
Protein GI | 209400951 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0203252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00310541 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG GAGATCCAGC GGGCGTCCCG TATGGGCAGT GAATATTACC GGACCCTGCA GAATGGCGGG CGTCAGGCCG CTGCGGCAGC CCGGGAGCAG CGACGTGCCC TGGCAGAACT GCACAGCCAG TTGACGGAAA TTCGCGGTTC TGCTGTCGGA ATGGCTGGCG CATTTGCCGG TGCCTTTGCC TCCGGACACC TGATTTCACT GGCGGATGAG TGGAGTTCCG TAAATGCCCG TCTGAAACAG GCATCACAGT CATCCGATGA ATTTTCGTCA TCACAGAAAG TGCTGATGGA TATCAGCCAG CGGACAGGCA CGGCATTTTC GGATAATGCG GCCCTGTTTG CCCGTTCGGC TGCCTCGATG CGTGAATATG GTTACAGTGC CGGTGATGTA CTGAAGGTGA CGGAGGCCAT TTCCACGGGG CTGAAAATCT CCGGTGCCAG TACGGCTGAG GCGGGTTCGG TGATCACCCA GTTCAGCCAG GCGCTGGCAC AGGGTGTGTT GCGCGGTGAG GAATTTAATT CGGTCAATGA AAGTGGTGAC CGGATCGTAC GTGCACTGGC TGCGGGTATG GGCGTGGCCC GTAAAGATCT GAAGGCAATG GCGGATGACG GAAAACTGAC AGCGGATAAA GTGGTCCCCG CGTTAATCAG CCAGCTGGGG ATATTACGTG ATGAATATGC GGCCATGCCG GAAACGGTTT CCAGTAGTAT CACGAAGGTG GAAAACGCCT TTATGGCCTG GGTGGGCGGT GCGAATGAGG CCAGCGGGGT GACAAAAACG CTCTCCGGCA TGCTGAACGG TGTTGCCGGA CAGATTGATA ATGTGGCAAC AGCCGTGGGC GCGCTGGTTG CCGTCGGGGT TGCACGTTAC TTTGGCAATA TGGCCTCCGG AGCGATGTCT GCCACGGCAG GACTTGTGAC GGCTGCACGT AATGAAGTTG CACTGGCGGA AGCACAGTTC AGGGGAACGC AGATTGCCAC GGCGCGGGCA AGGGCAGCCG TGTACCGTGC TCAGCAGGCC GTGGCGGCAG CCCGCGGGAC GGAGATGCAG ATTGCAGCAG AGGCCCGTCT GGCGGCCACA CAGGAACGCC TGAACAGAAA TATTGCTGCC AGAACCGCCG CCCAGAATGC GCTGAACAGT ACAACGGCGG TGGGCTCACG TCTGATGAGC GGTGCGCTGG GGCTGGTTGG TGGCGTACCC GGACTGGTGA TGCTGGGGGC TGCAGCATGG TACACGCTGT ACCAGAATCA GGAGCAGGCC AGGGAGTCTG CGCGCCAGTA TGCACTGACG ATAGATGAAA TCGCGCATAA AACGCCGTCA ATGTCTCTGC CTGAAGCCTC AGATAATGAA GGACGAACAC GGGCGGCGCT GACAGAGCAG AACCGGCTGA TTGATGAACA GGCCAGTCGG GTGAAATCCC TGCAGGAAAA AATCGCAGGA TATCAGTATG TTCTGGCGAA CCCGGGCTGG ACGACCGGTG ACGGATTCAT GATAAACCAT CTGACATCGG TGAAGACCGT AACGGAAGGG CTTGCTCAGG CAACAGAGCA GCTTGCCGTT GAGCAGTCCC GTCTGGCACA GATGCAGGAA AAAGCGCAGT CCATTCAGGA TGTGCTTGCC GGGCTGGAAG ACCGTCGTGT GGCGTTAATT CGTCAGCAGG CGGCAGAGCA GAATAAGGTG TACCAGTCCA TGCTGGTTAT GAACGGTCAG TATACGGAAT TCAACCGTCT GCTGGGGCTG GGGAATGAAC TGCTTCAGCA GCGGCAGGGA CTGGTGAATG TGCCGTTACG GCTGCCACAG GCCACTCTGG ATGATAAACA GCAGAGTGCC CTGACAAAAA CAGAGCGTGA GCTGGCCCTG TCCAGACTGA AAGGGGAAGA AAAAGAGCGC GTCCGACTGG GGTATGCGGC GGATGACCTC GGTTTTGTGG GGGATCCGTA TCAGGAGGCG AGACAACGTT ATATCAGTAA TGCCCTGGAA GCCTGGCGCA ATAACGAGGT GAATAAACCC AAATCCCGGG GTGGAAAATC AGAGACGGAA AAAGCGGAAG ACAGTTTTTC CCGGCTGCTG AAGCAGCAGA AAGAGCAACT GGCACTGGTG GGGCAGAATA CAGAGCTGGC GAAGCTGAAA TACCAGACAG CGCTGGGTGA ACTGAAAACC CTGACGGAGA TGCAGAAGCA GGAACTGCTG CGTAACGCGA CCCTGATTGA CCAGCAAAAA ATCCGGGAAC AGTTGCGATC CCGGGAAGAG ACACTGAAGA ATGAGAATGC GGCTGCGCGT GCGTCGAATG ATGCTGAACT GCTGGGGTAC GGGCAGGGGG AGCGAGCCAG AGAACGCATG CGGGAGTTGC AGCAGATCCG CGACAGCTTC CGCCAGAAGG ATGCGGACCT TCAGTCTCAG TATCAGACCG GGGATATCAG TGAGGATTTT TACAGACAGG CTCTGGCACA GAACGCGCAG TATCTGAGCG AACGCCTTAA GGACCAGGCA GTCTTTTATG CCGAATCGGA TGTGCAGCGT GCGGACTGGC AGAAAGGGCT GCAGGAGGGA TTCAGTAACT GGGTGGATAA TGCGTCCGAT TACGCCTCAC AGGCAGCACA GCTGGCGACG GAGGGTATCT CAGGGATGGT GAATAACATC ACGGAGATGC TGAACGGAAA TAAAGTGGAA TGGCGCAGCT GGGCCTCATC AGTGCTGCAG GAAATATCAA AAGTTCTTAT GAATGCCGCG ATTGTCAACG GAATTAAGAC GGCGGCAAAC GGTATGTCCG GTGCGGGAGG ATTTCTCGGC AGCATTGGTG ACTGGCTGGG CGGAGCGGTG GCCAATGCAA AAGGCGGCGT GTATACCTCG GCAAACCTGA GCGCGTACAG CAACAGCATT GTGGACACGC CCACGTACTT TGCGTTTGCA AAAGGGGCCG GGCTGATGGG GGAAGCCGGA CCTGAAGCTA TTATGCCCCT GACCCGGGCG GCGGATGGCT CGCTGGGCGT ACGCGCCGTG GGCAGTATGA ACGGCAGTGC TGGTCTGGTG TATTCCCCGG TCTACCACAT CGCCATTCAG AATGACGGGG CTAACGGACA GATAGGGCCG GAGGCGGCAG GCAGTCTTGT GCAGCTGATT GACCAGCGGG TGCAGGCGGT GATGCTGTCC ATGCGACGTG ACGGAGGAAT GCTGAGTGGC TGA
|
Protein sequence | MATLRELIIK ISANSQSFQS EIQRASRMGS EYYRTLQNGG RQAAAAAREQ RRALAELHSQ LTEIRGSAVG MAGAFAGAFA SGHLISLADE WSSVNARLKQ ASQSSDEFSS SQKVLMDISQ RTGTAFSDNA ALFARSAASM REYGYSAGDV LKVTEAISTG LKISGASTAE AGSVITQFSQ ALAQGVLRGE EFNSVNESGD RIVRALAAGM GVARKDLKAM ADDGKLTADK VVPALISQLG ILRDEYAAMP ETVSSSITKV ENAFMAWVGG ANEASGVTKT LSGMLNGVAG QIDNVATAVG ALVAVGVARY FGNMASGAMS ATAGLVTAAR NEVALAEAQF RGTQIATARA RAAVYRAQQA VAAARGTEMQ IAAEARLAAT QERLNRNIAA RTAAQNALNS TTAVGSRLMS GALGLVGGVP GLVMLGAAAW YTLYQNQEQA RESARQYALT IDEIAHKTPS MSLPEASDNE GRTRAALTEQ NRLIDEQASR VKSLQEKIAG YQYVLANPGW TTGDGFMINH LTSVKTVTEG LAQATEQLAV EQSRLAQMQE KAQSIQDVLA GLEDRRVALI RQQAAEQNKV YQSMLVMNGQ YTEFNRLLGL GNELLQQRQG LVNVPLRLPQ ATLDDKQQSA LTKTERELAL SRLKGEEKER VRLGYAADDL GFVGDPYQEA RQRYISNALE AWRNNEVNKP KSRGGKSETE KAEDSFSRLL KQQKEQLALV GQNTELAKLK YQTALGELKT LTEMQKQELL RNATLIDQQK IREQLRSREE TLKNENAAAR ASNDAELLGY GQGERARERM RELQQIRDSF RQKDADLQSQ YQTGDISEDF YRQALAQNAQ YLSERLKDQA VFYAESDVQR ADWQKGLQEG FSNWVDNASD YASQAAQLAT EGISGMVNNI TEMLNGNKVE WRSWASSVLQ EISKVLMNAA IVNGIKTAAN GMSGAGGFLG SIGDWLGGAV ANAKGGVYTS ANLSAYSNSI VDTPTYFAFA KGAGLMGEAG PEAIMPLTRA ADGSLGVRAV GSMNGSAGLV YSPVYHIAIQ NDGANGQIGP EAAGSLVQLI DQRVQAVMLS MRRDGGMLSG
|
| |