Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2883 |
Symbol | |
ID | 6967827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2675967 |
End bp | 2679209 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643386727 |
Product | tail length tape measure protein |
Protein accession | YP_002271198 |
Protein GI | 209400454 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000541295 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG GAGATCCAGC GGGCGTCCCG TATGGGCAGT GAATATTACC GGACCCTGCA GAATGGCGGG CGTCAGGCCG CTGCGGCAGC CCGGGAGCAG CGACGTGCCC TGGCAGAACT GAACAGCCAG TTGACGGAAA TTCGCGGTTC TGCTGTCGGA ATGGCTGGCG CATTTGCCGG TGCCTTTGCC TCCGGACACC TGATTTCACT GGCGGATGAG TGGAGTTCCG TAAATGCCCG TCTGAAACAG GCATCACAGT CATCCGATGA ATTTTCGTCA TCACAGAAAG TGCTGATGGA TATCAGCCAG CGGACAGGCA CGGCATTTTC GGATAATGCG GCCCTGTTTG CCCGTTCGGC TGCCTCGATG CGTGAATATG GTTACAGTGC CGGTGATGTA CTGAAGGTGA CGGAGGCCAT TTCCACGGGG CTGAAAATCT CCGGTGCCAG TACGGCTGAG GCGGGTTCGG TGATCACCCA GTTCAGCCAG GCGCTGGCAC AGGGTGTGTT GCGCGGTGAG GAATTTAATT CGGTCAATGA AAGTGGTGAC CGGATCGTAC GTGCACTGGC TGCGGGTATG GGCGTGGCCC GTAAAGATCT GAAGGCAATG GCGGATGACG GAAAACTGAC AGCGGATAAA GTGGTCCCCG CGTTAATCAG CCAGCTGGGG ATATTACGTG ATGAATATGC GGCCATGCCG GAAACGGTTT CCAGTAGTAT CACGAAGGTG GAAAACGCCT TTATGGCCTG GGTGGGCGGT GCGAATGAGG CCAGCGGGGT GACAAAAACG CTCTCCGGCA TGCTGAACGG TGTTGCCGGA CAGATTGATA ATGTGGCAAC AGCCGTGGGC GCGCTGGTTG CCGTCGGGGT TGCACGTTAC TTTGGCAATA TGGCCTCCGG AGCGATGTCT GCCACGGCAG GACTTGTGAC GGCTGCACGT AATGAAGTTG CACTGGCGGA AGCACAGTTC AGGGGAACGC AGATTGCCAC GGCGCGGGCA AGGGCAGCCG TGTACCGTGC TCAGCAGGCC GTGGCGGCAG CCCGCGGGAC GGAGATGCAG ATTGCAGCAG AGGCCCGTCT GGCGGCCACA CAGGAACGCC TGAACAGAAA TATTGCTGCC AGAACCGCCG CCCAGAATGC GCTGAACAGT ACAACGGCGG TGGGCTCACG TCTGATGAGC GGTGCGCTGG GGCTGGTTGG TGGCGTACCC GGACTGGTGA TGCTGGGGGC TGCAGCATGG TACACGCTGT ACCAGAATCA GGAGCAGGCC AGGGAGTCTG CGCGCCAGTA TGCACTGACG ATAGATGAAA TCGCGCATAA AACGCCGTCA ATGTCTCTGC CTGAAGCCTC AGATAATGAA GGACGAACAC GGGCGGCGCT GACAGAGCAG AACCGGCTGA TTGATGAACA GGCCAGTCGG GTGAAATCCC TGCAGGAAAA AATCGCAGGA TATCAGTATG TTCTGGCGAA CCCGGGCTGG ACGACCGGTG ACGGATTCAT GATAAACCAT CTGACATCGG TGAAGACCGT AACGGAAGGG CTTGCTCAGG CAACAGAGCA GCTTGCCGTT GAGCAGTCCC GTCTGGCACA GATGCAGGAA AAAGCGCAGT CCATTCAGGA TGTGCTTGCC GGGCTGGAAG ACCGTCGTGT GGCGTTAATT CGTCAGCAGG CGGCAGAGCA GAATAAGGTG TACCAGTCCA TGCTGGTTAT GAACGGTCAG CATACGGAAT TCAACCGTCT GCTGGGGCTG GGGAATGAAC TGCTTCAGCA GCGGCAGGGA CTGGTGAATG TGCCGTTACG GCTGCCACAG GCCACTCTGG ATGATAAACA GCAGAGTGCC CTGACAAAAA CAGAGCGTGA GCTGGCCCTG TCCAGACTGA AAGGGGAAGA AAAAGAGCGC GTCCGACTGG GGTATGCGGC GGATGACCTC GGTTTTGTGG GGGATCCGTA TCAGGAGGCG AGACAACGTT ATATCAGTAA TGCCCTGGAA GCCTGGCGCA ATAACGAGGT GAATAAACCC AAATCCCGGG GTGGAAAATC AGAGACGGAA AAAGCGGAAG ACAGTTTTTC CCGGCTGCTG AAGCAGCAGA AAGAGCAACT GGCACTGGCC GGTCAGAACA CGGAGCTGGC GAAGCTGAAG TACCAGACAG CGCTGGGTGA ACTGAAAACC CTGTCGGAGA TACAGAAGCA GGAACTGCTG CGCAATGCGG CCCTGATTGA CCAGCAAAAA ATCCGGGAGC AGTTGCGGTA CCGGGAAGAG ACCCTGAAGA ATGATAATGT GGCTGCGCGT GCATCAAATG AATCTGAACT GCTGGGGTAC GGGCAGGGGG AACGAGCCAG GGAACGCATG CGGGAGTTGC AGCAGATCCG CGACAGCTTC CGCCAGAAGG ATGCGGACCT TCAGTCTCAG TATCAGACCG GGGATATCAG TGAGGATTTT TACAGACAGG CGCTGGCACA GAACGCGCAG TATCTGAGCG AACGCCTTAA GGACCAGGCA GTCTTTTATG CCGAATCGGA TGTGCAGCGT GCGGACTGGC AGAAAGGGCT GCAGGAGGGA TTCAGTAACT GGGTGGATAA TGCGTCCGAT TACGCCTCAC AGGCAGCACA GCTGGCGACG GAGGGTATCT CAGGGATGGT GAATAACATC ACGGAGATGC TGAACGGAAA TAAAGTGGAA TGGCGCAGCT GGGCCTCATC CGTACTGCAG GAAATATCAA AAGTTCTTAT GAATGCCGCG ATTGTCAACG GAATTAAGAC GGCGGCAAAC GGTATGTCCG GTGCGGGAGG ATTTCTCGGC AGCATTGGTG ACTGGCTGGG CGGAGCGGTG GCCAATGCAA AAGGCGGCGT GTATACCTCG GCAAACCTGA GCGCGTACAG CAACAGCATT GTGGACACGC CCACGTACTT TGCGTTTGCA AAAGGGGCCG GGCTGATGGG GGAAGCCGGA CCTGAAGCTA TTATGCCCCT GACCCGGGCG GCGGATGGCT CGCTGGGCGT ACGCGCCGTG GGCAGTATGA ACGGCAGTGC TGGTCTGGTG TATTCCCCGG TCTACCACAT TGCCATTCAG AATGACGGGG CTAACGGACA GATAGGGCCG GAGGCGGCAG GCAGTCTTGT GCAGCTGATT GACCAGCGGG TGCAGGCGGT GATGCTGTCC ATGCGACGTG ACGGAGGAAT GCTGAGTGGC TGA
|
Protein sequence | MATLRELIIK ISANSQSFQS EIQRASRMGS EYYRTLQNGG RQAAAAAREQ RRALAELNSQ LTEIRGSAVG MAGAFAGAFA SGHLISLADE WSSVNARLKQ ASQSSDEFSS SQKVLMDISQ RTGTAFSDNA ALFARSAASM REYGYSAGDV LKVTEAISTG LKISGASTAE AGSVITQFSQ ALAQGVLRGE EFNSVNESGD RIVRALAAGM GVARKDLKAM ADDGKLTADK VVPALISQLG ILRDEYAAMP ETVSSSITKV ENAFMAWVGG ANEASGVTKT LSGMLNGVAG QIDNVATAVG ALVAVGVARY FGNMASGAMS ATAGLVTAAR NEVALAEAQF RGTQIATARA RAAVYRAQQA VAAARGTEMQ IAAEARLAAT QERLNRNIAA RTAAQNALNS TTAVGSRLMS GALGLVGGVP GLVMLGAAAW YTLYQNQEQA RESARQYALT IDEIAHKTPS MSLPEASDNE GRTRAALTEQ NRLIDEQASR VKSLQEKIAG YQYVLANPGW TTGDGFMINH LTSVKTVTEG LAQATEQLAV EQSRLAQMQE KAQSIQDVLA GLEDRRVALI RQQAAEQNKV YQSMLVMNGQ HTEFNRLLGL GNELLQQRQG LVNVPLRLPQ ATLDDKQQSA LTKTERELAL SRLKGEEKER VRLGYAADDL GFVGDPYQEA RQRYISNALE AWRNNEVNKP KSRGGKSETE KAEDSFSRLL KQQKEQLALA GQNTELAKLK YQTALGELKT LSEIQKQELL RNAALIDQQK IREQLRYREE TLKNDNVAAR ASNESELLGY GQGERARERM RELQQIRDSF RQKDADLQSQ YQTGDISEDF YRQALAQNAQ YLSERLKDQA VFYAESDVQR ADWQKGLQEG FSNWVDNASD YASQAAQLAT EGISGMVNNI TEMLNGNKVE WRSWASSVLQ EISKVLMNAA IVNGIKTAAN GMSGAGGFLG SIGDWLGGAV ANAKGGVYTS ANLSAYSNSI VDTPTYFAFA KGAGLMGEAG PEAIMPLTRA ADGSLGVRAV GSMNGSAGLV YSPVYHIAIQ NDGANGQIGP EAAGSLVQLI DQRVQAVMLS MRRDGGMLSG
|
| |