Gene ECH74115_2173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2173 
Symbol 
ID6970006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2084532 
End bp2087774 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content57% 
IMG OID643386068 
Producttail length tape measure protein 
Protein accessionYP_002270557 
Protein GI209397200 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family
[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0499654 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG 
GAGATCCAGC GGGCGTCCCG TATGGGCAGT GAATATTACC GGACCCTGCA GAATGGCGGG
CGTCAGGCCG CTGCGGCAGC CCGGGAGCAG CGACGTGCCC TGGCAGAACT GAACAGCCAG
TTGACGGAAA TTCGCGGTTC TGCTGTCGGA ATGGCTGGCG CATTTGCCGG TGCCTTTGCC
TCCGGACACC TGATTTCACT GGCGGATGAG TGGAGTTCCG TAAATGCCCG TCTGAAACAG
GCATCACAGT CATCCGATGA ATTTTCGTCA TCACAGAAAG TGCTGATGGA TATCAGCCAG
CGGACAGGCA CGGCATTTTC GGATAATGCG GCCCTGTTTG CCCGTTCGGC TGCCTCGATG
CGTGAATATG GTTACAGTGC CGGTGATGTA CTGAAGGTGA CGGAGGCCAT TTCCACGGGG
CTGAAAATCT CCGGTGCCAG TACGGCTGAG GCGGGTTCGG TGATCACCCA GTTCAGCCAG
GCGCTGGCAC AGGGTGTGTT GCGCGGTGAG GAATTTAATT CGGTCAATGA AAGTGGTGAC
CGGATCGTAC GTGCACTGGC TGCGGGTATG GGCGTGGCCC GTAAAGATCT GAAGGCAATG
GCGGATGACG GAAAACTGAC AGCGGATAAA GTGGTCCCCG CGTTAATCAG CCAGCTGGGG
ATATTACGTG ATGAATATGC GGCCATGCCG GAAACGGTTT CCAGTAGTAT CACGAAGGTG
GAAAACGCCT TTATGGCCTG GGTGGGCGGT GCGAATGAGG CCAGCGGGGT GACAAAAACG
CTCTCCGGCA TGCTGAACGG TGTTGCCGGA CAGATTGATA ATGTGGCAAC AGCCGTGGGC
GCGCTGGTTG CCGTCGGGGT TGCACGTTAC TTTGGCAATA TGGCCTCCGG AGCGATGTCT
GCCACGGCAG GACTTGTGAC GGCTGCACGT AATGAAGTTG CACTGGCGGA AGCACAGTTC
AGGGGAACGC AGATTGCCAC GGCGCGGGCA AGGGCAGCCG TGTACCGTGC TCAGCAGGCC
GTGGCGGCAG CCCGCGGGAC GGAGATGCAG ATTGCAGCAG AGGCCCGTCT GGCGGCCACA
CAGGAACGCC TGAACAGAAA TATTGCTGCC AGAACCGCCG CCCAGAATGC GCTGAACAGT
ACAACGGCGG TGGGCTCACG TCTGATGAGC GGTGCGCTGG GGCTGGTTGG TGGCGTACCC
GGACTGGTGA TGCTGGGGGC TGCAGCATGG TACACGCTGT ACCAGAATCA GGAGCAGGCC
AGGGAGTCTG CGCGCCAGTA TGCACTGACG ATAGATGAAA TCGCGCATAA AACGCCGTCA
ATGTCTCTGC CTGAAGCCTC AGATAATGAA GGACGAACAC GGGCGGCGCT GACAGAGCAG
AACCGGCTGA TTGATGAACA GGCCAGTCGG GTGAAATCCC TGCAGGAAAA AATCGCAGGA
TATCAGTATG TTCTGGCGAA CCCGGGCTGG ACGACCGGTG ACGGATTCAT GATAAACCAT
CTGACATCGG TGAAGACCGT AACGGAAGGG CTTGCTCAGG CAACAGAGCA GCTTGCCGTT
GAGCAGTCCC GTCTGGCACA GATGCAGGAA AAAGCGCAGT CCATTCAGGA TGTGCTTGCC
GGGCTGGAAG ACCGTCGTGT GGCGTTAATT CGTCAGCAGG CGGCAGAGCA GAATAAGGTG
TACCAGTCCA TGCTGGTTAT GAACGGTCAG TATACGGAAT TCAACCGTCT GCTGGGGCTG
GGGAATGAAC TGCTTCAGCA GCGGCAGGGA CTGGTGAATG TGCCGTTACG GCTGCCACAG
GCCACTCTGG ATGATAAACA GCAGAGTGCC CTGACAAAAA CAGAGCGTGA GCTGGCCCTG
TCCAGACTGA AAGGGGAAGA AAAAGAGCGC GTCCGACTGG GGTATGCGGC GGATGACCTC
GGTTTTGTGG GGGATCCGTA TCAGGAGGCG AGACAACGTT ATATCAGTAA TGCCCTGGAA
GCCTGGCGCA ATAACGAGGT GAATAAACCC AAATCCCGGG GTGGAAAATC AGAGACGGAA
AAAGCGGAAG ACAGTTTTTC CCGGCTGCTG AAGCAGCAGA AAGAGCAACT GGCACTGGTG
GGGCAGAATA CAGAGCTGGC GAAGCTGAAA TACCAGACAG CGCTGGGTGA ACTGAAAACC
CTGACGGAGA TGCAGAAGCA GGAACTGCTG CGTAACGCGA CCCTGATTGA CCAGCAAAAA
ATCCGGGAAC AGTTGCGATC CCGGGAAGAG ACACTGAAGA ATGAGAATGC GGCTGCGCGT
GCGTCGAATG ATGCTGAACT GCTGGGGTAC GGGCAGGGGG AGCGAGCCAG AGAACGCATG
CGGGAGTTGC AGCAGATCCG CGACAGCTTC CGCCAGAAGG ATGCGGACCT TCAGTCTCAG
TATCAGACCG GGGATATCAG TGAGGATTTT TACAGACAGG CTCTGGCACA GAACGCGCAG
TATCTGAGCG AACGCCTTAA GGACCAGGCA GTCTTTTATG CCGAATCGGA TGTGCAGCGT
GCGGACTGGC AGAAAGGGCT GCAGGAGGGA TTCAGTAACT GGGTGGATAA TGCGTCCGAT
TACGCCTCAC AGGCAGCACA GCTGGCGACG GAGGGTATCT CAGGGATGGT GAATAACATC
ACGGAGATGC TGAACGGAAA TAAAGTGGAA TGGCGCAGCT GGGCCTCATC CGTACTGCAG
GAAATATCAA AAGTTCTTAT GAATGCCGCG ATTGTCAACG GAATTAAGAC GGCGGCAAAC
GGTATGTCCG GTGCGGGAGG ATTTCTCGGC AGCATTGGTG ACTGGCTGGG CGGAGCGGTG
GCCAATGCAA AAGGCGGCGT GTATACCTCG GCAAACCTGA GCGCGTACAG CAACAGCATT
GTGGACACGC CCACGTACTT TGCGTTTGCA AAAGGGGCCG GGCTGATGGG GGAAGCCGGA
CCTGAAGCTA TTATGCCCCT GACCCGGGCG GCGGATGGCT CGCTGGGCGT ACGCGCCGTG
GGCAGTATGA ACGGCAGTGC TGGTCTGGTG TATTCCCCGG TCTACCACAT TGCCATTCAG
AATGACGGGG CTAACGGACA GATAGGGCCG GAGGCGGCAG GCAGTCTTGT GCAGCTGATT
GACCAGCGGG TGCAGGCGGT GATGCTGTCC ATGCGACGTG ACGGAGGAAT GCTGAGTGGC
TGA
 
Protein sequence
MATLRELIIK ISANSQSFQS EIQRASRMGS EYYRTLQNGG RQAAAAAREQ RRALAELNSQ 
LTEIRGSAVG MAGAFAGAFA SGHLISLADE WSSVNARLKQ ASQSSDEFSS SQKVLMDISQ
RTGTAFSDNA ALFARSAASM REYGYSAGDV LKVTEAISTG LKISGASTAE AGSVITQFSQ
ALAQGVLRGE EFNSVNESGD RIVRALAAGM GVARKDLKAM ADDGKLTADK VVPALISQLG
ILRDEYAAMP ETVSSSITKV ENAFMAWVGG ANEASGVTKT LSGMLNGVAG QIDNVATAVG
ALVAVGVARY FGNMASGAMS ATAGLVTAAR NEVALAEAQF RGTQIATARA RAAVYRAQQA
VAAARGTEMQ IAAEARLAAT QERLNRNIAA RTAAQNALNS TTAVGSRLMS GALGLVGGVP
GLVMLGAAAW YTLYQNQEQA RESARQYALT IDEIAHKTPS MSLPEASDNE GRTRAALTEQ
NRLIDEQASR VKSLQEKIAG YQYVLANPGW TTGDGFMINH LTSVKTVTEG LAQATEQLAV
EQSRLAQMQE KAQSIQDVLA GLEDRRVALI RQQAAEQNKV YQSMLVMNGQ YTEFNRLLGL
GNELLQQRQG LVNVPLRLPQ ATLDDKQQSA LTKTERELAL SRLKGEEKER VRLGYAADDL
GFVGDPYQEA RQRYISNALE AWRNNEVNKP KSRGGKSETE KAEDSFSRLL KQQKEQLALV
GQNTELAKLK YQTALGELKT LTEMQKQELL RNATLIDQQK IREQLRSREE TLKNENAAAR
ASNDAELLGY GQGERARERM RELQQIRDSF RQKDADLQSQ YQTGDISEDF YRQALAQNAQ
YLSERLKDQA VFYAESDVQR ADWQKGLQEG FSNWVDNASD YASQAAQLAT EGISGMVNNI
TEMLNGNKVE WRSWASSVLQ EISKVLMNAA IVNGIKTAAN GMSGAGGFLG SIGDWLGGAV
ANAKGGVYTS ANLSAYSNSI VDTPTYFAFA KGAGLMGEAG PEAIMPLTRA ADGSLGVRAV
GSMNGSAGLV YSPVYHIAIQ NDGANGQIGP EAAGSLVQLI DQRVQAVMLS MRRDGGMLSG