Gene ECH74115_2883 details


Gene Information

Locus tag: ECH74115_2883
Symbol:
ID: 6967827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2675967
End bp: 2679209
Gene Length: 3243 bp
Protein Length: 1080 aa
Translation table: 11
GC content: 57%
IMG OID: 643386727
Product: tail length tape measure protein
Protein accession: YP_002271198
Protein GI: 209400454
COG category: [S] Function unknown
COG ID: [COG5281] Phage-related minor tail protein
TIGRFAM ID: [TIGR01541] phage tail tape measure protein, lambda family
            [TIGR02675] tape measure domain
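
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2675967, 2679209)
print(length)                     # 3243
print(protein_length_aa(length))  # 1080
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.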


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.000541295
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG 
GAGATCCAGC GGGCGTCCCG TATGGGCAGT GAATATTACC GGACCCTGCA GAATGGCGGG
CGTCAGGCCG CTGCGGCAGC CCGGGAGCAG CGACGTGCCC TGGCAGAACT GAACAGCCAG
TTGACGGAAA TTCGCGGTTC TGCTGTCGGA ATGGCTGGCG CATTTGCCGG TGCCTTTGCC
TCCGGACACC TGATTTCACT GGCGGATGAG TGGAGTTCCG TAAATGCCCG TCTGAAACAG
GCATCACAGT CATCCGATGA ATTTTCGTCA TCACAGAAAG TGCTGATGGA TATCAGCCAG
CGGACAGGCA CGGCATTTTC GGATAATGCG GCCCTGTTTG CCCGTTCGGC TGCCTCGATG
CGTGAATATG GTTACAGTGC CGGTGATGTA CTGAAGGTGA CGGAGGCCAT TTCCACGGGG
CTGAAAATCT CCGGTGCCAG TACGGCTGAG GCGGGTTCGG TGATCACCCA GTTCAGCCAG
GCGCTGGCAC AGGGTGTGTT GCGCGGTGAG GAATTTAATT CGGTCAATGA AAGTGGTGAC
CGGATCGTAC GTGCACTGGC TGCGGGTATG GGCGTGGCCC GTAAAGATCT GAAGGCAATG
GCGGATGACG GAAAACTGAC AGCGGATAAA GTGGTCCCCG CGTTAATCAG CCAGCTGGGG
ATATTACGTG ATGAATATGC GGCCATGCCG GAAACGGTTT CCAGTAGTAT CACGAAGGTG
GAAAACGCCT TTATGGCCTG GGTGGGCGGT GCGAATGAGG CCAGCGGGGT GACAAAAACG
CTCTCCGGCA TGCTGAACGG TGTTGCCGGA CAGATTGATA ATGTGGCAAC AGCCGTGGGC
GCGCTGGTTG CCGTCGGGGT TGCACGTTAC TTTGGCAATA TGGCCTCCGG AGCGATGTCT
GCCACGGCAG GACTTGTGAC GGCTGCACGT AATGAAGTTG CACTGGCGGA AGCACAGTTC
AGGGGAACGC AGATTGCCAC GGCGCGGGCA AGGGCAGCCG TGTACCGTGC TCAGCAGGCC
GTGGCGGCAG CCCGCGGGAC GGAGATGCAG ATTGCAGCAG AGGCCCGTCT GGCGGCCACA
CAGGAACGCC TGAACAGAAA TATTGCTGCC AGAACCGCCG CCCAGAATGC GCTGAACAGT
ACAACGGCGG TGGGCTCACG TCTGATGAGC GGTGCGCTGG GGCTGGTTGG TGGCGTACCC
GGACTGGTGA TGCTGGGGGC TGCAGCATGG TACACGCTGT ACCAGAATCA GGAGCAGGCC
AGGGAGTCTG CGCGCCAGTA TGCACTGACG ATAGATGAAA TCGCGCATAA AACGCCGTCA
ATGTCTCTGC CTGAAGCCTC AGATAATGAA GGACGAACAC GGGCGGCGCT GACAGAGCAG
AACCGGCTGA TTGATGAACA GGCCAGTCGG GTGAAATCCC TGCAGGAAAA AATCGCAGGA
TATCAGTATG TTCTGGCGAA CCCGGGCTGG ACGACCGGTG ACGGATTCAT GATAAACCAT
CTGACATCGG TGAAGACCGT AACGGAAGGG CTTGCTCAGG CAACAGAGCA GCTTGCCGTT
GAGCAGTCCC GTCTGGCACA GATGCAGGAA AAAGCGCAGT CCATTCAGGA TGTGCTTGCC
GGGCTGGAAG ACCGTCGTGT GGCGTTAATT CGTCAGCAGG CGGCAGAGCA GAATAAGGTG
TACCAGTCCA TGCTGGTTAT GAACGGTCAG CATACGGAAT TCAACCGTCT GCTGGGGCTG
GGGAATGAAC TGCTTCAGCA GCGGCAGGGA CTGGTGAATG TGCCGTTACG GCTGCCACAG
GCCACTCTGG ATGATAAACA GCAGAGTGCC CTGACAAAAA CAGAGCGTGA GCTGGCCCTG
TCCAGACTGA AAGGGGAAGA AAAAGAGCGC GTCCGACTGG GGTATGCGGC GGATGACCTC
GGTTTTGTGG GGGATCCGTA TCAGGAGGCG AGACAACGTT ATATCAGTAA TGCCCTGGAA
GCCTGGCGCA ATAACGAGGT GAATAAACCC AAATCCCGGG GTGGAAAATC AGAGACGGAA
AAAGCGGAAG ACAGTTTTTC CCGGCTGCTG AAGCAGCAGA AAGAGCAACT GGCACTGGCC
GGTCAGAACA CGGAGCTGGC GAAGCTGAAG TACCAGACAG CGCTGGGTGA ACTGAAAACC
CTGTCGGAGA TACAGAAGCA GGAACTGCTG CGCAATGCGG CCCTGATTGA CCAGCAAAAA
ATCCGGGAGC AGTTGCGGTA CCGGGAAGAG ACCCTGAAGA ATGATAATGT GGCTGCGCGT
GCATCAAATG AATCTGAACT GCTGGGGTAC GGGCAGGGGG AACGAGCCAG GGAACGCATG
CGGGAGTTGC AGCAGATCCG CGACAGCTTC CGCCAGAAGG ATGCGGACCT TCAGTCTCAG
TATCAGACCG GGGATATCAG TGAGGATTTT TACAGACAGG CGCTGGCACA GAACGCGCAG
TATCTGAGCG AACGCCTTAA GGACCAGGCA GTCTTTTATG CCGAATCGGA TGTGCAGCGT
GCGGACTGGC AGAAAGGGCT GCAGGAGGGA TTCAGTAACT GGGTGGATAA TGCGTCCGAT
TACGCCTCAC AGGCAGCACA GCTGGCGACG GAGGGTATCT CAGGGATGGT GAATAACATC
ACGGAGATGC TGAACGGAAA TAAAGTGGAA TGGCGCAGCT GGGCCTCATC CGTACTGCAG
GAAATATCAA AAGTTCTTAT GAATGCCGCG ATTGTCAACG GAATTAAGAC GGCGGCAAAC
GGTATGTCCG GTGCGGGAGG ATTTCTCGGC AGCATTGGTG ACTGGCTGGG CGGAGCGGTG
GCCAATGCAA AAGGCGGCGT GTATACCTCG GCAAACCTGA GCGCGTACAG CAACAGCATT
GTGGACACGC CCACGTACTT TGCGTTTGCA AAAGGGGCCG GGCTGATGGG GGAAGCCGGA
CCTGAAGCTA TTATGCCCCT GACCCGGGCG GCGGATGGCT CGCTGGGCGT ACGCGCCGTG
GGCAGTATGA ACGGCAGTGC TGGTCTGGTG TATTCCCCGG TCTACCACAT TGCCATTCAG
AATGACGGGG CTAACGGACA GATAGGGCCG GAGGCGGCAG GCAGTCTTGT GCAGCTGATT
GACCAGCGGG TGCAGGCGGT GATGCTGTCC ATGCGACGTG ACGGAGGAAT GCTGAGTGGC
TGA
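
The GC content field in this record can be recomputed from the gene sequence. The sketch below is one possible way to do that, assuming GC content means the percentage of G and C bases over the whole sequence (the record does not define it). The helper name `gc_percent` is hypothetical, and only the first 60-base line of the sequence is used as a short demonstration.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above (60 bases), used only as a short demo.
first_line = "ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG"
print(round(gc_percent(first_line), 1))  # 41.7
```

This 60-base window is not representative of the whole gene. To compare against the reported 57%, the full 3243-base sequence would need to be passed to the function.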
 
Protein sequence
MATLRELIIK ISANSQSFQS EIQRASRMGS EYYRTLQNGG RQAAAAAREQ RRALAELNSQ 
LTEIRGSAVG MAGAFAGAFA SGHLISLADE WSSVNARLKQ ASQSSDEFSS SQKVLMDISQ
RTGTAFSDNA ALFARSAASM REYGYSAGDV LKVTEAISTG LKISGASTAE AGSVITQFSQ
ALAQGVLRGE EFNSVNESGD RIVRALAAGM GVARKDLKAM ADDGKLTADK VVPALISQLG
ILRDEYAAMP ETVSSSITKV ENAFMAWVGG ANEASGVTKT LSGMLNGVAG QIDNVATAVG
ALVAVGVARY FGNMASGAMS ATAGLVTAAR NEVALAEAQF RGTQIATARA RAAVYRAQQA
VAAARGTEMQ IAAEARLAAT QERLNRNIAA RTAAQNALNS TTAVGSRLMS GALGLVGGVP
GLVMLGAAAW YTLYQNQEQA RESARQYALT IDEIAHKTPS MSLPEASDNE GRTRAALTEQ
NRLIDEQASR VKSLQEKIAG YQYVLANPGW TTGDGFMINH LTSVKTVTEG LAQATEQLAV
EQSRLAQMQE KAQSIQDVLA GLEDRRVALI RQQAAEQNKV YQSMLVMNGQ HTEFNRLLGL
GNELLQQRQG LVNVPLRLPQ ATLDDKQQSA LTKTERELAL SRLKGEEKER VRLGYAADDL
GFVGDPYQEA RQRYISNALE AWRNNEVNKP KSRGGKSETE KAEDSFSRLL KQQKEQLALA
GQNTELAKLK YQTALGELKT LSEIQKQELL RNAALIDQQK IREQLRYREE TLKNDNVAAR
ASNESELLGY GQGERARERM RELQQIRDSF RQKDADLQSQ YQTGDISEDF YRQALAQNAQ
YLSERLKDQA VFYAESDVQR ADWQKGLQEG FSNWVDNASD YASQAAQLAT EGISGMVNNI
TEMLNGNKVE WRSWASSVLQ EISKVLMNAA IVNGIKTAAN GMSGAGGFLG SIGDWLGGAV
ANAKGGVYTS ANLSAYSNSI VDTPTYFAFA KGAGLMGEAG PEAIMPLTRA ADGSLGVRAV
GSMNGSAGLV YSPVYHIAIQ NDGANGQIGP EAAGSLVQLI DQRVQAVMLS MRRDGGMLSG
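
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal pure-Python illustration, not the database's own method. NCBI translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs in which start codons are allowed. Because this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "ATGGCGACGT TACGTGAACT GATTATCAAA ATTTCGGCAA ATTCACAGTC ATTCCAGTCG"
print(translate(first_line))  # MATLRELIIKISANSQSFQS
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence would allow the same comparison over all 1080 residues.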