Gene ECH74115_2168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2168 
Symbol 
ID6970387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2080258 
End bp2081826 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content59% 
IMG OID643386063 
Producthost specificity protein 
Protein accessionYP_002270552 
Protein GI209395902 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.524267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.854577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAG GTGGCGGTAA GGCACACACG CCTCGTGAGG CGAAGGATAA TCTCAAATCC 
ACGCAGATGA TGAGTGTGAT TGATGCGATT GGTGAGGGAC CGATAGAAGG TCCGGTGAAG
GGACTGCAGA GTATTCTGGT GAACAAAACC CCACTGACGG ACACGGACGG CAATCCCGTG
ATACACGGTG TGACCGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC
TTTGAGTCCT CCGGAGCTGA AACCGGACTG GGCGTGGAAG TGACGAAGGC AAAACCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCACTCTT CTGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTTC TGGCGTCGGT GATTCTGGAT AATCTGCCTG AGCGGCCCTT TAACATCCGG
ATGGTCCGGG AGACAGCGGA CAGCACCTCG GACCAGCTGC AGAATAAGAC GCTCTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCGAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTACCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG
ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG
CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG ACGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC
GCCAGCCGCA CGCGCAGTAC TGCGACCAGA CGGTCCCGGA TGGTTTCGGG GGCACAGAGC
CGCGGATGA
 
Protein sequence
MGKGGGKAHT PREAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETGL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LLETTSKGDR NHSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPERPFNIR
MVRETADSTS DQLQNKTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRVLSIDA ASRTRSTATR RSRMVSGAQS RG