Gene ECH74115_2675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2675 
Symbol 
ID6970428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2516655 
End bp2517839 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content53% 
IMG OID643386536 
Productphage tail sheath protein 
Protein accessionYP_002271018 
Protein GI209399284 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.825453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0163922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA CTCGTTTTCA TGGTGCCCGT GTTACGGAAA GTACCGACCT GGTAACAGCG 
ATTAATGATG TTGATTCCAG TGTTATCGGT ATCGTGGCAA CGGCAGATGA TGCGGACGCG
GAGCTGTTCC CGCTGAACAA GCCCACACTG CTGACCCGCG TCAATGACGT GCTGGGAAAA
TGCGGGACTA CGGGGACGCT TTATCGTGCG CTTAAGGCTA TCGCAGACCA GGTGAGCACA
AAAGTGATCG TCGTTCGCGT GGCTGAACAC AAAGAAGAAG GCGGTAAGAC GCAGGATCAA
CTGGTTATCG GTGGTTCTGA ATCTGACGGC AGCTATACGG GGATGTATGC GCTGCTTGTT
GCAGAGCAGG ATGAAAGCAT CGGATACCGT CCGCGTATTC TGGCCGCGCC GGAGCTGGAC
ACGGAGGCGG TAACAAAATC CCTGTGCGTG ATTGCGGGTA AACTGCGCGC GTTTGTGTAT
GCCACATGTC ATGGTTGTAA CACGATGGCT GAGGCGATTA CCTACCGCCA GAAATTCAAC
GAACGTGAGG TGATGCTCTT ATGGCCTGAC TTCATCGCCT ACAACCTGAA AAGTGGCAAA
AACGAAACGT TCCCCGCGCC TGCTTATGCG TGCGGCCTTC GTGCGTACAT TGACCATGAG
CAGGGCTGGC ACAAATCGCT GTCCAACGTT CCGGTTAAAA ATGTGCTTGG GATGTCGAGG
CATGTGTTCT GGTCGTTGCA GGCCGAAGAC AGTGATGCCA ACAGCCTCAA CAACAAAGAA
ATCACGACCA TTATTCGTCG CAACGGGTTC CGCTTCTGGG GCAACCGCAC ACCGGAAACG
AACGCCTACA TCTTTGAGGT GTATACCCGA ACCGCACAGG TGCTGGCTGA TTCAATTGCG
GAAGCGCAGT TTGAAACCAT CGACAGTCCA CTGACACCTG CGAACGTGAA GGATGTTATC
AGTGCCATCA GGGCAAAACT GGATTCACTG GTTACTGCCG GGAAACTGAT TGGCGCGGAG
TGCTGGTATG ACATCGAGGA TAACAGCACC ACGAATTTAC GTCAGGGGCG TGTGCGTATT
CGCTACAAAT ATACGCCCGT TCCTCCGCTG GAAGACATGG AGCTTTACCA GTCGTTTACT
GATGAATTCT TTGGTCCCGC ATTTGCGGTG CTGGGAGGTG CCTGA
 
Protein sequence
MSETRFHGAR VTESTDLVTA INDVDSSVIG IVATADDADA ELFPLNKPTL LTRVNDVLGK 
CGTTGTLYRA LKAIADQVST KVIVVRVAEH KEEGGKTQDQ LVIGGSESDG SYTGMYALLV
AEQDESIGYR PRILAAPELD TEAVTKSLCV IAGKLRAFVY ATCHGCNTMA EAITYRQKFN
EREVMLLWPD FIAYNLKSGK NETFPAPAYA CGLRAYIDHE QGWHKSLSNV PVKNVLGMSR
HVFWSLQAED SDANSLNNKE ITTIIRRNGF RFWGNRTPET NAYIFEVYTR TAQVLADSIA
EAQFETIDSP LTPANVKDVI SAIRAKLDSL VTAGKLIGAE CWYDIEDNST TNLRQGRVRI
RYKYTPVPPL EDMELYQSFT DEFFGPAFAV LGGA