Gene ECH74115_1066 details


Gene Information

Locus tag: ECH74115_1066
Symbol:
ID: 6967958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1087871
End bp: 1089631
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 51%
IMG OID: 643385078
Product: hypothetical protein
Protein accession: YP_002269577
Protein GI: 209395737
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
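
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers; coordinates are assumed to be 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1087871, 1089631)
print(length, protein_length(length))  # 1761 586
```

Both values agree with the Gene Length and Protein Length fields of this record.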


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0179082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CATTTATCCC CGGCAAAGAT GCCGCACTGG AAGATTCCAT CGCTCGCTTC 
CAGCAAAAAC TTTCAGACCT CGGCTTTCAG ATTGAAGAGG CCTCCTGGCT GAATCCCGTG
CCTAACGTCT GGTCTGTACA TATTCGCGAC AAAGAGTGCG CACTGTGTTT TACCAACGGT
AAAGGCGCAA CCAAGAAAGC GGCGCTGGCT TCTGCACTCG GTGAATATTT CGAGCGTCTC
TCAACCAACT ACTTTTTTGC TGATTTCTGG CTGGGCGAAA CCATCGCCAA CGGTCCATTT
GTGCATTATC CCAACGAAAA ATGGTTCCCA CTGACCGAAA ATGACGATGT GCCAGAAGGG
CTGCTCGATG AACGTCTGCG CGCATTTTAC GATCCGGAGA ATGAATTGAC CGGCAGTATG
CTGATTGACC TACAATCCGG TAACGAAGAT CGTGGTATTT GCGGTCTGCC GTTTACACGT
CAGTCTGATA ATCAGACCGT TTATATTCCG ATGAATATCA TTGGTAACCT GTACGTTTCT
AACGGTATGT CTGCTGGCAA TACCCGTAAC GAAGCACGCG TTCAGGGGTT GTCCGAAGTT
TTCGAACGCT ACGTGAAAAA CCGCATTATT GCTGAAAGCA TCAGCTTGCC GGAAATCCCG
GCAGACGTGC TGGCGCGTTA CCCAGCGGTG GTTGAAGCGA TCGAAACACT GGAAGCAGAA
GGTTTCCCGA TCTTCGCATA TGATGGTTCC CTTGGCGGCC AGTATCCGGT GATTTGCGTG
GTACTGTTTA ATCCTGCTAA CGGCACCTGC TTTGCCTCTT TCGGTGCGCA TCCTGATTTT
GGCGTAGCAC TGGAACGTAC CGTGACCGAG CTGCTGCAAG GTCGTGGCCT GAAAGATTTG
GATGTGTTTA CTCCGCCAAC CTTCGATGAT GAAGAAGTCG CTGAACATAC CAACCTCGAA
ACGCACTTTA TCGATTCCAG CGGTTTAATC TCCTGGGACC TGTTCAAGCA GGATGCCGAT
TATCCGTTTG TGGACTGGAA TTTCTCCGGC ACCACGGAAG AAGAGTTTGC TACGCTGATG
GCTATCTTCA ACAAAGAAGA TAAAGAAGTT TATATTGCCG ATTACGAGCA TCTGGGCGTT
TATGCTTGCC GTATTATCGT GCCTGGCATG TCCGATATTT ATCCGGCTGA AGATCTGTGG
CTCGCGAATA ACAGTATGGG CAGCCATTTA CGTGAAACGA TTCTTTCGCT ACCAGGCAGC
GAGTGGGAAA AAGAAGATTA CCTGAACCTC ATCGAGCAAC TGGATGAAGA AGGTTTTGAT
GACTTTACCC GCGTGCGTGA GCTGTTGGGT CTGGCGACCG GGTCGGATAA CGGTTGGTAC
ACCCTGCGTA TTGGTGAATT AAAAGCCATG CTGGCGCTGG CTGGTGGCGA TCTGGAACAG
GCTCTGGTCT GGACCGAATG GACGATGGAG TTTAACTCAT CGGTATTCAG CCCGGAACGC
GCCAACTATT ATCGCTGCCT GCAAACGTTG TTATTACTGG CGCAGGAAGA AGATCGCCAG
CCGCTGCAAT ATCTGAATGC GTTTGTTCGC ATGTACGGCG CAGATGCCGT GGAAGCCGCC
AGTGCGGCAA TGAGCGGCGA AGCGGCGTTT TATGGCTTGC AACCAGTAGA TAGCGATCTG
CACGCGTTTG CTGCACATCA GTCGCTGTTG AAGGCCTACG AAAAGCTGCA GCGCGCCAAA
GCAGCATTCT GGGCAAAATA A
 
Protein sequence
MTQTFIPGKD AALEDSIARF QQKLSDLGFQ IEEASWLNPV PNVWSVHIRD KECALCFTNG 
KGATKKAALA SALGEYFERL STNYFFADFW LGETIANGPF VHYPNEKWFP LTENDDVPEG
LLDERLRAFY DPENELTGSM LIDLQSGNED RGICGLPFTR QSDNQTVYIP MNIIGNLYVS
NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP ADVLARYPAV VEAIETLEAE
GFPIFAYDGS LGGQYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL
DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFVDWNFSG TTEEEFATLM
AIFNKEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNSMGSHL RETILSLPGS
EWEKEDYLNL IEQLDEEGFD DFTRVRELLG LATGSDNGWY TLRIGELKAM LALAGGDLEQ
ALVWTEWTME FNSSVFSPER ANYYRCLQTL LLLAQEEDRQ PLQYLNAFVR MYGADAVEAA
SAAMSGEAAF YGLQPVDSDL HAFAAHQSLL KAYEKLQRAK AAFWAK
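
The relationship between the gene and protein sequences can be illustrated with a minimal sketch that translates a nucleotide string and computes its GC percentage. It assumes the sequence is given in the coding orientation and uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in the start codons it permits; this gene begins with ATG, so that difference does not matter here). The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, not functions from any library.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACGCAAA CATTTATCCC CGGCAAAGAT"))  # MTQTFIPGKD
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field of the record.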