Gene ECH74115_4904 details


Gene Information

Locus tag: ECH74115_4904
Symbol:
ID: 6969223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4543465
End bp: 4545144
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 53%
IMG OID: 643388590
Product: hypothetical protein
Protein accession: YP_002273018
Protein GI: 209398532
COG category:
COG ID:
TIGRFAM ID: [TIGR03368] cellulose synthase operon protein YhjU
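
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4543465, 4545144)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both values agree with the Gene Length (1680 bp) and Protein Length (559 aa) listed in this record.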


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC GGGATATCTT
AACTTCCATC CGCTCCTCAA TTTGGTGTTT GCCGCGTTTC TGCTGATGCC CATTCCGCGC
TACAGCCTGC ATCGCTTGCG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG
CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG
TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG
GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT
TTTGTGGTTG CCATACTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC
TTGTGGCCAG CCGGACAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA
ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG
CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT
AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT
AACATCTGTT CGCTTTCCTG GTCGGATATA GAAGCCGCTG GGTTGATGTC GCATCCACTG
TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCTA CAGTGGCCCG
GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA
CCGGCAAATA ACGACTGCTA TCTGTTTGAT AACCTGTCGA AACTGGGCTT TACCCAGCAC
CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGC
GGCATGCAGA CTGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT
GGTTCGCCGG TTTATGACGA TACCGCCGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA
GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT
TATCCGGGCG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA
CTGGACGCCT TCTTTACTGA ACTGGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG
CCGGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC
CCTAGCCCGT CTATCACCGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCA
CATCAGGGGG CACCGATTGT CATCGACCAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG
GTGGTTCGCG TTCTTGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC
ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCAGAGAACT CAAATGCAGT AGTTATTCAA
TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
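
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it needs light cleaning before any computation. A minimal sketch of that cleaning step and of a GC-content calculation follows. The helper names are hypothetical, and the rounding used for the record's GC content field is not stated, so the sketch returns an unrounded percentage.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the gene itself.
example = clean_sequence("ATGC GGCC")
print(example)              # ATGCGGCC
print(gc_percent(example))  # 75.0
```

Applying `gc_percent(clean_sequence(...))` to the full gene sequence gives a value that can be compared with the GC content field of this record.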
 
Protein sequence
MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPIPR 
YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG
AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA
TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI
NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSYSGP AAIRLLRASC GQTSHTNLYQ
PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQTELMDQT NLPVILLGFD
GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP
HQGAPIVIDQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ
YQDKPYVRLN GGDWVPYPQ
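
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. It does not implement the alternative start codons of table 11, which is adequate here because this gene starts with ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGACTCAAT TTACGCAAAA TACCGCCATG"))  # MTQFTQNTAM
```

The output matches the first 10 residues of the protein sequence listed above.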