Gene ECH74115_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1524 
Symbol 
ID6972139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1496665 
End bp1497957 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content53% 
IMG OID643385494 
ProductYjhS 
Protein accessionYP_002269988 
Protein GI209397659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAGGC CGCTGTATAA GGATCTCATC GGTCGTACAA AAGCCGCGCT GGCAAAGAAC 
CCGAAAAATG TGCTGCTTGC CGTGGTGTGG ATGCAGGGGG AATTTGACTT TGACGGAACG
CCAGCAAATC ACACAGCCCG TTTTACAGAA GTAGTGGAAC AATATCGTAC GGACCTTGCA
GATATGGTGG GACAGTGCGC TGGTGGTTCT GCTGACGGTG TTCCCTGGAT ATGTGGAGAC
ACAACTTATT TCTGGAAGCA GAAGAGCGAA TCCACTTACC AGACGGTGTA CGGCAGTTAC
AAAAACAAAA CGGAAAAGAA TATTCACTTT GTGCCGTTCA TGACCGATGA GAACGGAGCA
AATGTCCCGA CGAACAAACC GGAAGAAGAC CCGGATATTC CGGCATCAGG ATATTACGGT
GCGGCCTCCC GGACGTCGGC AAACTGGACG TCAGCAGACC GTGCGAGCCA TTTCAGCTCA
TGGGCACGCA GGGGGATTAT TTCTGACCGT CTTGCCTCAG CGATTCTTCT CCATGCAGGA
CGGACGGCTG AACTGGTGGG TGGGGAACAG GTTGTGATGC CGCCGGATGA GAAGCCGTCA
CCGGACACAC CATCAACACC GTCAACGGAC GGGAAATCAG TGACAACGCT GCTTTATTAC
CGTGCAACAG AGTCAGGTGG TTTACTGAAT CCGCAGGGAT GGGGAGCTGA AGGAGGGCGT
GCATTGGTAG TTGATGATGC AGGTGCTGCA GGAGGTAAGG CGCTGAGGTG GACCAAACAG
ACAGGAAGTT CCTCGTGGTT TATGCAGCAT GATGCCGGTA ATGGCGCAGA CCTGCTGGAG
AAGGGCGGGC TTATCAGTTG TCGTTTTAAA GTTGATGGCA CACTGACAGC TAATCAGTAC
GCACTGGCGC TGTACTGGCC GGTTTCTTCA CTGCCTCAGG GTGTCACACT GGAAGGTAAT
GCCGGTCATA ACCTGCTGGC GTCGTTTTAC GTACAGAGCG ATGCCACAGA CCTTAATGTG
ATGTACCACA AGGGAAATGC TGGTCAGAAC ACGAAGCTGG GGTCATTCGG CGCATTTGAT
AACGAATGGC ATACGCTGGG CTTCCGTTTT GCCGGTAACA ACAGTATTGA GGTGACGCCG
GTCATTGATG GTAAGGACGG GACGCCGTTC ATGCTGTCAC AGTCACCGGT CGGCACGTTT
ACGGCAGACA AATTGCGCGT GACCGATATC ACTAGCGGTG CGACATATCC GGTGCTGATT
GAAAGTATAA CAGTGGAAGT GAATAACCCG TAA
 
Protein sequence
MGRPLYKDLI GRTKAALAKN PKNVLLAVVW MQGEFDFDGT PANHTARFTE VVEQYRTDLA 
DMVGQCAGGS ADGVPWICGD TTYFWKQKSE STYQTVYGSY KNKTEKNIHF VPFMTDENGA
NVPTNKPEED PDIPASGYYG AASRTSANWT SADRASHFSS WARRGIISDR LASAILLHAG
RTAELVGGEQ VVMPPDEKPS PDTPSTPSTD GKSVTTLLYY RATESGGLLN PQGWGAEGGR
ALVVDDAGAA GGKALRWTKQ TGSSSWFMQH DAGNGADLLE KGGLISCRFK VDGTLTANQY
ALALYWPVSS LPQGVTLEGN AGHNLLASFY VQSDATDLNV MYHKGNAGQN TKLGSFGAFD
NEWHTLGFRF AGNNSIEVTP VIDGKDGTPF MLSQSPVGTF TADKLRVTDI TSGATYPVLI
ESITVEVNNP