Gene ECH74115_4537 details


Gene Information

Locus tag: ECH74115_4537
Symbol:
ID: 6970840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4204919
End bp: 4205968
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 49%
IMG OID: 643388249
Product: hypothetical protein
Protein accession: YP_002272684
Protein GI: 209400395
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
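
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4204919, 4205968)
print(length, protein_length(length))  # prints: 1050 349
```

Both values agree with the Gene Length and Protein Length fields of this record.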


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCGC TCTCAGAGAG AACCTCAACA GGCTACCAGC AAATCCACGA CGGTATTATT 
CACCTGGTCG ATAGCGCCCG GACGGAAACG GTACGTAGCG TTAACGCGCT GATGACCGCG
ACATACTGGG AAATTGGCCG ACGAATTGTC GAATTTGAAC AAGGTGGCGA GGCCAGGGCT
GCGTATGGTG CGCAGCTAAT CAAGCGACTA TCAAAGGATT TAAGTCTAAG GTATAAGCGT
GGGTTCTCTG CAAAAAACTT ACGCCAAATG AGGCTTTTTT ACCTCTTTTT TCAACATGTT
GAAATTCGCC AGACAGTGTC TGGCGAATTA ACACCATTGC CCTGGTCCAC TTACGTCCGT
TTACTTTCCG TTAAAAACGC TGACACCCGC AGCTTTTATG AAAAAGAGAC GCTCCGCTGT
GGCTGGTCTG TTCGCCAGCT AGAACGGCAA ATTGCGACCC AATTTTATGA GCGAACACTG
CTGTCACATG ACAAATCAGC CATGCTGCAA CAACACGCTC CTGCCGAGAC GCATATTCTT
CCGCAACAGG CGATACGCGA TCCCTTTGTG CTCGAATTTC TGGAATTGAA AGATGAATAC
TCAGAATCCG ATTTTGAGGA GGCGCTGATC AACCACCTGA TGGATTTCAT GCTGGAACTT
GGGGATGATT TTGCCTTTGT TGGTCGGCAG CGAAGGTTAC GCATTGATGA CAACTGGTTT
CGTGTCGATC TGCTGTTTTT CCACCGCCGT TTACGCTGCC TGCTAATCGT CGATCTAAAA
GTGGGCAAAT TCAGCTATAG CGATGCCGGA CAGATGAATA TGTATCTCAA CTACGCCAAA
GAGCACTGGA CGCTACCGGA TGAAAATCCG CCCATCGGTC TGGTTCTCTG TGCAGAGAAA
GGAGTCGGAG AAGCGCATTA TGCACTGGCT GGTTTGCCTA ACACCGTTCT GGCGAGTGAA
TATAAGATGC AACTACCTGA TGAAAAACGA CTCGCGGATG AACTCGTTCG AACACAGGCG
GTGCTAGAAG AAGGCTATAG ACTCCGTTAA
 
Protein sequence
MESLSERTST GYQQIHDGII HLVDSARTET VRSVNALMTA TYWEIGRRIV EFEQGGEARA 
AYGAQLIKRL SKDLSLRYKR GFSAKNLRQM RLFYLFFQHV EIRQTVSGEL TPLPWSTYVR
LLSVKNADTR SFYEKETLRC GWSVRQLERQ IATQFYERTL LSHDKSAMLQ QHAPAETHIL
PQQAIRDPFV LEFLELKDEY SESDFEEALI NHLMDFMLEL GDDFAFVGRQ RRLRIDDNWF
RVDLLFFHRR LRCLLIVDLK VGKFSYSDAG QMNMYLNYAK EHWTLPDENP PIGLVLCAEK
GVGEAHYALA GLPNTVLASE YKMQLPDEKR LADELVRTQA VLEEGYRLR
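
The sequences above are printed in blocks of ten characters. As a rough sketch, the blocks can be joined into one contiguous string and the GC fraction computed for comparison with the reported GC content. `join_blocks` and `gc_fraction` are illustrative names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = join_blocks("ATGGAATCGC TCTCAGAGAG")
print(len(example), gc_fraction(example))
```

Applied to the full gene sequence, the joined string should have the 1050 characters given in the Gene Length field.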