Gene ECH74115_1967 details

Gene Information

Locus tag: ECH74115_1967
Symbol:
ID: 6968217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1860340
End bp: 1861401
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 56%
IMG OID: 643385893
Product: hypothetical protein
Protein accession: YP_002270382
Protein GI: 209397939
COG category: [S] Function unknown
COG ID: [COG3768] Predicted membrane protein
TIGRFAM ID: [TIGR01620] conserved hypothetical protein, TIGR01620
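
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1860340
end_bp = 1861401
gene_length_bp = 1062
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS consists only of whole codons, the last being the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1062 0 353
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.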


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.650951
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CGCTGGAGGT CGAACAGAAT 
CCAAAATTCA GGGCGCAGCA GACCTTTGAC GAAAATCAGG CGCAAAATTT TGCCCCGGCC
ACGCTCGACG AAGCGCAGGA AGAAGAGGGG CAAGTCGAAG CGGTAATGGA CGCAGCGTTA
CGTCCGAAAC GCAGCCTGTG GCGCAAAATG GTGATGGGCG GGCTGGCTCT GTTTGGCGCA
AGCGTTGTCG GGCAGGGTGT ACAGTGGACA ATGAATGCCT GGCAAACTCA GGACTGGGTG
GCGCTGGGTG GATGTGCTGC TGGGGCATTG ATTATCGGCG CTGGCGTAGG TTCTGTGGTA
ACAGAGTGGC GGCGCTTATG GCGCTTGCGA CAGCGCGCCC ATGAACGCGA CGAAGCGCGC
GATTTGTTGC ACAGCCACGG CACGGGCAAA GGCCGCGCAT TTTGCGAAAA ACTGGCGCAG
CAGGCGGGTA TTGATCAGTC TCATCCAGCG CTGCAACGCT GGTATGCCTC AATCCATGAA
ACGCAGAACG ATCGTGAAGT GGTCAGCCTG TATGCTCATC TGGTCCAGCC GGTTTTAGAT
GCCCAGGCGC GGCGCGAAAT CAGCCGCTCA GCAGCTGAAT CAACGTTGAT GATTGCGGTC
AGCCCGCTGG CGCTGGTGGA TATGGCATTT ATCGCCTGGC GCAATCTGCG TTTGATTAAT
CGCATCGCCA CGCTGTATGG CATTGAACTG GGATATTACA GCCGTTTGCG CCTGTTCAAG
CTGGTATTGC TGAATATCGC TTTCGCCGGA GCCAGCGAAT TGGTGCGCGA AGTGGGAATG
GACTGGATGT CGCAAGATCT CGCTGCTCGT TTGTCTACCC GCGCAGCTCA GGGGATTGGT
GCAGGACTTC TGACGGCACG ACTGGGGATT AAAGCTATGG AGCTTTGCCG CCCGCTGCCG
TGGATTGACG ATGACAAACC TCGCCTCGGG GATTTTCGTC GTCAGCTTAT CGGTCAGGTG
AAAGAAACGC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA
 
Protein sequence
MTEPLKPRID FDGPLEVEQN PKFRAQQTFD ENQAQNFAPA TLDEAQEEEG QVEAVMDAAL 
RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV
TEWRRLWRLR QRAHERDEAR DLLHSHGTGK GRAFCEKLAQ QAGIDQSHPA LQRWYASIHE
TQNDREVVSL YAHLVQPVLD AQARREISRS AAESTLMIAV SPLALVDMAF IAWRNLRLIN
RIATLYGIEL GYYSRLRLFK LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG
AGLLTARLGI KAMELCRPLP WIDDDKPRLG DFRRQLIGQV KETLQKGKTP SEK
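
As a spot check of the protein sequence against the gene sequence, the following sketch translates the first 60 and the last 12 nucleotides of the gene. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ only in the permitted start codons). `translate` and the other names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line and final 12 bases of the gene sequence, copied from above.
first_60 = "ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CGCTGGAGGT CGAACAGAAT"
last_12 = "AGCGAAAAAT AA"

print(translate(first_60))  # MTEPLKPRIDFDGPLEVEQN
print(translate(last_12))   # SEK
```

Both results agree with the start and end of the protein sequence listed above.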