Gene ECH74115_5050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5050 
Symbol 
ID6966626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4695106 
End bp4696230 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content47% 
IMG OID643388728 
ProductEspD 
Protein accessionYP_002273154 
Protein GI209398054 
COG category[S] Function unknown 
COG ID[COG5613] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAACG TAAATAACGA TACCCTGTCT GTAACGTCTG GGGTTAATAC CGCCTCGGGT 
ACTTCTGGTA TTACTCAATC TGAAACGGGT TTATCGCTGG ATTTACAACT GGTTAAATCC
ATGAACTCGT CAGCAGGCTG GACAGAAAGT AGCCCTTTAC CGACGCCGCC GGCAGGTCAC
TCATTAGTGA CGCCCTCTGC TGCTGAGGAT GTCCTTAGTA AATTGTTTGG TGGTATTAGT
GGTGAGGTTA CAAGTCGCAC TGAGGAGGCA GAGCCACAGC GCACAAGCTA TCCCTATCTC
TCTCAGGTGA ATACCGTTGA CCCTCAGCAA ATGATGATGA TGGTCACTCT GTTATCCCTG
GATACTTCCG CGCAGAAAGT CTCGAGTCTG AAAAACTCTA ACGAGATTTA TATGGATGGG
CAAACTAAAG CGCTGGAGAA TAAAACGCAG GAGTATAAAA AACAGCTCGA AGAACAACAG
AAAGCCGAAG AGAAATCACA AAAAAGTAAA ATTGTTGGCC AGGTCTTTGG TTGGTTGGGC
GTCGCATTAA CAGCCGTTGC CGCTGTTTTT AACCCAGCAC TCTGGGCTGT TGTTGCCATT
GGTGCAACAG CAATGGCACT GCAAACGGCA GTCGATGTAA TGGGGGAAAA TGCCCCTCAG
GGATTAAAGA CTGCAGCACA GGTCTTTGGC GGAATATCTA TGGCCGCAAG CATTCTGACA
GCCGGCGTTG GCGGGGTGTC TTCACTGTTA TCTAAATTTG GTAATGTTGC TAACAAAATT
GGCTCAAGCG TTGTAAAAGT CGTTGAGAAG GCGGCAGAAG CGCTGGTTAA AAACGTTTTT
GCAAAAATTT CGACAGTGGC TGAGGGCGTT ACGAACGGTA TTCGTTCTGC CGGGACAACT
GCGTTGAATA ATGAGGCTGC GCAACTCCAA ATGTTGTCTC AGTTAGCTGC TTTCGCGGTG
CAAAACTTAA CTCGACAGAG TGAAAGCTTA GGTGAGAGTG CGAAGCTCGA GCTGGATAAA
GCGGCAAGCG AGTTACAAAA TCAGGCGAGC TATTTACAAA GTGTTTCTCA ACTGATGTCC
GATTCAGCAC GGGTAAATAG TCGTATTGTT AGTGGCCGAA TTTAA
 
Protein sequence
MLNVNNDTLS VTSGVNTASG TSGITQSETG LSLDLQLVKS MNSSAGWTES SPLPTPPAGH 
SLVTPSAAED VLSKLFGGIS GEVTSRTEEA EPQRTSYPYL SQVNTVDPQQ MMMMVTLLSL
DTSAQKVSSL KNSNEIYMDG QTKALENKTQ EYKKQLEEQQ KAEEKSQKSK IVGQVFGWLG
VALTAVAAVF NPALWAVVAI GATAMALQTA VDVMGENAPQ GLKTAAQVFG GISMAASILT
AGVGGVSSLL SKFGNVANKI GSSVVKVVEK AAEALVKNVF AKISTVAEGV TNGIRSAGTT
ALNNEAAQLQ MLSQLAAFAV QNLTRQSESL GESAKLELDK AASELQNQAS YLQSVSQLMS
DSARVNSRIV SGRI