Gene ECH74115_3238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3238 
Symbol 
ID6966606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2975980 
End bp2977041 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content35% 
IMG OID643387054 
Producthypothetical protein 
Protein accessionYP_002271518 
Protein GI209397577 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0332747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000366296 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATACTT TAAAATATGA GAAATTCTCT GATTTTGATC ACAATGACCC ATTTTTTGAC 
TCTTTAAAAA AAGATTATAA AGAGTTTCCT CTTTGGTTAG AAAAAAAAGC CAGAGAAGGA
GAATCAGCTT ATGTGCTCTA TGATGACAAG CATAAAATCG AAGGTTTTAT GTATCTAAAA
GAAAATGATG ATGCAAATGA CATTAATCCA GCGCTCCCAC CAGGACGTCA TCTAAAGATA
GGAACATTCA AATTTGAATC TAAAGGCACC CTTCGCGGAC AACGATTTCT AAAAAAAGCG
TTTGACCATG CATTTTCATC AAAATCTGAT GATATTTATG TTACTGTTTT CGACAAACAC
GTCCATCTAA TAAAACTTTT CCAAACGTAC GGATTTTACA TTCATGGTGA AAAAGAAACA
CATAACGGGA AAGAGTTTGT ATATGCGAGG TCTTTGCATG AGCCTTATGG TGATATTTTA
TTAGATTACC CTCGAATAAT GACATCAAGA GCCAACAAAT ATTTACTGGC GATTTATCCC
GAATATCACA CTAGACTATT CCCTGATTCA AAACTTGTAA ATGAATCACC AGATATTGTC
AAAGATATAT CCCATGCTAA CAGCATTCAT AAAATTTACA TATGTGGAAT GCGTTCTGTG
ATGGGAATGA AAAGAGGAGA TATCATTGTC ATCTATAGAA CCGGAGACAA AAAAGGGCCA
GCTCGCTATC GTTCTGTAGC CAGTACATTA TGTGTAGTTG AGAGCGTAAA AAATATTTCT
GAATTTTTAA GCGAAGATAG TTTTGTAGAC TATTGTATTC GTTTTAGCGT ATTTTCTGAA
GATGAACTCA GAAAAATCTA TAAAGAACGT CGATACCCTT TCATTATAAG ATTCACATAC
AATCTGTCTT TGCCAAAGAG ACCCAATCGT GCTATTTTAA TAGATCATGT GGGGCTAAAT
GGTTCGCGTG CATTCCGATG GAGTCACTTT AAACTCACAA ATGAGCAGTT CTTAAAGATC
ATCGAGTTAG GCAAGATAAA TGAAAGTTTT ATTATCCATT AA
 
Protein sequence
MDTLKYEKFS DFDHNDPFFD SLKKDYKEFP LWLEKKAREG ESAYVLYDDK HKIEGFMYLK 
ENDDANDINP ALPPGRHLKI GTFKFESKGT LRGQRFLKKA FDHAFSSKSD DIYVTVFDKH
VHLIKLFQTY GFYIHGEKET HNGKEFVYAR SLHEPYGDIL LDYPRIMTSR ANKYLLAIYP
EYHTRLFPDS KLVNESPDIV KDISHANSIH KIYICGMRSV MGMKRGDIIV IYRTGDKKGP
ARYRSVASTL CVVESVKNIS EFLSEDSFVD YCIRFSVFSE DELRKIYKER RYPFIIRFTY
NLSLPKRPNR AILIDHVGLN GSRAFRWSHF KLTNEQFLKI IELGKINESF IIH