Gene ECH74115_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1152 
Symbol 
ID6968794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1173124 
End bp1174326 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content53% 
IMG OID643385153 
Producthypothetical protein 
Protein accessionYP_002269652 
Protein GI209395874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.303869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA TAAAGGCTTT TCAAAAAATA CATGTCGAAC CTCCTCTGGT TCTGTCGATT 
GGGAACCACA GATTATATCC GGAGGAAGGT TCGGCACCAG ATGAGGTAGC CATGCGTGAT
TACGCAAAAG TTTCTCCGCG ATTCTGGCTG GGAGAAACGG GGAGAGAACT TAGAAAGGCG
GGTGCAGAAG CGCAAGTTGT TGCTTTTTAC CTGATGACAT CCCCTCACGC AAATATGCTG
GGTTTGTATT ACCTGCCAGT TTTATACCTT GCTCATGAAA CCGGGCTTGG TCTGGAAGGG
GCTTCAAAGG GGCTTAAAAG GGCTGTTGAA GCTGGTTTTT GTAGCTATGA CCATGATGCA
GAGATGGTCT GGGTCCATGA AATGGCAGCC TGGCAGGTTG GGGAAACGTT GAAGCCTGGC
GATAACCGTT GTGCAGGTGT CAGGAATGAG TATGCATCAT TACCTGAAAA CGCTTTTCTG
TCAGTGTTTT ACGACAGATA TAAAACGGAT TTCCATCTGG ATGTGAGGCG GAATAATAGC
CGAAATTCGG TAAGGGGCTT CGAAGGGGCT TTTAAGGGGC TTCGAAGCCA AGAACAGGAA
CAGGAGCAGG AGAAAGAACA GGAACAGGAC AAAAACACTA TGGTTCATGG CAAAAAAAAC
ACCACGAACC AGGCAGGGGA TGTTCAGACC GTCAATCCTG GTCAGCCAGC AGGCACGACA
CCGGAAGCCG ATTCGGGCGC TGTGCAGCAG GTGATGACCG CAGGGTCGGA GCAATCACAC
CAACTGCAGC AGCCTGAAGC CGATTCCGCC ATTCAGCGGG AAGCCGATCG GGTAGTCCCG
GAAAGCACCG GGCAGTCTGT GGGACGAGTG GATTATCCGG ATGTGTTCGA ACAGGTCTGG
CGGGAATACC CGTTGCGTGC TGGGGCAAAC CCGAAGAAAT CCGCTTTCAG TGCCTGGAAG
GCCAGATTGC GCGAGGGGGT GCCACCAGAG ACCATGCTGG ATGGTGTGAG GCGTTACGCG
AGATACCTGG CGGCGACCGG GAAAGCGGGA ACGGAATTTG TTCAGCGAGC GACGACGTTT
TTTGGGCCGG ACCGGAATTT TGAAAACCCC TGGTTGCTCC CGGTAAGCGG CACGAACAAC
CAGCGTTGTG TGAATCATAT TTCTGAACCG GATACCGAAA TTCCGCCGGG ATTCAGGGGG
TGA
 
Protein sequence
MSLIKAFQKI HVEPPLVLSI GNHRLYPEEG SAPDEVAMRD YAKVSPRFWL GETGRELRKA 
GAEAQVVAFY LMTSPHANML GLYYLPVLYL AHETGLGLEG ASKGLKRAVE AGFCSYDHDA
EMVWVHEMAA WQVGETLKPG DNRCAGVRNE YASLPENAFL SVFYDRYKTD FHLDVRRNNS
RNSVRGFEGA FKGLRSQEQE QEQEKEQEQD KNTMVHGKKN TTNQAGDVQT VNPGQPAGTT
PEADSGAVQQ VMTAGSEQSH QLQQPEADSA IQREADRVVP ESTGQSVGRV DYPDVFEQVW
REYPLRAGAN PKKSAFSAWK ARLREGVPPE TMLDGVRRYA RYLAATGKAG TEFVQRATTF
FGPDRNFENP WLLPVSGTNN QRCVNHISEP DTEIPPGFRG