Gene ECH74115_5115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5115 
Symbol 
ID6967871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4758700 
End bp4760385 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content53% 
IMG OID643388788 
Producthypothetical protein 
Protein accessionYP_002273214 
Protein GI209396187 
COG category[R] General function prediction only 
COG ID[COG2985] Predicted permease 
TIGRFAM ID[TIGR01625] AspT/YidE/YbjL antiporter duplication domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.44003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATCTC AAGAGAAATG GACGATGAGT GATATAGCAT TAACGGTCAG TATTTTGGCT 
TTGGTGGCAG TCGTCGGTTT GTTTATCGGC AACGTCAAAT TTCGCGGCAT AGGATTAGGT
ATTGGCGGCG TGCTGTTTGG TGGGATCATC GTCGGCCATT TTGTTTCTCA GGCGGGGATG
ACATTAAGTA GCGATATGCT GCATGTTATT CAGGAATTTG GCCTGATCCT GTTCGTTTAT
ACCATCGGGA TTCAGGTAGG GCCGGGCTTC TTTGCCTCAT TGCGCGTCTC CGGATTACGC
CTCAACCTGT TTGCTGTTCT GATCGTCATC ATCGGTGGTC TGGTTACCGC CATCCTGCAT
AAACTGTTTG ATATTCCACT GCCGGTAGTG CTGGGGATTT TCTCCGGTGC GGTAACCAAT
ACGCCAGCGC TGGGGGCAGG GCAGCAGATT TTGCGCGACC TGGGTACACC AATGGAAATG
GTCGATCAGA TGGGGATGAG TTACGCGATG GCGTATCCAT TCGGCATTTG CGGGATATTG
TTCACCATGT GGATGTTGCG GGTTATTTTC CGCGTCAATG TCGAGACAGA AGCCCAGCAG
CACGAGTCTT CACGCACCAA TGGCGGCGCG CTGATCAAGA CTATCAATAT TCGCGTTGAG
AACCCTAACC TGCATGATTT AGCCATTAAA GATGTACCGA TTCTCAACGG CGACAAAATT
ATCTGCTCGC GTCTGAAACG CGAAGAAACC CTAAAAGTTC CTTCGCCAGA TACCATTATC
CAACTGGGCG ATTTGCTGCA TCTGGTGGGT CAGCCAGCGG ATTTACATAA TGCGCAACTG
GTGATTGGTC AGGAGGTCGA TACTTCGCTG TCCACGAAAG GCACTGATTT GCGCGTCGAG
CGTGTGGTGG TCACCAATGA AAACGTGCTT GGTAAGCGTA TTCGCGACCT GCATTTTAAA
GAACGCTATG ACGTTGTTAT CTCGCGCCTG AACCGTGCCG GGGTCGAACT GGTCGCCAGT
GGCGATATCA GCCTGCAGTT CGGCGATATC CTCAACCTGG TGGGGCGTCC GTCCGCAATT
GATGCCGTTG CCAATGTGCT GGGGAATGCG CAGCAAAAAC TGCAACAGGT TCAGATGCTG
CCAGTGTTTA TTGGCATCGG GCTTGGCGTA TTGTTAGGTT CTATTCCCGT CTTTGTGCCA
GGATTCCCGG CCGCGTTGAA ACTGGGGCTG GCGGGCGGTC CGCTGATTAT GGCGTTGATC
CTCGGGCGTA TCGGCAGTAT CGGCAAGCTG TACTGGTTTA TGCCGCCAAG TGCCAACCTC
GCGCTGCGGG AACTGGGGAT CGTGCTGTTC CTCTCGGTCG TTGGTCTGAA ATCTGGTGGG
GATTTTGTGA ATACCCTGGT CAATGGCGAA GGGCTAAGCT GGATTGGTTA TGGTGCCCTG
ATCACCGCCG TTCCGCTGAT TACTGTTGGC ATTCTGGCGC GGATGTTAGC CAAAATGAAT
TACCTGACCA TGTGCGGGAT GCTGGCAGGT TCCATGACCG ATCCTCCGGC GCTGGCGTTT
GCTAATAATC TTCATCCAAC CAGCGGTGCG GCGGCGCTCT CTTACGCCAC TGTCTATCCG
TTAGTGATGT TCCTGCGCAT TATCACCCCC CAATTACTGG CGGTGCTCTT CTGGAGTATC
GGTTAA
 
Protein sequence
MLSQEKWTMS DIALTVSILA LVAVVGLFIG NVKFRGIGLG IGGVLFGGII VGHFVSQAGM 
TLSSDMLHVI QEFGLILFVY TIGIQVGPGF FASLRVSGLR LNLFAVLIVI IGGLVTAILH
KLFDIPLPVV LGIFSGAVTN TPALGAGQQI LRDLGTPMEM VDQMGMSYAM AYPFGICGIL
FTMWMLRVIF RVNVETEAQQ HESSRTNGGA LIKTINIRVE NPNLHDLAIK DVPILNGDKI
ICSRLKREET LKVPSPDTII QLGDLLHLVG QPADLHNAQL VIGQEVDTSL STKGTDLRVE
RVVVTNENVL GKRIRDLHFK ERYDVVISRL NRAGVELVAS GDISLQFGDI LNLVGRPSAI
DAVANVLGNA QQKLQQVQML PVFIGIGLGV LLGSIPVFVP GFPAALKLGL AGGPLIMALI
LGRIGSIGKL YWFMPPSANL ALRELGIVLF LSVVGLKSGG DFVNTLVNGE GLSWIGYGAL
ITAVPLITVG ILARMLAKMN YLTMCGMLAG SMTDPPALAF ANNLHPTSGA AALSYATVYP
LVMFLRIITP QLLAVLFWSI G