Gene ECH74115_5571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5571 
Symbol 
ID6971233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5210129 
End bp5211421 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content33% 
IMG OID643389210 
Producthypothetical protein 
Protein accessionYP_002273607 
Protein GI209396870 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.422571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCCCTCTTT GCCTGATTAA CGATAACCAC 
TCTGTCACAA GTCTATCACA TACAAAGAAA ACAAAATCAG ATAATTACAG CAAACATCAT
AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA
GTGATAGACT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC
ACGAAAGAAA CGGACAGGAA AATACATAAA TATTTTATAG ATATAGCTTC ATATGCAAAT
AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA ATAAAGATAA GGAAGTGTCA
ATTAAGGTGG TATATTTTAT AAATAATGTC ACCGTCCATA ATAATACTAT CGAAATCCCA
CAGACAGTGA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA
GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTCCAG
GACTGTAATA TGTATAAAAC GAATTTTAAT TTCGCCATAA TGGAAAAAAT ACTTTTTGAT
AATTGTATTC TCGATGACTC ATATTTCGCT CAGATAAAAA TGACTGACGG AACTCTAAAT
TCATGCTCCG CTATGCATGT TCAATTCTAC AATGCAACAA TGAATAGAGC CAATATTAAA
AATACCTTCC TTGATTATTC AAATTTTTAT ATGGCATACA TGGCTGAGGT AAATCTTTAT
AAAGTAATAG CGCCATATAT TAATTTATTT AGAGCCGACC TTAGCTTCTC TAAACTTGAT
TTAATTAACT TTAAACATGC TGATCTGTCT CGCGTCAATC TGAACAAAGC AATCCTCCAG
AATATAAACT TAATTGATAG TAAACTCTTT TTTACACGGC TAACAAATAC GTTCCTCGAA
ATGGTTATAT GCACCGATTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAAC
AATTGCCATT TCAACTGTTC TGTTTTAACA AAAGCCTGGA TGTTTAATAC CCGTCTCTAT
CGGGTTAATT TCGATGAGGC TAGCGTCCAG GGAATGGGCA TTTCCATTCT CCGTGGGGAG
GAGAATATTC CCATTAATAG TGATACCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT
TGTACCTCTC ATACTGGCAT GTCACAAACT GAGAATAATA CTCATGAAGT AGCTATGAAG
ATTACTGCAG ATATTATGCA ACACGCAGAT TGA
 
Protein sequence
MRYNGLNNMF FPLCLINDNH SVTSLSHTKK TKSDNYSKHH KNTLIDNKAL SLFKMDDHEK 
VIDLIQKMKR IYDSLPSGKI TKETDRKIHK YFIDIASYAN NKCDDRITRR VYLNKDKEVS
IKVVYFINNV TVHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ
DCNMYKTNFN FAIMEKILFD NCILDDSYFA QIKMTDGTLN SCSAMHVQFY NATMNRANIK
NTFLDYSNFY MAYMAEVNLY KVIAPYINLF RADLSFSKLD LINFKHADLS RVNLNKAILQ
NINLIDSKLF FTRLTNTFLE MVICTDSNMA NVNFNNANLN NCHFNCSVLT KAWMFNTRLY
RVNFDEASVQ GMGISILRGE ENIPINSDTL VTLQKFFEED CTSHTGMSQT ENNTHEVAMK
ITADIMQHAD