Gene ECH74115_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1703 
Symbol 
ID6971148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1640590 
End bp1642020 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content51% 
IMG OID643385660 
Producthypothetical protein 
Protein accessionYP_002270154 
Protein GI209397160 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.401694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTTC ATTTTTTACC GGAAGTTACC GACGTTTTGA GCCGTTTCGT TCCTCGCATT 
ATTCCGTTTT ATTTACTCTT GCTGGCGGCA GGCGGTACAG CTAACGCACA ATCTACCTTC
GAGCAAAAAG CGGCAAATCC CTTTGATAAT AACAATGATG GTCTGCCGGA TTTAGGCATG
GCGCCTGAAA ATCATGATGG GGAAAAACAC TTTGCTGAAA TTGTGAAAGA TTTCGGCGAA
ACCAGTATGA ATGATAACGG GCTGGATACT GGCGAGCAGG CAAAAGCTTT CGCATTGGGA
AAAGTCCGCG ACGCGCTTAG TCAACAGGTT AATCAGCACG TAGAGTCGTG GCTATCACCG
TGGGGAAATG CCAGTGTTGA CGTCAAAGTG GATAACGAAG GTCATTTTAC CGGCAGTCGT
GGAAGCTGGT TTGTGCCGTT ACAAGATAAT GATCGTTATC TCACCTGGAG CCAGCTTGGT
CTTACTCAGC AGGATGATGG GCTGGTGAGC AATGTGGGCG TTGGGCAACG CTGGGCGCGC
GGCAACTGGC TGGTGGGTTA TAACACTTTT TATGACAACT TGCTGGATGA AAATCTTCAG
CGAGCGGGCT TTGGTGCCGA AGCGTGGGGC GAATATTTGC GACTATCGGC AAACTTTTAT
CAGCCGTTTG CTGCATGGCA TGAACAGACA GCCACGCAGG AACAACGGAT GGCGCGCGGG
TACGACCTGA CAGCCCGGAT GCGCATGCCG TTCTATCAAC ACCTCAATAC CAGTGTCAGC
GTAGAACAGT ATTTTGGTGA TCGTGTTGAT TTGTTTAACT CTGGTACGGG TTATCACAAT
CCCGTCGCGT TGAGTCTGGG ATTAAATTAC ACCCCTGTGC CATTAGTCAC TGTGACGGCC
CAGCATAAAC AGGGTGAAAG TGGCGAGAAT CAAAATAACC TCGGGCTGAA TCTTAATTAC
CGCTTTGGTG TACCGCTCAA AAAACAACTT TCTGCGGGCG AGGTTGCCGA AAGTCAGTCG
TTACGTGGTA GTCGCTATGA TAATCCGCAG CGAAATAATC TACCGACTCT TGAGTACCGA
CAGCGAAAAA CGTTAACGGT GTTTCTTGCG ACACCGCCGT GGGATCTAAA ACCTGGCGAA
ACAGTGCCGC TGAAATTACA AATCCGCAGT CGTTACGGTA TTCGGCAACT GATTTGGCAG
GGCGATACGC AGATATTAAG TTTGACGCCG GGCGCACAAG CCAACAGCGC GGAGGGCTGG
ACGCTGATCA TGCCTGACTG GCAGAACGGG GAAGGGGCGA GCAATCACTG GCGATTGTCG
GTGGTGGTGG AAGATAACCA GGGGCAGCGT GTCTCCTCCA ATGAGATCAC GCTAACGCTT
GTCGAACCGT TCGACGCATT GTCAAACGAC GAACTGCGCT GGGAACCGTA A
 
Protein sequence
MFFHFLPEVT DVLSRFVPRI IPFYLLLLAA GGTANAQSTF EQKAANPFDN NNDGLPDLGM 
APENHDGEKH FAEIVKDFGE TSMNDNGLDT GEQAKAFALG KVRDALSQQV NQHVESWLSP
WGNASVDVKV DNEGHFTGSR GSWFVPLQDN DRYLTWSQLG LTQQDDGLVS NVGVGQRWAR
GNWLVGYNTF YDNLLDENLQ RAGFGAEAWG EYLRLSANFY QPFAAWHEQT ATQEQRMARG
YDLTARMRMP FYQHLNTSVS VEQYFGDRVD LFNSGTGYHN PVALSLGLNY TPVPLVTVTA
QHKQGESGEN QNNLGLNLNY RFGVPLKKQL SAGEVAESQS LRGSRYDNPQ RNNLPTLEYR
QRKTLTVFLA TPPWDLKPGE TVPLKLQIRS RYGIRQLIWQ GDTQILSLTP GAQANSAEGW
TLIMPDWQNG EGASNHWRLS VVVEDNQGQR VSSNEITLTL VEPFDALSND ELRWEP