Gene ECH74115_0274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0274 
Symbol 
ID6970072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp288805 
End bp290544 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content59% 
IMG OID643384340 
Producttype III secretion protein, FHIPEP family 
Protein accessionYP_002268856 
Protein GI209397087 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1298] Flagellar biosynthesis pathway, component FlhA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.807219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCAC GCTCGGATTT ACTGACGCTG TTGACAATCA ACTTTATCGT CGTCACCAAA 
GGGGCCGAGC GTATTTCCGA GGTTTCTGCC CGCTTTACCC TGGATGCGAT GCCCGGCAAA
CAGATGGCGA TTGACGCCGA TCTTAACGCC GGATTGATCA ACCAGGCACA GGCGCAAACC
CGGCGTAAAG ATGTTGCCAG CGAGGCCGAT TTCTACGGCG CGATGGACGG GGCATCGAAG
TTTGTGCGCG GGGATGCCAT CGCCGGGATG ATGATTCTGG CGATTAACCT GATCGGCGGC
GTCTGTATCG GGATATTTAA ATACAACCTG AGCGCCGACG CCGCCTTCCA GCAATATGTG
CTGATGACCA TCGGCGATGG GCTGGTGGCG CAGATCCCTT CCCTCCTGCT CTCCACCGCA
GCGGCGATTA TCGTCACCCG CGTCAGCGAC AACGGCGATA TCGCCCATGA CGTGCGCCAC
CAGCTGCTGG CAAGCCCGTC GGTGCTCTAC ACCGCCACCG GGATTATGTT TGTGCTGGCG
GTGGTGCCGG GAATGCCGCA CCTGCCGTTT TTGCTGTTCA GCGCCCTGCT GGGTTTTACC
GGCTGGCGGA TGAGCAAACA GCCGCAGGCG GCGGAGGCGG AAGAGAAAAG CCTCGAAACG
CTGACCCGCA CCATCACTGA AACCAGCGAG CAACAGGTCA GTTGGGAAAC CATTCCGCTT
ATCGAGCCTA TCAGTTTAAG CCTCGGCTAC AAGCTGGTGG CGCTGGTCGA TAAAGCGCAG
GGCAACCCGC TCACCCAGCG TATTCGCGGC GTACGGCAGG TGATATCCGA CGGTAACGGC
GTGCTGCTGC CGGAGATCCG CATTCGGGAA AACTTCCGCC TCAAGCCCAG CCAGTACGCC
ATTTTCATCA ACGGCATTAA GGCCGATGAA GCGGATATTC CGGCGGATAA ACTGATGGCG
CTGCCCTCCA GCGAAACCTA CGGCGAGATT GACGGCGTGC TGGGGAACGA CCCGGCGTAC
GGGATGCCGG TGACCTGGAT CCAGCCCGCG CAGAAGGCGA AGGCGCTGAA TATGGGGTAT
CAGGTGATCG ACAGCGCCAG CGTGATTGCC ACGCATGTGA ACAAGATTGT GCGCAGCTAT
ATTCCTGATT TGTTTAATTA TGATGACATC ACGCAGCTGC ATAACCGCCT GTCGTCGATG
GCCCCGCGCC TGGCGGAAGA TTTGAGCGCG GCGCTGAATT ACAGCCAGTT GCTGAAAGTG
TACCGTGCGC TGCTGACCGA AGGCGTTTCC CTGCGCGATA TCGTCACCAT CGCCACCGTG
CTGGTCGCCA GTAGCGCGGT GACTAAAGAT CATATTCTGC TGGCGGCCGA TGTGCGCCTG
GCGCTGCGGC GCAGCATTAC CCATCCGTTC GTTCGCAAGC AGGAGCTGAC GGTGTATACG
CTGAATAATG AGCTGGAAAA TCTGCTGACT AACGTAGTGA ATCAGGCGCA ACAGGGCGGT
AAAGTGATGC TCGACAGCGT GCCAGTGGAC CCGAATATGC TCAACCAGTT CCAGAGCACG
ATGCCGCAGG TGAAAGAGCA GATGAAAGCG GCGGGGAAAG ACCCGGTGCT GCTGGTGCCG
CCGCAGCTGC GCCCTTTGCT GGCGCGCTAT GCAAGGTTGT TTGCGCCGGG GCTGCATGTG
CTGTCGTATA ACGAAGTGCC GGATGAGCTG GAGTTGAAGA TAATGGGGGC GTTGAGTTGA
 
Protein sequence
MLSRSDLLTL LTINFIVVTK GAERISEVSA RFTLDAMPGK QMAIDADLNA GLINQAQAQT 
RRKDVASEAD FYGAMDGASK FVRGDAIAGM MILAINLIGG VCIGIFKYNL SADAAFQQYV
LMTIGDGLVA QIPSLLLSTA AAIIVTRVSD NGDIAHDVRH QLLASPSVLY TATGIMFVLA
VVPGMPHLPF LLFSALLGFT GWRMSKQPQA AEAEEKSLET LTRTITETSE QQVSWETIPL
IEPISLSLGY KLVALVDKAQ GNPLTQRIRG VRQVISDGNG VLLPEIRIRE NFRLKPSQYA
IFINGIKADE ADIPADKLMA LPSSETYGEI DGVLGNDPAY GMPVTWIQPA QKAKALNMGY
QVIDSASVIA THVNKIVRSY IPDLFNYDDI TQLHNRLSSM APRLAEDLSA ALNYSQLLKV
YRALLTEGVS LRDIVTIATV LVASSAVTKD HILLAADVRL ALRRSITHPF VRKQELTVYT
LNNELENLLT NVVNQAQQGG KVMLDSVPVD PNMLNQFQST MPQVKEQMKA AGKDPVLLVP
PQLRPLLARY ARLFAPGLHV LSYNEVPDEL ELKIMGALS