Gene ECH74115_2616 details


Gene Information

Locus tag: ECH74115_2616
Symbol: flhA
ID: 6969794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2471196
End bp: 2473274
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 55%
IMG OID: 643386481
Product: flagellar biosynthesis protein FlhA
Protein accession: YP_002270963
Protein GI: 209396703
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1298] Flagellar biosynthesis pathway, component FlhA
TIGRFAM ID: [TIGR01398] flagellar biosynthesis protein FlhA
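
The length fields in this record can be cross-checked against its coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 2471196
END_BP = 2473274


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 2079 692
```

Under these assumptions the computed values agree with the reported gene length (2079 bp) and protein length (692 aa).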


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0000268529
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCACAAACC TGAAATCGAC ACAATGGCAG 
ATCCTTGCCG GACCGATTTT GATCCTGTTG ATCTTGTCGA TGATGGTGCT GCCACTGCCC
GCATTCATAC TCGACCTGTT GTTTACCTTC AATATTGCCT TGTCGATCAT GGTGTTGCTG
GTGGCGATGT TTACCCAGCG CACGCTTGAG TTTGCTGCGT TTCCGACCAT TTTGTTGTTT
ACCACGCTGT TGCGTCTGGC ACTTAACGTG GCTTCAACCC GTATCATTTT AATGGAAGGG
CATACCGGCG CAGCGGCGGC AGGGAAGGTG GTTGAAGCGT TCGGTCACTT CCTCGTTGGT
GGCAATTTCG CTATCGGTAT CGTGGTGTTT GTCATTCTCG TGATCATCAA CTTTATGGTC
ATTACCAAAG GTGCCGGGCG TATCGCAGAA GTGGGTGCGC GCTTTGTTCT CGATGGTATG
CCGGGTAAGC AGATGGCGAT TGACGCCGAC CTTAACGCCG GATTGATTGG TGAAGATGAG
GCGAAAAAAC GCCGCTCCGA AGTCACTCAG GAAGCCGATT TTTACGGCTC AATGGACGGG
GCAAGTAAGT TTGTTCGCGG CGATGCCATC GCCGGGATCC TCATCATGGT CATTAACGTT
GTCGGCGGGT TGCTGGTCGG CGTGCTGCAA CATGGCATGA GCATGGGACA CGCGGCGGAA
AGTTATACGC TCTTGACCAT TGGTGACGGT CTGGTGGCAC AAATCCCGGC TCTGGTGATT
TCTACCGCCG CGGGGGTCAT TGTTACGCGT GTCAGCACCG ATCAGGATGT TGGCGAGCAG
ATGGTGAATC AGCTTTTCAG TAACCCAAGC GTTATGTTGT TAAGCGCTGC CGTGCTCGGT
TTACTCGGCC TGGTGCCTGG AATGCCGAAC CTGGTATTTT TGCTGTTCAC TGCCGGATTG
CTGGGGCTGG CCTGGTGGAT CCGTGGACGC GAACAAAAAG CGCCTGCCGA ACCAAAACCG
GTAAAAATGG CAGAGAATAA TGCCGTTGTC GAAGCGACGT GGAACGATGT ACAACTGGAA
GATTCTCTGG GAATGGAAGT GGGTTATCGA CTGATCCCGA TGGTCGATTT CCAGCAGGAT
GGTGAGTTGT TGGGCCGTAT ACGCAGTATC CGCAAGAAAT TTGCCCAGGA GATGGGATTT
CTGCCGCCAG TGGTGCACAT TCGCGACAAT ATGGATCTGC AACCTGCCCG CTATCGCATT
TTGATGAAAG GCGTGGAGAT AGGCAGTGGT GATGCTTATC CGGGGCGCTG GCTGGCTATT
AACCCTGGAA CCGCTGCCGG GACGTTACCT GGTGAGGCGA CCGTCGATCC GGCATTTGGC
CTGAATGCTA TCTGGATTGA AAGTGCGCTA AAAGAACAGG CGCAGATTCA GGGGTACACA
GTGGTTGAGG CCAGCACGGT GGTAGCAACG CATCTTAACC ACCTCATTAG CCAGCATGCC
GCAGAGCTGT TTGGTCGTCA GGAGGCGCAA CAGCTGCTGG ATCGCGTCGC CCAGGAGATG
CCAAAGCTGA CGGAAGATCT CGTCCCTGGC GTCGTCACGC TCACCACACT GCATAAAGTG
CTGCAAAATC TCCTCGATGA AAAAGTACCG ATTCGCGATA TGCGCACTAT TCTCGAAACG
CTGGCGGAAC ATGCGCCCAT CCAAAGCGAT CCACATGAAT TAACCGCCGT CGTGCGCGTG
GCGTTGGGAC GGGCGATTAC CCAGCAGTGG TTTCCTGGCA AAGATGAAGT CCATGTTATT
GGCCTCGATA CACCGCTGGA ACGTTTGTTG CTACAGGCGT TGCAGGGCGG GGGCGGACTG
GAGCCAGGGC TGGCGGATCG TTTACTGGCG CAAACTCAGG AAGCGCTATC CCGTCAGGAG
ATGTTGGGTG CGCCGCCAGT TTTATTGGTG AACCACGCGC TGCGACCATT ATTGTCTCGC
TTTCTGCGCC GCAGCTTGCC GCAGTTAGTG GTGCTGTCGA ATCTGGAACT GTCTGATAAC
CGACATATCC GCATGACGGC GACAATTGGC GGTAAATGA
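
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured. The following is a minimal sketch of how a GC fraction such as the reported 55% could be recomputed; the helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence, used here only as sample input.
first_row = "ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCACAAACC TGAAATCGAC ACAATGGCAG"
print(len(clean_sequence(first_row)))  # prints: 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing all rows of the gene sequence as one string gives the GC fraction of the whole gene.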
 
Protein sequence
MSNLAAMLRL PTNLKSTQWQ ILAGPILILL ILSMMVLPLP AFILDLLFTF NIALSIMVLL 
VAMFTQRTLE FAAFPTILLF TTLLRLALNV ASTRIILMEG HTGAAAAGKV VEAFGHFLVG
GNFAIGIVVF VILVIINFMV ITKGAGRIAE VGARFVLDGM PGKQMAIDAD LNAGLIGEDE
AKKRRSEVTQ EADFYGSMDG ASKFVRGDAI AGILIMVINV VGGLLVGVLQ HGMSMGHAAE
SYTLLTIGDG LVAQIPALVI STAAGVIVTR VSTDQDVGEQ MVNQLFSNPS VMLLSAAVLG
LLGLVPGMPN LVFLLFTAGL LGLAWWIRGR EQKAPAEPKP VKMAENNAVV EATWNDVQLE
DSLGMEVGYR LIPMVDFQQD GELLGRIRSI RKKFAQEMGF LPPVVHIRDN MDLQPARYRI
LMKGVEIGSG DAYPGRWLAI NPGTAAGTLP GEATVDPAFG LNAIWIESAL KEQAQIQGYT
VVEASTVVAT HLNHLISQHA AELFGRQEAQ QLLDRVAQEM PKLTEDLVPG VVTLTTLHKV
LQNLLDEKVP IRDMRTILET LAEHAPIQSD PHELTAVVRV ALGRAITQQW FPGKDEVHVI
GLDTPLERLL LQALQGGGGL EPGLADRLLA QTQEALSRQE MLGAPPVLLV NHALRPLLSR
FLRRSLPQLV VLSNLELSDN RHIRMTATIG GK
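
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged sketch: the codon-to-amino-acid assignments of NCBI table 11 are the same as those of the standard code, so a plain codon lookup is enough for this gene, which starts with ATG. The sketch does not handle the alternative start codons that table 11 permits, and `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted CDS, dropping a trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    return "".join(residues).rstrip("*")


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGTAATC TGGCCGCGAT GCTGCGCCTG"))  # prints: MSNLAAMLRL
print(translate("GGTAAATGA"))  # prints: GK
```

The two outputs match the first ten residues and the last two residues of the protein sequence shown above.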