Gene EcSMS35_1308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1308 
SymbolflhA 
ID6147295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1296042 
End bp1298120 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content55% 
IMG OID641616186 
Productflagellar biosynthesis protein FlhA 
Protein accessionYP_001743366 
Protein GI170682236 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1298] Flagellar biosynthesis pathway, component FlhA 
TIGRFAM ID[TIGR01398] flagellar biosynthesis protein FlhA 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.201196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCGCAAACC TGAAATCGAC ACAATGGCAG 
ATCCTTGCCG GACCGATTTT GATCCTGTTG ATCTTGTCGA TGATGGTACT GCCACTGCCC
GCATTCATAC TCGACCTGCT GTTTACCTTC AATATTGCCT TGTCGATCAT GGTGTTGCTG
GTGGCGATGT TTACCCAGCG CACGCTTGAG TTTGCTGCGT TTCCGACCAT CCTGTTGTTT
ACCACGCTGT TGCGTCTGGC ACTTAACGTG GCTTCAACCC GTATCATTTT GATGGAAGGG
CATACCGGCG CGGCGGCGGC TGGAAAGGTG GTCGAAGCGT TCGGTCACTT CCTCGTTGGT
GGCAATTTCG CTATCGGTAT CGTGGTGTTT GTCATTCTCG TGATCATCAA CTTTATGGTC
ATTACCAAAG GTGCCGGGCG TATCGCAGAA GTGGGTGCGC GCTTTGTTCT CGATGGCATG
CCGGGTAAGC AGATGGCGAT TGACGCCGAC CTTAACGCTG GATTGATTGG TGAAGATGAG
GCGAAAAAAC GCCGCTCCGA AGTGACTCAG GAAGCCGATT TTTACGGCTC AATGGATGGG
GCAAGTAAGT TTGTTCGCGG CGATGCCATC GCCGGGATCC TCATCATGGT CATTAACGTT
GTCGGCGGGT TGCTGGTCGG CGTGCTGCAA CATGGCATGA GCATGGGACA CGCGGCGGAA
AGTTATACGC TATTGACCAT TGGCGACGGT CTGGTGGCAC AAATTCCGGC GTTGGTGATT
TCTACCGCCG CGGGGGTCAT CGTTACGCGT GTCAGCACCG ATCAGGATGT TGGCGAGCAG
ATGGTGAATC AGCTTTTCAG TAACCCAAGC GTTATGTTGT TAAGCGCCGC CGTGCTCGGT
TTACTCGGCC TGGTGCCTGG AATGCCAAAC CTGGTATTTT TGCTGTTCAC TGCCGGATTG
CTGGGGCTGG CCTGGTGGAT CCGTGGACGT GAACAAAAAG CGCCTGCCGA ACCAAAACCG
GTAAAAATGG CAGAGAATAA TACCGTTGTC GAAGCGACGT GGAACGATGT ACAACTGGAA
GATTCTCTGG GAATGGAAGT GGGTTATCGC CTGATCCCGA TGGTCGATTT CCAGCAGGAT
GGTGAGTTGC TGGGCCGTAT ACGCAGTATT CGCAAGAAAT TTGCCCAGGA GATGGGGTTT
CTGCCGCCGG TGGTGCACAT TCGCGACAAT ATGGATCTGC AACCTGCCCG CTATCGCATT
TTGATGAAAG GCGTGGAGAT AGGCAGTGGT GATGCTTATC CGGGGCGTTG GTTGGCGATT
AACCCTGGTA CCGCTGCCGG GACGTTACCT GGTGAGGCGA CCGTCGATCC GGCATTTGGC
CTGAATGCTA TCTGGATTGA AAGTGCGCTT AAAGAACAGG CGCAGATTCA GGGATACACC
GTGGTTGAGG CCAGCACGGT GGTGGCAACG CATCTTAACC ACCTCATTAG CCAGCATGCC
GCAGAGCTGT TTGGTCGTCA GGAGGCGCAA CAGCTGCTGG ATCGCGTCGC CCAGGAGATG
CCAAAGCTGA CGGAAGATCT CGTTCCTGGC GTCGTCACGC TCACCACACT GCATAAAGTG
CTGCAAAATC TGCTCGATGA AAAAGTACCG ATTCGCGATA TGCGCACTAT TCTCGAAACG
CTGGCGGAAC ATGCGCCCAT CCAAAGCGAT CCACATGAAT TAACCGCCGT CGTGCGCGTG
GCGTTGGGAC GGGCGATTAC CCAGCAGTGG TTTCCTGGCA AAGATGAAGT CCATGTTATT
GGCCTCGATA CACCGCTGGA ACGTTTGTTA CTACAGGCGC TGCAGGGCGG GGGCGGACTG
GAGCCAGGGC TGGCGGATCG TTTACTGGCG CAAACTCAGG AAGCGCTATC CCGTCAGGAG
ATGTTGGGTG CGCCGCCAGT TTTATTGGTG AACCACGCGC TGCGACCTTT ATTGTCTCGC
TTCCTGCGCC GCAGTCTGCC GCAGTTAGTG GTGCTGTCGA ATCTGGAACT GTCTGATAAC
CGACATATCC GCATGACGGC GACAATTGGC GGTAAATGA
 
Protein sequence
MSNLAAMLRL PANLKSTQWQ ILAGPILILL ILSMMVLPLP AFILDLLFTF NIALSIMVLL 
VAMFTQRTLE FAAFPTILLF TTLLRLALNV ASTRIILMEG HTGAAAAGKV VEAFGHFLVG
GNFAIGIVVF VILVIINFMV ITKGAGRIAE VGARFVLDGM PGKQMAIDAD LNAGLIGEDE
AKKRRSEVTQ EADFYGSMDG ASKFVRGDAI AGILIMVINV VGGLLVGVLQ HGMSMGHAAE
SYTLLTIGDG LVAQIPALVI STAAGVIVTR VSTDQDVGEQ MVNQLFSNPS VMLLSAAVLG
LLGLVPGMPN LVFLLFTAGL LGLAWWIRGR EQKAPAEPKP VKMAENNTVV EATWNDVQLE
DSLGMEVGYR LIPMVDFQQD GELLGRIRSI RKKFAQEMGF LPPVVHIRDN MDLQPARYRI
LMKGVEIGSG DAYPGRWLAI NPGTAAGTLP GEATVDPAFG LNAIWIESAL KEQAQIQGYT
VVEASTVVAT HLNHLISQHA AELFGRQEAQ QLLDRVAQEM PKLTEDLVPG VVTLTTLHKV
LQNLLDEKVP IRDMRTILET LAEHAPIQSD PHELTAVVRV ALGRAITQQW FPGKDEVHVI
GLDTPLERLL LQALQGGGGL EPGLADRLLA QTQEALSRQE MLGAPPVLLV NHALRPLLSR
FLRRSLPQLV VLSNLELSDN RHIRMTATIG GK