Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1308 |
Symbol | flhA |
ID | 6147295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1296042 |
End bp | 1298120 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616186 |
Product | flagellar biosynthesis protein FlhA |
Protein accession | YP_001743366 |
Protein GI | 170682236 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1298] Flagellar biosynthesis pathway, component FlhA |
TIGRFAM ID | [TIGR01398] flagellar biosynthesis protein FlhA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.201196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCGCAAACC TGAAATCGAC ACAATGGCAG ATCCTTGCCG GACCGATTTT GATCCTGTTG ATCTTGTCGA TGATGGTACT GCCACTGCCC GCATTCATAC TCGACCTGCT GTTTACCTTC AATATTGCCT TGTCGATCAT GGTGTTGCTG GTGGCGATGT TTACCCAGCG CACGCTTGAG TTTGCTGCGT TTCCGACCAT CCTGTTGTTT ACCACGCTGT TGCGTCTGGC ACTTAACGTG GCTTCAACCC GTATCATTTT GATGGAAGGG CATACCGGCG CGGCGGCGGC TGGAAAGGTG GTCGAAGCGT TCGGTCACTT CCTCGTTGGT GGCAATTTCG CTATCGGTAT CGTGGTGTTT GTCATTCTCG TGATCATCAA CTTTATGGTC ATTACCAAAG GTGCCGGGCG TATCGCAGAA GTGGGTGCGC GCTTTGTTCT CGATGGCATG CCGGGTAAGC AGATGGCGAT TGACGCCGAC CTTAACGCTG GATTGATTGG TGAAGATGAG GCGAAAAAAC GCCGCTCCGA AGTGACTCAG GAAGCCGATT TTTACGGCTC AATGGATGGG GCAAGTAAGT TTGTTCGCGG CGATGCCATC GCCGGGATCC TCATCATGGT CATTAACGTT GTCGGCGGGT TGCTGGTCGG CGTGCTGCAA CATGGCATGA GCATGGGACA CGCGGCGGAA AGTTATACGC TATTGACCAT TGGCGACGGT CTGGTGGCAC AAATTCCGGC GTTGGTGATT TCTACCGCCG CGGGGGTCAT CGTTACGCGT GTCAGCACCG ATCAGGATGT TGGCGAGCAG ATGGTGAATC AGCTTTTCAG TAACCCAAGC GTTATGTTGT TAAGCGCCGC CGTGCTCGGT TTACTCGGCC TGGTGCCTGG AATGCCAAAC CTGGTATTTT TGCTGTTCAC TGCCGGATTG CTGGGGCTGG CCTGGTGGAT CCGTGGACGT GAACAAAAAG CGCCTGCCGA ACCAAAACCG GTAAAAATGG CAGAGAATAA TACCGTTGTC GAAGCGACGT GGAACGATGT ACAACTGGAA GATTCTCTGG GAATGGAAGT GGGTTATCGC CTGATCCCGA TGGTCGATTT CCAGCAGGAT GGTGAGTTGC TGGGCCGTAT ACGCAGTATT CGCAAGAAAT TTGCCCAGGA GATGGGGTTT CTGCCGCCGG TGGTGCACAT TCGCGACAAT ATGGATCTGC AACCTGCCCG CTATCGCATT TTGATGAAAG GCGTGGAGAT AGGCAGTGGT GATGCTTATC CGGGGCGTTG GTTGGCGATT AACCCTGGTA CCGCTGCCGG GACGTTACCT GGTGAGGCGA CCGTCGATCC GGCATTTGGC CTGAATGCTA TCTGGATTGA AAGTGCGCTT AAAGAACAGG CGCAGATTCA GGGATACACC GTGGTTGAGG CCAGCACGGT GGTGGCAACG CATCTTAACC ACCTCATTAG CCAGCATGCC GCAGAGCTGT TTGGTCGTCA GGAGGCGCAA CAGCTGCTGG ATCGCGTCGC CCAGGAGATG CCAAAGCTGA CGGAAGATCT CGTTCCTGGC GTCGTCACGC TCACCACACT GCATAAAGTG CTGCAAAATC TGCTCGATGA AAAAGTACCG ATTCGCGATA TGCGCACTAT TCTCGAAACG CTGGCGGAAC ATGCGCCCAT CCAAAGCGAT CCACATGAAT TAACCGCCGT CGTGCGCGTG GCGTTGGGAC GGGCGATTAC CCAGCAGTGG TTTCCTGGCA AAGATGAAGT CCATGTTATT GGCCTCGATA CACCGCTGGA ACGTTTGTTA CTACAGGCGC TGCAGGGCGG GGGCGGACTG GAGCCAGGGC TGGCGGATCG TTTACTGGCG CAAACTCAGG AAGCGCTATC CCGTCAGGAG ATGTTGGGTG CGCCGCCAGT TTTATTGGTG AACCACGCGC TGCGACCTTT ATTGTCTCGC TTCCTGCGCC GCAGTCTGCC GCAGTTAGTG GTGCTGTCGA ATCTGGAACT GTCTGATAAC CGACATATCC GCATGACGGC GACAATTGGC GGTAAATGA
|
Protein sequence | MSNLAAMLRL PANLKSTQWQ ILAGPILILL ILSMMVLPLP AFILDLLFTF NIALSIMVLL VAMFTQRTLE FAAFPTILLF TTLLRLALNV ASTRIILMEG HTGAAAAGKV VEAFGHFLVG GNFAIGIVVF VILVIINFMV ITKGAGRIAE VGARFVLDGM PGKQMAIDAD LNAGLIGEDE AKKRRSEVTQ EADFYGSMDG ASKFVRGDAI AGILIMVINV VGGLLVGVLQ HGMSMGHAAE SYTLLTIGDG LVAQIPALVI STAAGVIVTR VSTDQDVGEQ MVNQLFSNPS VMLLSAAVLG LLGLVPGMPN LVFLLFTAGL LGLAWWIRGR EQKAPAEPKP VKMAENNTVV EATWNDVQLE DSLGMEVGYR LIPMVDFQQD GELLGRIRSI RKKFAQEMGF LPPVVHIRDN MDLQPARYRI LMKGVEIGSG DAYPGRWLAI NPGTAAGTLP GEATVDPAFG LNAIWIESAL KEQAQIQGYT VVEASTVVAT HLNHLISQHA AELFGRQEAQ QLLDRVAQEM PKLTEDLVPG VVTLTTLHKV LQNLLDEKVP IRDMRTILET LAEHAPIQSD PHELTAVVRV ALGRAITQQW FPGKDEVHVI GLDTPLERLL LQALQGGGGL EPGLADRLLA QTQEALSRQE MLGAPPVLLV NHALRPLLSR FLRRSLPQLV VLSNLELSDN RHIRMTATIG GK
|
| |