Gene ECD_01850 details


Gene Information

Locus tag: ECD_01850
Symbol: flhA
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1908890
End bp: 1910968
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: flagellar biosynthesis protein A
Protein accession: ACT43704
Protein GI: 253978034
COG category:
COG ID:
TIGRFAM ID:
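
The length fields above follow from the coordinates by simple arithmetic. The sketch below is one way to check that, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of this database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1908890, 1910968)
print(length, protein_length_aa(length))  # 2079 692
```

Both values agree with the Gene Length and Protein Length fields of this record.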


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.585433
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCGCAAACC TGAAATCGAC ACAATGGCAG 
ATCCTTGCCG GACCGATTTT GATCCTGTTG ATCTTGTCGA TGATGGTGCT GCCACTGCCC
GCATTCATAC TCGACCTGTT GTTTACCTTC AATATTGCCT TGTCGATCAT GGTGTTGCTG
GTGGCGATGT TTACCCAGCG CACGCTTGAG TTTGCTGCGT TTCCGACCAT TCTGTTGTTT
ACCACGCTGT TGCGTCTGGC ACTTAACGTG GCTTCAACCC GTATCATTTT AATGGAAGGG
CATACCGGCG CGGCGGCGGC AGGGAAGGTG GTCGAAGCGT TCGGTCACTT CCTCGTTGGT
GGCAATTTCG CTATCGGTAT CGTGGTGTTT GTCATTCTCG TGATCATCAA CTTTATGGTC
ATTACCAAAG GTGCCGGGCG TATCGCAGAA GTGGGTGCGC GCTTTGTTCT CGATGGTATG
CCGGGTAAGC AGATGGCGAT TGACGCCGAC CTTAACGCCG GATTGATTGG TGAAGATGAG
GCGAAAAAAC GCCGCTCCGA AGTGACTCAG GAAGCCGATT TTTACGGCTC AATGGACGGG
GCAAGTAAGT TTGTTCGCGG CGATGCCATC GCCGGGATCC TCATCATGGT CATTAACGTT
GTCGGCGGGT TGCTGGTCGG CGTGCTGCAA CATGGCATGA GCATGGGACA CGCGGCGGAA
AGTTATACGC TATTGACCAT TGGCGACGGT CTGGTGGCAC AAATTCCGGC GCTGGTGATT
TCTACCGCCG CGGGGGTCAT CGTTACGCGT GTCAGCACCG ATCAGGATGT TGGCGAGCAG
ATGGTGAATC AGCTTTTCAG TAACCCAAGC GTTATGTTGT TAAGCGCCGC CGTGCTCGGT
TTACTCGGCC TGGTGCCTGG AATGCCGAAC CTGGTATTTT TGCTGTTCAC TGCCGGATTG
CTCGGGCTGG CCTGGTGGAT ACGCGGACGC GAACAAAAAG CGCCTGCCGA ACCCAAACCG
GTAAAAATGG CAGAGAATAA TACCGTTGTC GAAGCGACGT GGAACGATGT ACAACTGGAA
GATTCTCTGG GAATGGAAGT GGGTTATCGA CTGATCCCGA TGGTCGATTT CCAGCAGGAT
GGTGAGTTGT TGGGCCGTAT ACGCAGTATC CGCAAGAAAT TTGCCCAGGA GATGGGATTT
CTGCCGCCAG TGGTGCACAT TCGCGACAAT ATGGATCTGC AACCTGCCCG CTATCGCATT
TTGATGAAAG GCGTGGAGAT TGGCAGTGGT GATGCTTATC CGGGGCGCTG GCTGGCGATT
AACCCTGGAA TCGCTGCCGG GACGTTACCT GGTGAGGCGA CCGTCGATCC GGCATTTGGC
CTGAATGCTA TCTGGATTGA AAGTGCGCTA AAAGAACAGG CGCAGATTCA GGGGTACACA
GTGGTTGAGG CCAGCACGGT GGTAGCAACG CATCTTAACC ACCTCATTAG CCAGCATGCC
GCAGAGCTGT TTGGTCGTCA GGAGGCGCAA CAGCTGTTGG ATCGCGTCGC CCAGGAGATG
CCAAAGCTGA CGGAAGATCT CGTTCCTGGC GTCGTCACGC TCACCACACT GCATAAAGTG
CTGCAAAATC TCCTCGATGA AAAAGTACCG ATTCGCGATA TGCGCACCAT TCTCGAAACG
CTGGCGGAAC ATGCGCCCAT CCAAAGCGAT CCACATGAAT TAACCGCCGT CGTGCGCGTG
GCGTTGGGAC GGGCGATTAC CCAGCAGTGG TTTCCTGGCA AAGATGAAGT CCATGTTATT
GGCCTCGATA CACCGCTGGA ACGTTTGTTA CTACAGGCGC TGCAGGGCGG GGGAGGACTG
GAGCCAGGGC TGGCGGATCG TTTACTGGCG CAAACTCAGG AAGCGCTATC CCGTCAGGAG
ATGCTGGGTG CGCCGCCAGT ATTGTTGGTG AACCACGCGC TGCGACCATT ATTGTCTCGC
TTCCTGCGCC GCAGCTTGCC GCAGTTAGTG GTCCTGTCGA ATCTGGAACT GTCTGATAAC
CGACATATCC GCATGACGGC GACAATTGGC GGCAAATAA
 
Protein sequence
MSNLAAMLRL PANLKSTQWQ ILAGPILILL ILSMMVLPLP AFILDLLFTF NIALSIMVLL 
VAMFTQRTLE FAAFPTILLF TTLLRLALNV ASTRIILMEG HTGAAAAGKV VEAFGHFLVG
GNFAIGIVVF VILVIINFMV ITKGAGRIAE VGARFVLDGM PGKQMAIDAD LNAGLIGEDE
AKKRRSEVTQ EADFYGSMDG ASKFVRGDAI AGILIMVINV VGGLLVGVLQ HGMSMGHAAE
SYTLLTIGDG LVAQIPALVI STAAGVIVTR VSTDQDVGEQ MVNQLFSNPS VMLLSAAVLG
LLGLVPGMPN LVFLLFTAGL LGLAWWIRGR EQKAPAEPKP VKMAENNTVV EATWNDVQLE
DSLGMEVGYR LIPMVDFQQD GELLGRIRSI RKKFAQEMGF LPPVVHIRDN MDLQPARYRI
LMKGVEIGSG DAYPGRWLAI NPGIAAGTLP GEATVDPAFG LNAIWIESAL KEQAQIQGYT
VVEASTVVAT HLNHLISQHA AELFGRQEAQ QLLDRVAQEM PKLTEDLVPG VVTLTTLHKV
LQNLLDEKVP IRDMRTILET LAEHAPIQSD PHELTAVVRV ALGRAITQQW FPGKDEVHVI
GLDTPLERLL LQALQGGGGL EPGLADRLLA QTQEALSRQE MLGAPPVLLV NHALRPLLSR
FLRRSLPQLV VLSNLELSDN RHIRMTATIG GK
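
The sequences above are laid out in space-separated blocks of 10 characters. The following is a minimal sketch of how that layout could be parsed, and how GC content and a translation could be recomputed from it. It assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons, and it does not handle alternative start codons or ambiguous bases. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, as displayed in this record.
first_row = clean_sequence(
    "ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCGCAAACC TGAAATCGAC ACAATGGCAG"
)
print(translate(first_row))  # MSNLAAMLRLPANLKSTQWQ
```

The translated first row matches the first two blocks of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` would allow the reported GC content to be checked in the same way.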