Gene EcDH1_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1761 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1912229 
End bp1914307 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content55% 
IMG OID 
Productflagellar biosynthesis protein FlhA 
Protein accessionACX39420 
Protein GI260448998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.98256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC TGGCCGCGAT GCTGCGCCTG CCCGCAAACC TGAAATCGAC ACAATGGCAG 
ATCCTTGCCG GACCGATTTT GATCCTGTTG ATCTTGTCGA TGATGGTGCT GCCACTGCCC
GCATTCATAC TCGACCTGTT GTTTACCTTC AATATTGCCT TGTCGATCAT GGTGTTGCTG
GTGGCGATGT TTACCCAGCG CACGCTTGAG TTTGCTGCGT TTCCGACCAT TCTGTTGTTT
ACCACGCTGT TGCGTCTGGC ACTTAACGTG GCTTCAACCC GTATCATTTT AATGGAAGGG
CATACCGGCG CGGCGGCGGC AGGGAAGGTG GTCGAAGCGT TCGGTCACTT CCTCGTTGGT
GGCAATTTCG CTATCGGTAT CGTGGTGTTT GTCATTCTCG TGATCATCAA CTTTATGGTC
ATTACCAAAG GTGCCGGGCG TATCGCAGAA GTGGGTGCGC GCTTTGTTCT CGATGGTATG
CCGGGTAAGC AGATGGCGAT TGACGCCGAC CTTAACGCCG GATTGATTGG TGAAGATGAG
GCGAAAAAAC GCCGCTCCGA AGTGACTCAG GAAGCCGATT TTTACGGCTC AATGGACGGG
GCAAGTAAGT TTGTTCGCGG CGATGCCATC GCCGGGATCC TCATCATGGT CATTAACATT
GTCGGCGGGT TGCTGGTCGG CGTGCTGCAA CATGGCATGA GCATGGGACA CGCGGCGGAA
AGTTATACGC TATTGACCAT TGGCGACGGT CTGGTGGCAC AAATTCCGGC GCTGGTGATT
TCTACCGCCG CGGGGGTCAT CGTTACGCGT GTCAGCACCG ATCAGGATGT TGGCGAGCAG
ATGGTGAATC AGCTTTTCAG TAACCCAAGC GTTATGTTGT TAAGCGCCGC CGTGCTCGGT
TTACTCGGCC TGGTGCCTGG AATGCCGAAC CTGGTATTTT TGCTGTTCAC TGCCGGATTG
CTCGGGCTGG CCTGGTGGAT ACGCGGACGC GAACAAAAAG CGCCTGCCGA ACCCAAACCG
GTAAAAATGG CAGAGAATAA TACCGTTGTC GAAGCGACGT GGAACGATGT ACAACTGGAA
GATTCTCTGG GAATGGAAGT GGGTTATCGA CTGATCCCGA TGGTCGATTT CCAGCAGGAT
GGTGAGTTGT TGGGCCGTAT ACGCAGTATC CGCAAGAAAT TTGCCCAGGA GATGGGATTT
CTGCCGCCAG TGGTGCACAT TCGCGACAAT ATGGATCTGC AACCTGCCCG CTATCGCATT
TTGATGAAAG GCGTGGAGAT TGGCAGTGGT GATGCTTATC CGGGGCGCTG GCTGGCGATT
AACCCTGGAA CCGCTGCCGG GACGTTACCT GGTGAGGCGA CCGTCGATCC GGCATTTGGC
CTGAATGCTA TCTGGATTGA AAGTGCGCTA AAAGAACAGG CGCAGATTCA GGGGTACACA
GTGGTTGAGG CCAGCACGGT GGTAGCAACG CATCTTAACC ACCTCATTAG CCAGCATGCC
GCAGAGCTGT TTGGTCGTCA GGAGGCGCAA CAGCTGTTGG ATCGCGTCGC CCAGGAGATG
CCAAAGCTGA CGGAAGATCT CGTTCCTGGC GTCGTCACGC TCACCACACT GCATAAAGTG
CTGCAAAATC TCCTCGATGA AAAAGTACCG ATTCGCGATA TGCGCACCAT TCTCGAAACG
CTGGCGGAAC ATGCGCCCAT CCAAAGCGAT CCACATGAAT TAACCGCCGT CGTGCGCGTG
GCGTTGGGAC GGGCGATTAC CCAGCAGTGG TTTCCTGGCA AAGATGAAGT CCATGTTATT
GGCCTCGATA CACCGCTGGA ACGTTTGTTA CTACAGGCGC TGCAGGGCGG GGGAGGACTG
GAGCCAGGGC TGGCGGATCG TTTACTGGCG CAAACTCAGG AAGCGCTATC CCGTCAGGAG
ATGCTGGGTG CGCCGCCAGT ATTGTTGGTG AACCACGCGC TGCGACCATT ATTGTCTCGC
TTCCTGCGCC GCAGCTTGCC GCAGTTAGTG GTCCTGTCGA ATCTGGAACT GTCTGATAAC
CGACATATCC GCATGACGGC GACAATTGGC GGCAAATAA
 
Protein sequence
MSNLAAMLRL PANLKSTQWQ ILAGPILILL ILSMMVLPLP AFILDLLFTF NIALSIMVLL 
VAMFTQRTLE FAAFPTILLF TTLLRLALNV ASTRIILMEG HTGAAAAGKV VEAFGHFLVG
GNFAIGIVVF VILVIINFMV ITKGAGRIAE VGARFVLDGM PGKQMAIDAD LNAGLIGEDE
AKKRRSEVTQ EADFYGSMDG ASKFVRGDAI AGILIMVINI VGGLLVGVLQ HGMSMGHAAE
SYTLLTIGDG LVAQIPALVI STAAGVIVTR VSTDQDVGEQ MVNQLFSNPS VMLLSAAVLG
LLGLVPGMPN LVFLLFTAGL LGLAWWIRGR EQKAPAEPKP VKMAENNTVV EATWNDVQLE
DSLGMEVGYR LIPMVDFQQD GELLGRIRSI RKKFAQEMGF LPPVVHIRDN MDLQPARYRI
LMKGVEIGSG DAYPGRWLAI NPGTAAGTLP GEATVDPAFG LNAIWIESAL KEQAQIQGYT
VVEASTVVAT HLNHLISQHA AELFGRQEAQ QLLDRVAQEM PKLTEDLVPG VVTLTTLHKV
LQNLLDEKVP IRDMRTILET LAEHAPIQSD PHELTAVVRV ALGRAITQQW FPGKDEVHVI
GLDTPLERLL LQALQGGGGL EPGLADRLLA QTQEALSRQE MLGAPPVLLV NHALRPLLSR
FLRRSLPQLV VLSNLELSDN RHIRMTATIG GK