Gene EcHS_A2039 details

Gene Information

Locus tag: EcHS_A2039
Symbol: fliG
ID: 5594017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2034620
End bp: 2035615
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 56%
IMG OID: 640921183
Product: flagellar motor switch protein G
Protein accession: YP_001458728
Protein GI: 157161410
COG category: [N] Cell motility
COG ID: [COG1536] Flagellar motor switch protein
TIGRFAM ID: [TIGR00207] flagellar motor switch protein FliG
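
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2034620, 2035615)
print(length)                  # 996
print(protein_length(length))  # 331
```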


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGTAACC TGACAGGCAC CGATAAAAGC GTCATCCTGC TGATGACCAT TGGCGAAGAC 
CGGGCGGCAG AGGTGTTCAA GCACCTCTCC CAGCGCGAAG TGCAAACCCT GAGCGCTGCA
ATGGCGAACG TCACGCAGAT CTCCAACAAG CAGCTAACCG ATGTGCTGGC GGAGTTTGAG
CAAGAAGCTG AACAGTTTGC CGCACTGAAT ATCAACGCCA ACGATTATCT GCGTTCGGTA
TTGGTCAAAG CTCTGGGTGA GGAACGTGCC GCCAGCCTGC TGGAAGATAT TCTCGAAACT
CGCGATACCG CCAGCGGTAT TGAAACGCTC AACTTTATGG AGCCGCAGAG TGCCGCCGAT
CTGATTCGCG ATGAGCATCC GCAAATTATC GCCACCATTC TGGTGCATCT GAAGCGCGCC
CAGGCCGCCG ATATTCTGGC GTTGTTCGAT GAACGTCTGC GCCACGACGT GATGTTGCGT
ATCGCCACCT TTGGCGGCGT GCAGCCAGCC GCGCTGGCGG AACTGACCGA AGTACTGAAT
GGCTTGCTCG ACGGTCAGAA TCTCAAGCGC AGCAAAATGG GCGGCGTGAG AACGGCAGCC
GAAATTATCA ACCTGATGAA AACTCAGCAG GAAGAAGCCG TTATTACCGC CGTGCGTGAA
TTCGACGGCG AACTGGCGCA AAAAATTATC GACGAGATGT TCCTGTTCGA GAATCTGGTG
GATGTCGACG ATCGCAGCAT TCAGCGTCTG TTGCAGGAAG TGGATTCCGA ATCGTTGCTG
ATCGCGCTGA AAGGGGCCGA GCAGCCACTG CGCGAGAAAT TCCTGCGCAA TATGTCGCAA
CGTGCCGCCG ATATTCTGCG CGACGATCTC GCCAACCGTG GTCCGGTGCG TCTGTCGCAG
GTGGAAAACG AACAGAAAGC GATTCTGCTG ATTGTGCGCC GTCTTGCCGA AACCGGCGAG
ATGGTGATTG GCAGCGGCGA GGATACCTAT GTCTGA
 
Protein sequence
MSNLTGTDKS VILLMTIGED RAAEVFKHLS QREVQTLSAA MANVTQISNK QLTDVLAEFE 
QEAEQFAALN INANDYLRSV LVKALGEERA ASLLEDILET RDTASGIETL NFMEPQSAAD
LIRDEHPQII ATILVHLKRA QAADILALFD ERLRHDVMLR IATFGGVQPA ALAELTEVLN
GLLDGQNLKR SKMGGVRTAA EIINLMKTQQ EEAVITAVRE FDGELAQKII DEMFLFENLV
DVDDRSIQRL LQEVDSESLL IALKGAEQPL REKFLRNMSQ RAADILRDDL ANRGPVRLSQ
VENEQKAILL IVRRLAETGE MVIGSGEDTY V
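
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to build this record. It relies on translation table 11 assigning amino acids to codons exactly as the standard genetic code does (the two differ in which codons may act as start codons, which is not handled here). The helper names `clean`, `gc_content` and `translate` are illustrative, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGAGTAACC TGACAGGCAC CGATAAAAGC GTCATCCTGC TGATGACCAT TGGCGAAGAC"
print(translate(first_row))          # MSNLTGTDKSVILLMTIGED
print(round(gc_content(first_row)))  # GC percentage of this row only
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison against the Protein sequence and GC content fields of the record.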