Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1201 |
Symbol | flgG |
ID | 5595412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1198409 |
End bp | 1199191 |
Gene Length | 783 bp |
Protein Length | 260 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640920360 |
Product | flagellar basal body rod protein FlgG |
Protein accession | YP_001457923 |
Protein GI | 157160605 |
COG category | [N] Cell motility |
COG ID | [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR01396] flagellar basal-body rod protein FlgB [TIGR02488] flagellar basal-body rod protein FlgG, Gram-negative bacteria [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGTT CATTATGGAT CGCCAAAACG GGCCTTGACG CCCAGCAAAC CAATATGGAC GTCATTGCCA ACAACCTGGC AAACGTCAGT ACTAACGGTT ATAAGCGTCA GCGCGCGGTG TTTGAAGATC TGCTTTATCA AACCATTCGC CAGCCGGGGG CACAGTCTTC CGAACAAACC ACCTTACCCT CCGGATTACA AATCGGCACG GGGGTACGCC CGGTCGCCAC TGAACGCTTA CACAGCCAGG GAAACCTGTC GCAGACCAAC AACAGCAAAG ATGTCGCGAT TAAAGGGCAG GGCTTTTTCC AGGTGATGTT GCCAGATGGC TCATCAGCCT ATACCCGTGA CGGCTCTTTC CAGGTGGATC AGAACGGGCA GCTGGTGACG GCTGGTGGTT TTCAGGTACA GCCAGCGATC ACCATTCCGG CGAATGCGTT AAGTATCACC ATCGGTCGTG ATGGCGTGGT CAGCGTAACC CAACAAGGCC AGGCAGCTCC GGTTCAGGTT GGGCAGCTCA ATCTCACCAC CTTTATGAAT GATACCGGGC TGGAGAGCAT TGGCGAAAAC CTCTACACCG AAACGCAATC CTCCGGTGCA CCGAACGAAA GCACGCCGGG CCTGAACGGC GCGGGACTGC TGTATCAAGG GTATGTTGAA ACGTCTAACG TCAACGTGGC GGAAGAACTG GTCAATATGA TTCAGGTGCA ACGCGCTTAC GAAATCAACA GTAAAGCGGT GTCCACCACC GATCAGATGC TGCAAAAACT GACGCAACTC TAA
|
Protein sequence | MISSLWIAKT GLDAQQTNMD VIANNLANVS TNGYKRQRAV FEDLLYQTIR QPGAQSSEQT TLPSGLQIGT GVRPVATERL HSQGNLSQTN NSKDVAIKGQ GFFQVMLPDG SSAYTRDGSF QVDQNGQLVT AGGFQVQPAI TIPANALSIT IGRDGVVSVT QQGQAAPVQV GQLNLTTFMN DTGLESIGEN LYTETQSSGA PNESTPGLNG AGLLYQGYVE TSNVNVAEEL VNMIQVQRAY EINSKAVSTT DQMLQKLTQL
|
| |