Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2048 |
Symbol | flgI |
ID | 6143125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2069960 |
End bp | 2071057 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616924 |
Product | flagellar basal body P-ring protein |
Protein accession | YP_001744100 |
Protein GI | 170681975 |
COG category | [N] Cell motility |
COG ID | [COG1706] Flagellar basal-body P-ring protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.159552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.112526 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTAAAT TTCTCTCTGC ATTAATTCTT CTACTGGTTA CGACGGCGGT TCAGGCTGAG CGTATTCGCG ATCTCACCAG TGTTCAGGGG GTAAGGCAAA ACTCACTGAT TGGCTATGGC CTGGTGGTGG GGCTGGATGG CACCGGTGAC CAGACAACTC AGACGCCGTT TACCACACAA ACGCTTAATA ACATGCTCTC ACAGCTGGGA ATTACCGTTC CGACGGGCAC CAATATGCAG CTAAAAAACG TCGCTGCGGT AATGGTGACG GCGTCACTTC CACCGTTTGG ACGCCAGGGG CAAACCATTG ACGTGGTGGT TTCTTCCATG GGAAATGCCA AAAGTCTGCG TGGCGGTACG TTATTGATGA CACCGCTTAA GGGCGTTGAC AGTCAGGTGT ATGCGCTGGC ACAGGGCAAT ATTCTGGTTG GCGGCGCAGG GGCCTCCGCT GGCGGTAGCA GTGTTCAGGT GAACCAACTG AACGGTGGAC GGATCACCAA TGGTGCGGTT ATTGAACGTG AATTGCCCAG CCAGTTTGGC GTCGGGAATA CCCTTAATTT GCAACTTAAC GACGAAGATT TCAGTATGGC GCAGCAAATC GCTGACACCA TCAACCGCGT GCGTGGATAT GGCAGCGCCA CCGCGTTAGA TGCGCGGACT ATTCAGGTGC GCGTACCGAG TGGCAACAGT TCCCAGGTCC GTTTCCTTGC CGATATCCAG AATATGCAGG TTAATGTTAC CCCGCAGGAC GCTAAAGTAG TGATTAACTC GCGCACTGGT TCGGTGGTGA TGAATCGCGA AGTGACCCTC GACAGCTGCG CGGTAGCGCA GGGGAATCTC TCGGTTACGG TAAATCGGCA GGCCAATGTC AGCCAGCCAG ATACACCGTT TGGTGGCGGG CAGACTGTGG TTACTCCACA AACGCAGATC GATTTACGCC AGAGCGGCGG TTCGCTGCAA AGCGTACGTT CCAGCGCCAG CCTCAATAAC GTGGTGCGTG CGCTCAATGC GCTGGGCGCT ACGCCGATGG ATCTGATGTC CATACTGCAA TCAATGCAAA GTGCGGGATG TCTGCGGGCA AAACTGGAAA TCATCTGA
|
Protein sequence | MIKFLSALIL LLVTTAVQAE RIRDLTSVQG VRQNSLIGYG LVVGLDGTGD QTTQTPFTTQ TLNNMLSQLG ITVPTGTNMQ LKNVAAVMVT ASLPPFGRQG QTIDVVVSSM GNAKSLRGGT LLMTPLKGVD SQVYALAQGN ILVGGAGASA GGSSVQVNQL NGGRITNGAV IERELPSQFG VGNTLNLQLN DEDFSMAQQI ADTINRVRGY GSATALDART IQVRVPSGNS SQVRFLADIQ NMQVNVTPQD AKVVINSRTG SVVMNREVTL DSCAVAQGNL SVTVNRQANV SQPDTPFGGG QTVVTPQTQI DLRQSGGSLQ SVRSSASLNN VVRALNALGA TPMDLMSILQ SMQSAGCLRA KLEII
|
| |