Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2899 |
Symbol | fliF |
ID | 7873801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3140687 |
End bp | 3142402 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699820 |
Product | flagellar MS-ring protein |
Protein accession | YP_002889875 |
Protein GI | 237653561 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein |
TIGRFAM ID | [TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00622056 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCAG CAGAAAACAC CGCCGACCTG CCTCCGGGCG CCGCCCCCGC CGATGCGCAG GCGACGACGC GGATGCAGCA GGCGCTGCAG AACCTCGCCG CGCTCAGCAC GCGCCAGAAG ATCGCCGGCG CGGTCGCCAT CGCGCTGGCG ATCGCGCTCG TGGCCGGCTC CCTGCTGTGG AACCGCGCCC CGGAGTACGC GGTGCTGTTC TCCAACCTCG ACGAGCGCGA CGGCGGACAG ATCATCGCCG AGCTGCAGCA GCGCAACATC CCCTACAGGA TGTCGCCCAC CGGCCACGCC ATCCTGGTGC CGCAGGCGCA GGTGCATGAG ACGCGGCTGC GTCTGGCGGC CGACGGCCTG CCCAAGGGCA GCCTCGCCGG CTTCGAGCTG ATGGACGGCC AGAAGCTCGG CATCTCGCAG TTCAACGAGC AGGTGAACTA CCAGCGCGCG CTCGAAGGCG AGCTGACCCG CACCGTGCAG GCGATCGACG CGGTGGCCAG CGCCCGGGTG CACCTGGCGA TGCCCAAGCA GACCGCCTTC CTGCGCGACG ACCAGCGCCC CACCGCCTCG GTGATGGTGA ACCTGCGCGG CGGCCGCATC CTCTCGCCCG ACCAGGTCGC CGGCATCGTG CACCTGGTGT CGTCGAGCGT GCCACGCATG CACGCCGAGG GCGTGAAGAT CGTGGACCAG AACGGCAAGC TGCTCACCGA GCAGGCCGAC CCGCTGCTGC GCGCCGGGCT CGACGCCACC CAGCTCGAAT ACGTCCGCCT GCTCGAGCAG GGCTTCATCG AGCGCATCGA CAAGATCCTC GCGCCCCTGG TGGGCAAGGG CAACTACGGC GCCCAGGTGG CGGCCGACGT GGACTTCAAC CAGGTCGAGC AGACCGCCGA GACCTACAAG CCCAACCCCA CACCCGACCA GGCGATCCGC AGCCAGCAGA CCAGCGAGGC CTTCAACCCG CAGCCGGGCG CGCAGGGCGT GCCCGGCGCG CTCACCAACC AGCCCCCGGT GCCGGCCACC GCGCCGATCA CCAACCCGCA GGTGGCCGGC GGCGGCGGCG CGCAGGGCCT GGCGAGCGGC AACCGCAGCG CGGTGCTCAA CTACGAGCTC GACCGCAACA TCCAGCACGT CAAGCAGGCG GTCGGGCAGA TCAAGCGCCT GTCGGTCGCG GTGGTGGTGA ATAATCGCAC CCTCCCCGGG CCCGACGGCA CGCCCACCAA CGTGCCGCTG CCCGACGAGG AGATCGCGCG CATCACCAAC CTGGTGCGCG AGGCGGTGGG CTACAACGCC GACCGCGGCG ACACCATCAA CGTCGCCAGC GGCGCCTTCG CCGACGACGG CAGCGGAGCC GCACCGCCGC CGTGGAAGGA TCCGGAGATC GTCGCGCTCG GCAAGGAAGG CCTGAACTGG CTGCTGGTTC TGATCGCGAT CCTGTTCGCC TACTTTGGCG TGATCCGCCC GCTGCTGCGC ACCGTGGTGC CGCCCAAGCC GAAGGAAGAG AAGAAGGGCG CCGCCGCGGG CGAGGAAGGC GGGGAAGGCG AAGAGGGCGA GGAAGGCGTG CGCGTCACCC TGTCCGGCGA GACCGGCGAG GGTGAAGAGA CGGAGACCTT CGAGCAGCGC CTCGAGCGTG CCCGCGCCGC GGCGCGCAAC GATCCCAAGA TGGTCGCCAA CCTGATCAAG GACTGGATGG GCATGAACGA GGAGGCGCGC AAGTGA
|
Protein sequence | MAAAENTADL PPGAAPADAQ ATTRMQQALQ NLAALSTRQK IAGAVAIALA IALVAGSLLW NRAPEYAVLF SNLDERDGGQ IIAELQQRNI PYRMSPTGHA ILVPQAQVHE TRLRLAADGL PKGSLAGFEL MDGQKLGISQ FNEQVNYQRA LEGELTRTVQ AIDAVASARV HLAMPKQTAF LRDDQRPTAS VMVNLRGGRI LSPDQVAGIV HLVSSSVPRM HAEGVKIVDQ NGKLLTEQAD PLLRAGLDAT QLEYVRLLEQ GFIERIDKIL APLVGKGNYG AQVAADVDFN QVEQTAETYK PNPTPDQAIR SQQTSEAFNP QPGAQGVPGA LTNQPPVPAT APITNPQVAG GGGAQGLASG NRSAVLNYEL DRNIQHVKQA VGQIKRLSVA VVVNNRTLPG PDGTPTNVPL PDEEIARITN LVREAVGYNA DRGDTINVAS GAFADDGSGA APPPWKDPEI VALGKEGLNW LLVLIAILFA YFGVIRPLLR TVVPPKPKEE KKGAAAGEEG GEGEEGEEGV RVTLSGETGE GEETETFEQR LERARAAARN DPKMVANLIK DWMGMNEEAR K
|
| |