Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3851 |
Symbol | flhA |
ID | 8139225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4434431 |
End bp | 4436509 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644871468 |
Product | flagellar biosynthesis protein FlhA |
Protein accession | YP_003023626 |
Protein GI | 253702437 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1298] Flagellar biosynthesis pathway, component FlhA |
TIGRFAM ID | [TIGR01398] flagellar biosynthesis protein FlhA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.000167765 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAACG GCGCGACCGA CGCCCTGGCA CTACCAGGAC CCAAGAAAAA TTCGGACATC TACATGGCGG TGGCCCTGAT CGGGATACTG TCCCTGATGA TCATCCCGGT CCCGGCATTC ATGCTGGACA TCTTCCTCGC GGCCAACATC ACCATCGCGC TCGTCATCCT GCTGGTCTGC CTCTACACCA TCCAACCGCT CGACTTCTCG GTGTTTCCGT CCATCCTGCT GGTCACCACG CTGTTCCGGC TGGCGCTTAA CATCGCCTCC ACCCGGTTGA TCCTTTTGCA CGGCAACGAG GGGGTCGAGG CGGCGGGAGG GGTGATCAAG GCTTTCGGGC AGTTCGTGGT CGGCGGCAAC TACGTGGTCG GCGCCGTCAT CTTCCTGATC CTCGTCATCA TCAACTTTGT CGTCATCACC AAGGGCGCCG GGCGCGTCGC CGAGGTCGCC GCCAGGTTCA CCCTGGACGC CATGCCCGGC AAGCAGATGG CCATCGACGC GGACCTCTCG AACGGTCTTC TGACCGACAA GGAGGCGAAG GAAAAGCGCA AGAAGATCGC GCGCGAGGCG GACTTCTACG GCTCGATGGA CGGCGCCAGC AAGTTCGTGC GCGGGGACGC CGTCGCCGGC ATCATGATCG TCATCGTGAA CATCGTCGGC GGCTTCATCA TCGGCGTCTG GCAAAAGGGG ATGGCGCTCG ATCAGGCGCT CACCAACTAC ACGCTCCTTA CCATCGGCGA GGGGCTCGTG GCCCAGGTCC CGGCGCTGAT CATCTCCACC GCGGCCGGCA TCATCGTTAC CCGCTCGGCA GACGAGAACA ACTTCGGCCA CGAGATCGCG GGACAGCTCC TCAATTACCC GAAGGCGTTC CAGGTGGCCT CCGGGGTTCT TTTCGTCTTC GCCCTGATCC CGGGATTGCC GCATTTCGCC TTCTTCCTCC TCTCCGGCAT CGCGTACCTG GTGAGCAAGA TGGCGGTGGA GAAAAAGGCG GAGGTCGAGG ATGTCGTCGA GACCCAGGCG GGCGCCGAGG ATCTGGACCA GATCAGCTCC ATCAGGCCTT TGGACATGCT GGAACTGGAG GTAGGCTACG GCCTGGTCCC CATGGTGGAC GCGAGCCAGC AGGGGGAACT CCTGGACCGG ATCCGCTCCA TCAGGAAGCA GGTGGCCGAC CGCATGGGGT TCATCGTTCC CCCTATCCAC ATCCACGACA ACCTGCAGCT GAAGCCTTAC GAGTACAACC TCCTGATCAA CGGCGCCAAG GTGGGAGGGG GGGAACTCTC CGGGCAGTAC CTCGCCATGG ACTCCGGCGG CGCTACCGGC CAACTGGACG GGATCAAGAC CACCGAGCCG GTATTCGGGC TCCCCGCGGT ATGGATCAAG GGGAAGGAGC GGGAGCTGGC GCAGGTCTCC GGCTACACGG TCGTGGACAA CACCACCATC CTCGCCACCC ACATCAGCGA GACCATCAAG AAGCACGCCC ACGAGCTTGT CGGGCGCCAG GAGCTGCAGC AGCTTCTGGA CAGCATCGCC GCCACGCTCC CGAAGGTGGT GGAGGAGCTG GTGCCGTCGC TCCTCTCCCT GGGGACGGTG CTGCGCGTGG TCAAGAACCT CTTGAAGGAA AACGTCTCCA TCAGGGACCT GCGCTCCATC CTGGAGACCT TGGCCGACTA CGGCGGGGTC ACCAAGGACC CGGACATGCT CACCGAGTTC GTGCGCCAGA GCCTGGGGCG CTACATCGTG GAGCAGTACA AGCGGGAGGA CGACACGCTC TGCGTCCTCA CCATGGATCG CGAGGTGGAG GAGATCATAG CCGACGCGGT GCAGCTATCG GAGCAGGGAA GCTACTTGGC CATCGAGCCG GGGGTGGCGC AGCGCATCCT GGCCGCCATC CGGAGAAACG CCGAGCAGTT CGACGCGACC GGCGTCCTGC CGGTCCTGAT GGCGTCGCCC AGCATACGGC GCCACGTGAA GAAGCTTACC GAACGTTACA TGCCCAACCT GGCGGTCATC TCGCACAACG AGATCCCGCC GAACATAAAA ATCCAATCTT TAGGCGTGGT GGTGCTCAAT GCTAGTTAA
|
Protein sequence | MANGATDALA LPGPKKNSDI YMAVALIGIL SLMIIPVPAF MLDIFLAANI TIALVILLVC LYTIQPLDFS VFPSILLVTT LFRLALNIAS TRLILLHGNE GVEAAGGVIK AFGQFVVGGN YVVGAVIFLI LVIINFVVIT KGAGRVAEVA ARFTLDAMPG KQMAIDADLS NGLLTDKEAK EKRKKIAREA DFYGSMDGAS KFVRGDAVAG IMIVIVNIVG GFIIGVWQKG MALDQALTNY TLLTIGEGLV AQVPALIIST AAGIIVTRSA DENNFGHEIA GQLLNYPKAF QVASGVLFVF ALIPGLPHFA FFLLSGIAYL VSKMAVEKKA EVEDVVETQA GAEDLDQISS IRPLDMLELE VGYGLVPMVD ASQQGELLDR IRSIRKQVAD RMGFIVPPIH IHDNLQLKPY EYNLLINGAK VGGGELSGQY LAMDSGGATG QLDGIKTTEP VFGLPAVWIK GKERELAQVS GYTVVDNTTI LATHISETIK KHAHELVGRQ ELQQLLDSIA ATLPKVVEEL VPSLLSLGTV LRVVKNLLKE NVSIRDLRSI LETLADYGGV TKDPDMLTEF VRQSLGRYIV EQYKREDDTL CVLTMDREVE EIIADAVQLS EQGSYLAIEP GVAQRILAAI RRNAEQFDAT GVLPVLMASP SIRRHVKKLT ERYMPNLAVI SHNEIPPNIK IQSLGVVVLN AS
|
| |