Gene GM21_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3851 
SymbolflhA 
ID8139225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4434431 
End bp4436509 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content63% 
IMG OID644871468 
Productflagellar biosynthesis protein FlhA 
Protein accessionYP_003023626 
Protein GI253702437 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1298] Flagellar biosynthesis pathway, component FlhA 
TIGRFAM ID[TIGR01398] flagellar biosynthesis protein FlhA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.000167765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAACG GCGCGACCGA CGCCCTGGCA CTACCAGGAC CCAAGAAAAA TTCGGACATC 
TACATGGCGG TGGCCCTGAT CGGGATACTG TCCCTGATGA TCATCCCGGT CCCGGCATTC
ATGCTGGACA TCTTCCTCGC GGCCAACATC ACCATCGCGC TCGTCATCCT GCTGGTCTGC
CTCTACACCA TCCAACCGCT CGACTTCTCG GTGTTTCCGT CCATCCTGCT GGTCACCACG
CTGTTCCGGC TGGCGCTTAA CATCGCCTCC ACCCGGTTGA TCCTTTTGCA CGGCAACGAG
GGGGTCGAGG CGGCGGGAGG GGTGATCAAG GCTTTCGGGC AGTTCGTGGT CGGCGGCAAC
TACGTGGTCG GCGCCGTCAT CTTCCTGATC CTCGTCATCA TCAACTTTGT CGTCATCACC
AAGGGCGCCG GGCGCGTCGC CGAGGTCGCC GCCAGGTTCA CCCTGGACGC CATGCCCGGC
AAGCAGATGG CCATCGACGC GGACCTCTCG AACGGTCTTC TGACCGACAA GGAGGCGAAG
GAAAAGCGCA AGAAGATCGC GCGCGAGGCG GACTTCTACG GCTCGATGGA CGGCGCCAGC
AAGTTCGTGC GCGGGGACGC CGTCGCCGGC ATCATGATCG TCATCGTGAA CATCGTCGGC
GGCTTCATCA TCGGCGTCTG GCAAAAGGGG ATGGCGCTCG ATCAGGCGCT CACCAACTAC
ACGCTCCTTA CCATCGGCGA GGGGCTCGTG GCCCAGGTCC CGGCGCTGAT CATCTCCACC
GCGGCCGGCA TCATCGTTAC CCGCTCGGCA GACGAGAACA ACTTCGGCCA CGAGATCGCG
GGACAGCTCC TCAATTACCC GAAGGCGTTC CAGGTGGCCT CCGGGGTTCT TTTCGTCTTC
GCCCTGATCC CGGGATTGCC GCATTTCGCC TTCTTCCTCC TCTCCGGCAT CGCGTACCTG
GTGAGCAAGA TGGCGGTGGA GAAAAAGGCG GAGGTCGAGG ATGTCGTCGA GACCCAGGCG
GGCGCCGAGG ATCTGGACCA GATCAGCTCC ATCAGGCCTT TGGACATGCT GGAACTGGAG
GTAGGCTACG GCCTGGTCCC CATGGTGGAC GCGAGCCAGC AGGGGGAACT CCTGGACCGG
ATCCGCTCCA TCAGGAAGCA GGTGGCCGAC CGCATGGGGT TCATCGTTCC CCCTATCCAC
ATCCACGACA ACCTGCAGCT GAAGCCTTAC GAGTACAACC TCCTGATCAA CGGCGCCAAG
GTGGGAGGGG GGGAACTCTC CGGGCAGTAC CTCGCCATGG ACTCCGGCGG CGCTACCGGC
CAACTGGACG GGATCAAGAC CACCGAGCCG GTATTCGGGC TCCCCGCGGT ATGGATCAAG
GGGAAGGAGC GGGAGCTGGC GCAGGTCTCC GGCTACACGG TCGTGGACAA CACCACCATC
CTCGCCACCC ACATCAGCGA GACCATCAAG AAGCACGCCC ACGAGCTTGT CGGGCGCCAG
GAGCTGCAGC AGCTTCTGGA CAGCATCGCC GCCACGCTCC CGAAGGTGGT GGAGGAGCTG
GTGCCGTCGC TCCTCTCCCT GGGGACGGTG CTGCGCGTGG TCAAGAACCT CTTGAAGGAA
AACGTCTCCA TCAGGGACCT GCGCTCCATC CTGGAGACCT TGGCCGACTA CGGCGGGGTC
ACCAAGGACC CGGACATGCT CACCGAGTTC GTGCGCCAGA GCCTGGGGCG CTACATCGTG
GAGCAGTACA AGCGGGAGGA CGACACGCTC TGCGTCCTCA CCATGGATCG CGAGGTGGAG
GAGATCATAG CCGACGCGGT GCAGCTATCG GAGCAGGGAA GCTACTTGGC CATCGAGCCG
GGGGTGGCGC AGCGCATCCT GGCCGCCATC CGGAGAAACG CCGAGCAGTT CGACGCGACC
GGCGTCCTGC CGGTCCTGAT GGCGTCGCCC AGCATACGGC GCCACGTGAA GAAGCTTACC
GAACGTTACA TGCCCAACCT GGCGGTCATC TCGCACAACG AGATCCCGCC GAACATAAAA
ATCCAATCTT TAGGCGTGGT GGTGCTCAAT GCTAGTTAA
 
Protein sequence
MANGATDALA LPGPKKNSDI YMAVALIGIL SLMIIPVPAF MLDIFLAANI TIALVILLVC 
LYTIQPLDFS VFPSILLVTT LFRLALNIAS TRLILLHGNE GVEAAGGVIK AFGQFVVGGN
YVVGAVIFLI LVIINFVVIT KGAGRVAEVA ARFTLDAMPG KQMAIDADLS NGLLTDKEAK
EKRKKIAREA DFYGSMDGAS KFVRGDAVAG IMIVIVNIVG GFIIGVWQKG MALDQALTNY
TLLTIGEGLV AQVPALIIST AAGIIVTRSA DENNFGHEIA GQLLNYPKAF QVASGVLFVF
ALIPGLPHFA FFLLSGIAYL VSKMAVEKKA EVEDVVETQA GAEDLDQISS IRPLDMLELE
VGYGLVPMVD ASQQGELLDR IRSIRKQVAD RMGFIVPPIH IHDNLQLKPY EYNLLINGAK
VGGGELSGQY LAMDSGGATG QLDGIKTTEP VFGLPAVWIK GKERELAQVS GYTVVDNTTI
LATHISETIK KHAHELVGRQ ELQQLLDSIA ATLPKVVEEL VPSLLSLGTV LRVVKNLLKE
NVSIRDLRSI LETLADYGGV TKDPDMLTEF VRQSLGRYIV EQYKREDDTL CVLTMDREVE
EIIADAVQLS EQGSYLAIEP GVAQRILAAI RRNAEQFDAT GVLPVLMASP SIRRHVKKLT
ERYMPNLAVI SHNEIPPNIK IQSLGVVVLN AS