Gene Mpe_A3078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3078 
SymbolflhA 
ID4786651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3271973 
End bp3274075 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content68% 
IMG OID640091649 
Productflagellar biosynthesis protein 
Protein accessionYP_001022266 
Protein GI124268262 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1298] Flagellar biosynthesis pathway, component FlhA 
TIGRFAM ID[TIGR01398] flagellar biosynthesis protein FlhA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.984078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.823558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGA TGCTCTCCCA AGTCCAGGCC TGGCTGGGTC CCAACGCCCG CATGATCCGC 
AGCCTGGCAG TGCCGCTGCT GGTGCTGATG GTGCTGGCGA TGATGGTGCT GCCGCTGCCG
CCGCTGGCGC TGGACCTGCT GTTCACCTTC AACATCGCGA TCGCGCTGAT GGTGATGATG
GTGTCGGCCT ACATGGTCAA GCCGCTGGAC TTCTCGGCCT TCCCGGCGGT GATCCTGCTG
ACCACGCTGC TGCGGCTGTC GCTGAACGTG GCCTCGTCGC GCGTGGTGCT GATGGAAGGC
CACACCGGCC CGGGCGCCGC CGGCGCGGTG ATCGAGAGCT TCGGCCACTT CCTGATCGGC
GGCAACTTCG CGGTCGGCAT CATCGTGTTC GCGATCCTGG TGGTCATCAA CTTCATCGTG
GTGACCAAGG GCGCCGAGCG CATCGCCGAG GTCGGCGCGC GCTTCACCCT GGACGCGATG
CCCGGCAAGC AAATGGCGAT CGACGCCGAC CTGAACGCCG GCCTGATCGA CGAGAAGGAG
GCCAAGCGCC GTCGCGCCGA GGTGGGCAAC GAGGCGGAGT TCTTCGGCTC CATGGACGGT
GCCAGCAAGT TCGTGCGCGG CGACGCGATC GCCGGCATGC TGATCCTGTT CATCAACATC
GTGGGCGGCT TCATCATCGG CGTGGTCCAG CACGACCTCA GCGCCGGCAA GGCCGCCGAC
AGCTACATCC TGCTGGCCAT CGGCGACGCG CTGGTGGCGC AGATCCCGGC CCTGCTGATC
TCGGTGGCGG CCGCCATGGT GGTGTCGCGC GTCGGCAAGG AGCAGGACGT CGGCGAACAG
ATCATGGGCC AGATGTTCAA GACGCCCAAG TCGGTGGGCA TCGTTGCCGG CGTGATCGGG
CTGCTGGGCG TGATCCCGGG CATGCCGCAC TTCGTGTTCC TGCTGATCGC GAGCGCGCTC
GGCTACGTCG CATGGCTGAT GCACCAGCGC GAGCAGCGCG CCAAGCTGGC GCCCCCCAGG
CCGGCCGCCG CGGCCGGCCC GGCCCCCGAC GCCGAGGCCA GCTGGGACGA CCTGCAGCCG
GTCGACACGC TGGGCCTGGA AGTGGGCTAC CGCCTCATCG CGCTGGTGGA CAAGGAACGC
CAGGGCGACC TGATGACGCG CATCAAGGGC GTGCGCCGCA AGTTCGCGCA GGACGTGGGC
TTCCTGCCGC CGTCGGTGCA CATCCGCGAC AACCTCGAGC TGCGCCCCAG CATGTACCGG
CTCACGCTGC GCGGCGCCGT GATCGGCGAG GGCGAAGCCT TCCCCGGCAT GCTGATGGCC
ATCAACCCCG GCCACGCCAG CACGCCGTTG ATCGGTACCA CCACCACCGA CCCGGCCTTC
GGCCTGCCCG CCACGTGGAT CGAGGAGCGC CAGCGCGAAG CGGCGCAAAT GGCCGGCTAT
ACGGTCGTTG ATTGCTCGAC CGTGGTGGCG ACCCACCTCT CACACTTGAT GCAAGTGAAT
GCAGCGCGCC TGCTGGGCCG CGTGGAAACG CAGCAGCTGG TCGAGCATGT GACGAAGTTG
GCACCCAAAC TCATCGAAGA CGTGGTCCCG AAGATGGTCC CCATCGCCGC CCTCCAGAAA
CTGCTCCAGC TGCTGCTGGA GGAAGGCGTG CACATCCGCG ACATGCGCTC GGTGGTGGAA
GCGCTGGCCG AGCACGTCGG CGCCAACCCC AACCTCGCCA ACGACCCGCA GGAGCTGTCG
CGCCGTATCC GCGTGGCACT GGCGCCTGCG ATCGTGCAGC AGATCTACGG CCCGGTGCGC
GAGCTCGAGG TGATCGCCAT CGAGCCCGAC CTCGAGCGCC TGCTGTCGCA GGCCCTGACC
TCGCAGAACG GCCCGGTGCT GGACCCCAGC ATGGCCGACA TCCTGACCCG TTCCGCCGCC
GACTCCGCCA AGCGCCAGGA GGACCTGGGC CACCCGGCCT GCCTGCTGGT GCCCGATGCG
ATCCGCGTGC CCATCGCCCG TTTGCTCAAG CGCGCCGCGC CGCGGCTGCG TGTGCTCTCG
CACAGCGAGA TCCCAGACAC CCATTCGATC CGCATCGGCT CGATCATCGG CGCCGCCGCT
TGA
 
Protein sequence
MNPMLSQVQA WLGPNARMIR SLAVPLLVLM VLAMMVLPLP PLALDLLFTF NIAIALMVMM 
VSAYMVKPLD FSAFPAVILL TTLLRLSLNV ASSRVVLMEG HTGPGAAGAV IESFGHFLIG
GNFAVGIIVF AILVVINFIV VTKGAERIAE VGARFTLDAM PGKQMAIDAD LNAGLIDEKE
AKRRRAEVGN EAEFFGSMDG ASKFVRGDAI AGMLILFINI VGGFIIGVVQ HDLSAGKAAD
SYILLAIGDA LVAQIPALLI SVAAAMVVSR VGKEQDVGEQ IMGQMFKTPK SVGIVAGVIG
LLGVIPGMPH FVFLLIASAL GYVAWLMHQR EQRAKLAPPR PAAAAGPAPD AEASWDDLQP
VDTLGLEVGY RLIALVDKER QGDLMTRIKG VRRKFAQDVG FLPPSVHIRD NLELRPSMYR
LTLRGAVIGE GEAFPGMLMA INPGHASTPL IGTTTTDPAF GLPATWIEER QREAAQMAGY
TVVDCSTVVA THLSHLMQVN AARLLGRVET QQLVEHVTKL APKLIEDVVP KMVPIAALQK
LLQLLLEEGV HIRDMRSVVE ALAEHVGANP NLANDPQELS RRIRVALAPA IVQQIYGPVR
ELEVIAIEPD LERLLSQALT SQNGPVLDPS MADILTRSAA DSAKRQEDLG HPACLLVPDA
IRVPIARLLK RAAPRLRVLS HSEIPDTHSI RIGSIIGAAA