Gene Mpe_A3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3079 
SymbolflhF 
ID4786652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3274113 
End bp3275678 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content71% 
IMG OID640091650 
Productflagellar GTP-binding protein 
Protein accessionYP_001022267 
Protein GI124268263 
COG category[N] Cell motility 
COG ID[COG1419] Flagellar GTP-binding protein 
TIGRFAM ID[TIGR03499] flagellar biosynthetic protein FlhF 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.783936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCC AACGCTTCAC AGGCCGTACC TCGCGCGACG CGATGACCAA GATGCGCCAG 
GCCCTCGGCG ACGACGCCGT GGTGCTGTCG ACCAAGCCCT GCCCCGAAGG CATCGAGATG
CTGGCGATGG CGCCGGGCGC GCTGGCCGCG GTCGAGCGCC AGGCCGCCGT GCAGCAGCAG
GCCGCCGCCG CGCAGCCGGC GCCCGCCAAG CCCGCGAAGC CGGTCAAGGC CGCCAAGGGT
GGCCGTTCCG CGACGGCCGA CGATGTGCAG CAGGACGTGG AACAACTGTC GATGAGCACG
CTGTCCTTCC AGGACTACGT GCGCGACCGC ATGCTGAAGA AGCGCGAGGC CGCGTTGCGC
GGCGAAGCGC TGCGCGAGGA ACGCACCGAG CCGACCCTCA ACGTCGCCCC GATGGCGCCG
GCCCGTGCCG CCGCCCAGGC GACGCTGTAC GCCCAGGATC TGGACGACGA CGATGCGGTC
GACGCGGCGG TCGATGCCGT GCAGCACCGG ATTGCCAGCC AGCAGGCCGC CACCATGCCG
GTGCTGCGCG AGCAGGTGGT CTACGGCCGC CAGACACAGT CCGCCACCGC CAGCCCGCCG
ATCCTGCGCG AGGCCCAGGC CGATGCCACG ATGCTGAGCG AGCTGCGCTC GATGAAGGGG
CTGATCGAAG AGCGCTTCGG CGCGCTGGCC TTCATGGAGA AGCTGCAGCG CGAGCCCGCT
CAGGCCAAGC TGACCCAGAA GCTGCTCGAG TGCGGCTTCT CGCCGGTGCT GATCCGCAAG
CTGGTGGCTG GCATGACGGC CGACGTCGGC GACGAGCAGG CCTGGGCCGC CAGCGTGCTG
GAGCGCAACC TGATGACCGG CGAGCGCGAG CTGCCGATCG AGGACCAGGG CGGCGTGTTC
GCGATGATCG GCGCCACCGG CGTCGGCAAG ACCACCTCGA CCGCCAAGCT GGCCGCCGCC
TTCGCCACCC GGCATGGCGC GTCCAACCTC GGCCTGATCA CCCTCGACGC CTACCGTCTC
GGTGCCCACG AACAGCTGCG CGCCTACGGC CGCATCCTCG GCGTGCCGGT GCACACCGCC
CACGACCGCA CCGCGCTGGA AGACCTGCTG GAGCTGCTGT CGGCCAAGAA GATGGTGCTG
ATCGACACCG CCGGCGTCGC CCAGCGCGAC ACCCGCACGC GCGAACTGCT CGACATGCTG
GCGCACCCTT CGATCAACAA GCTGCTGGTG GTCAACACCG CCGTGCAGGG CGAGACCATC
GACGACGTGA TGACCTCCTA CCGTGCCGCC GCCTGCAAGG GCATCGTGCT GTCCAAGCTC
GACGAGGCCG TGAAGCTGGC GCCGGCACTC GACGCCGTGA TCCGCCACAA GCAGAAGATC
GTCGCGGTGG CCAACGGCCA GCGAGTGCCC GAGGACTGGC ATCGCCTGTC GGGTCAGGCC
CTGGTGCACC GCGCGCTGCG CGCCACCGGC AGCCCGGCCT ACAACTTCGA CGCGAGCGAG
ATGAACCTGG TGTTCGCCAC GCCGCAGATG ACGGAGCGCC GCCCCGTACC GGCCGGCCGC
GCCTGA
 
Protein sequence
MNVQRFTGRT SRDAMTKMRQ ALGDDAVVLS TKPCPEGIEM LAMAPGALAA VERQAAVQQQ 
AAAAQPAPAK PAKPVKAAKG GRSATADDVQ QDVEQLSMST LSFQDYVRDR MLKKREAALR
GEALREERTE PTLNVAPMAP ARAAAQATLY AQDLDDDDAV DAAVDAVQHR IASQQAATMP
VLREQVVYGR QTQSATASPP ILREAQADAT MLSELRSMKG LIEERFGALA FMEKLQREPA
QAKLTQKLLE CGFSPVLIRK LVAGMTADVG DEQAWAASVL ERNLMTGERE LPIEDQGGVF
AMIGATGVGK TTSTAKLAAA FATRHGASNL GLITLDAYRL GAHEQLRAYG RILGVPVHTA
HDRTALEDLL ELLSAKKMVL IDTAGVAQRD TRTRELLDML AHPSINKLLV VNTAVQGETI
DDVMTSYRAA ACKGIVLSKL DEAVKLAPAL DAVIRHKQKI VAVANGQRVP EDWHRLSGQA
LVHRALRATG SPAYNFDASE MNLVFATPQM TERRPVPAGR A