Gene Mpe_A3070 details

Gene Information

Locus tag: Mpe_A3070
Symbol: flgE
ID: 4784932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3265971
End bp: 3267233
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 67%
IMG OID: 640091641
Product: flagellar basal body and hook protein
Protein accession: YP_001022258
Protein GI: 124268254
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
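
The lengths listed above follow from the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 3265971
end_bp = 3267233

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1263 bp
print(protein_length, "aa")  # 420 aa
```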


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.232473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCC AGCAAGGTCT GTCCGGTCTG AACGCCGCCA GCAAGAACCT CGAGATCATC 
GGCAACAACG TCGCCAATGC ATCCACCTTC GGGGCCAAGA GCTCGCGCGC AGAGTTCGGC
GACGTGTACG CCAACGCGCT GAACGGCTCG GGCACCAACA TGGTCGGCAT CGGCGTCAAT
CTGGCCACCG TGGCCCAGCA GTTCACGCAG GGCAACATCA CCACCACCGA CAACCCGATG
GACCTGGCGA TCAACGGCTC AGGCTTCTTC CAGGTGAGCG ACGGCAAGAA CCCGACCATG
TACGCACGCA ACGGCCAGTT CAAGATCGAT CGCGAGGGGT TCATCGTCAA CAACTCGGGC
TACAAGTTGC TGGGCTACCC GGCCGACGGC CAGGGCGTGA TCGTGCCGGG CCAGGCCCAG
GCGATCCAGC TGCCGACCGC CGGCATCGCG CCGCGCGCGA CCGACCGCAT CGCGATCGAG
ATGAACCTCG ACGCGCGCCA GGCCGTCACC ACGCCGGCCA CCGGCGGCAT CGACTTCGAC
GATCCGGCCA CCTACAACAA CGCCACCTCG GTGACGGTCT ACGACGCCAA GGGCCAGGAC
GTGGCGCTGA CCTACTACTT CCAGAAGTCC GGCGCCGACC AGTGGGACGT GTACGTGACC
GCGAACGGCA CGCCGATCAG CGTCGACGGC ACTGGCGCCG CGCTGCCCAG CACCACGATG
ACCTTCCCGG CCAACGGCTC GGCGCCGACG GCGCCGGTCG GTGCGGTGCC GATCAACATC
CCCGCGACCA CCAACGCCGC CGGCGGCACC ACGCTGCCGA TCACCGGCAT CGAACTCGAC
GTGACGAGCG CGACGCAGTA CGGCTCGGGC TTCGGCGTGA CCGACATGTC GCAGACCGGC
TACGCGCCCG GCCAGCTGTC GGGCATCTCG ATCGAGGCCA ACGGCGTGAT CATGGCGCGC
TACAGCAACG GCCAGTCCAA GCCGGGCGGC CAGCTCGAGC TCGCCAACTT CCGCAACCCG
CAGGGCCTGC AGCCGCTGGG CAACAACGTC TGGGCCACCA CCTTCACCTC CGGCGATCCG
GTGGTCGGCG CGGCCGGCGA TGGCAACTTC GGCGTGCTGC AGTCCGGCGC GCTGGAGGAA
AGCAACATCG ACCTGACCGG CGAGCTGGTC AACATGATCA CCGCGCAACG CGTCTACCAG
GCCAACGCGC AGACCGTGAA GACGCAGGAC TCGATGATGC AGACGTTGGT CAACCTGCGC
TGA
 
Protein sequence
MSFQQGLSGL NAASKNLEII GNNVANASTF GAKSSRAEFG DVYANALNGS GTNMVGIGVN 
LATVAQQFTQ GNITTTDNPM DLAINGSGFF QVSDGKNPTM YARNGQFKID REGFIVNNSG
YKLLGYPADG QGVIVPGQAQ AIQLPTAGIA PRATDRIAIE MNLDARQAVT TPATGGIDFD
DPATYNNATS VTVYDAKGQD VALTYYFQKS GADQWDVYVT ANGTPISVDG TGAALPSTTM
TFPANGSAPT APVGAVPINI PATTNAAGGT TLPITGIELD VTSATQYGSG FGVTDMSQTG
YAPGQLSGIS IEANGVIMAR YSNGQSKPGG QLELANFRNP QGLQPLGNNV WATTFTSGDP
VVGAAGDGNF GVLQSGALEE SNIDLTGELV NMITAQRVYQ ANAQTVKTQD SMMQTLVNLR
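
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). It translates the first and last blocks of the gene sequence shown above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence.
first_block = "ATGAGCTTCC AGCAAGGTCT GTCCGGTCTG AACGCCGCCA GCAAGAACCT CGAGATCATC"
# Last 60 bases followed by the TGA stop codon.
last_block = "GCCAACGCGC AGACCGTGAA GACGCAGGAC TCGATGATGC AGACGTTGGT CAACCTGCGC TGA"

print(translate(first_block))  # MSFQQGLSGLNAASKNLEII
print(translate(last_block))   # ANAQTVKTQDSMMQTLVNLR
```

Passing the full gene sequence to `translate` in the same way should yield the complete protein; a library such as Biopython would be the more usual choice for real work.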