Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3070 |
Symbol | flgE |
ID | 4784932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3265971 |
End bp | 3267233 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640091641 |
Product | flagellar basal body and hook protein |
Protein accession | YP_001022258 |
Protein GI | 124268254 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.232473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCC AGCAAGGTCT GTCCGGTCTG AACGCCGCCA GCAAGAACCT CGAGATCATC GGCAACAACG TCGCCAATGC ATCCACCTTC GGGGCCAAGA GCTCGCGCGC AGAGTTCGGC GACGTGTACG CCAACGCGCT GAACGGCTCG GGCACCAACA TGGTCGGCAT CGGCGTCAAT CTGGCCACCG TGGCCCAGCA GTTCACGCAG GGCAACATCA CCACCACCGA CAACCCGATG GACCTGGCGA TCAACGGCTC AGGCTTCTTC CAGGTGAGCG ACGGCAAGAA CCCGACCATG TACGCACGCA ACGGCCAGTT CAAGATCGAT CGCGAGGGGT TCATCGTCAA CAACTCGGGC TACAAGTTGC TGGGCTACCC GGCCGACGGC CAGGGCGTGA TCGTGCCGGG CCAGGCCCAG GCGATCCAGC TGCCGACCGC CGGCATCGCG CCGCGCGCGA CCGACCGCAT CGCGATCGAG ATGAACCTCG ACGCGCGCCA GGCCGTCACC ACGCCGGCCA CCGGCGGCAT CGACTTCGAC GATCCGGCCA CCTACAACAA CGCCACCTCG GTGACGGTCT ACGACGCCAA GGGCCAGGAC GTGGCGCTGA CCTACTACTT CCAGAAGTCC GGCGCCGACC AGTGGGACGT GTACGTGACC GCGAACGGCA CGCCGATCAG CGTCGACGGC ACTGGCGCCG CGCTGCCCAG CACCACGATG ACCTTCCCGG CCAACGGCTC GGCGCCGACG GCGCCGGTCG GTGCGGTGCC GATCAACATC CCCGCGACCA CCAACGCCGC CGGCGGCACC ACGCTGCCGA TCACCGGCAT CGAACTCGAC GTGACGAGCG CGACGCAGTA CGGCTCGGGC TTCGGCGTGA CCGACATGTC GCAGACCGGC TACGCGCCCG GCCAGCTGTC GGGCATCTCG ATCGAGGCCA ACGGCGTGAT CATGGCGCGC TACAGCAACG GCCAGTCCAA GCCGGGCGGC CAGCTCGAGC TCGCCAACTT CCGCAACCCG CAGGGCCTGC AGCCGCTGGG CAACAACGTC TGGGCCACCA CCTTCACCTC CGGCGATCCG GTGGTCGGCG CGGCCGGCGA TGGCAACTTC GGCGTGCTGC AGTCCGGCGC GCTGGAGGAA AGCAACATCG ACCTGACCGG CGAGCTGGTC AACATGATCA CCGCGCAACG CGTCTACCAG GCCAACGCGC AGACCGTGAA GACGCAGGAC TCGATGATGC AGACGTTGGT CAACCTGCGC TGA
|
Protein sequence | MSFQQGLSGL NAASKNLEII GNNVANASTF GAKSSRAEFG DVYANALNGS GTNMVGIGVN LATVAQQFTQ GNITTTDNPM DLAINGSGFF QVSDGKNPTM YARNGQFKID REGFIVNNSG YKLLGYPADG QGVIVPGQAQ AIQLPTAGIA PRATDRIAIE MNLDARQAVT TPATGGIDFD DPATYNNATS VTVYDAKGQD VALTYYFQKS GADQWDVYVT ANGTPISVDG TGAALPSTTM TFPANGSAPT APVGAVPINI PATTNAAGGT TLPITGIELD VTSATQYGSG FGVTDMSQTG YAPGQLSGIS IEANGVIMAR YSNGQSKPGG QLELANFRNP QGLQPLGNNV WATTFTSGDP VVGAAGDGNF GVLQSGALEE SNIDLTGELV NMITAQRVYQ ANAQTVKTQD SMMQTLVNLR
|
| |