Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0982 |
Symbol | |
ID | 4269333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1117577 |
End bp | 1119697 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125733 |
Product | flagellar biosynthesis protein FlhA |
Protein accession | YP_741825 |
Protein GI | 114320142 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1298] Flagellar biosynthesis pathway, component FlhA |
TIGRFAM ID | [TIGR01398] flagellar biosynthesis protein FlhA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.121658 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA CCCTGATGGA CAATCTGCGC AGTGTCGGCC CGCTGGGCCT GGGTACGCCC CTGCTGGTGG TGGCCCTGCT GATGATGCTG GTGGTGCCGC TGCCGCCCAT CGGGCTGGAT CTGCTGTTCA CCTTCAACAT CGTGCTCTCG CTGTCCATCC TGCTGGTCAC TGTCTACGCC ATGCGGCCGC TGGACTTCGC GGTCTTCCCC ACCATCCTGC TCATCGCCAC CCTGCTGCGC CTGGCGCTGA ACGTGGCCTC CACCCGGATC ATCCTGCTCG ACGGCCATAC CGGTACCGCA GCCGCCGGGC GGGTGATCGA GTCCTTCGGC GAGTTCGTCA TCGGTGGCAA CTACGCCGTC GGCCTGGTGG TCTTCACCAT CCTGATCATC ATCAACTTCG TGGTGGTCAC CAAGGGGGCC GGCCGGGTGG CCGAGGTCTC CGCCCGCTTT ACCCTGGATG CGCTGCCTGG CAAGCAGATG GCCATCGACG CCGATCTGAA CGCCGGCATT ATCGACCAGA AGCAGGCCAA GGAGCGGCGC GAGGAGGTGG CCCGCGAGGC CGACTTCTAC GGCTCCATGG ATGGCGCGAG CAAGTTCGTC CGCGGCGATG CCATCGCCGG CATCCTGATC CTGGTCATCA ATCTGGTCGG CGGGCTGTCC ATCGGCATGC TGCAGCACGA CATGATCTTT GGCGATGCCG TGCGCAACTA TTCGCTGCTA ACCATCGGCG ACGGACTGGT GGCGCAGATC CCCTCGCTGG TGCTCTCCAC CGCCACCGCC ATCATCGTCA CCCGGGTAGC GGACAGCCAG AAGATGAGCG ACGCGGTCAC CGGGCAACTG TTCAGCAACC CGCGGGTGCT CTGGGTGGTG GGCGGTGTCA TCTTCTTCCT GGGGCTGATC CCGGGCATGC CGCTGTTGCC CTTCGCCTTT TTCGGCACCG TCTGCCTCAC CGCCGCCTAC CTGATTACCC AGCGCCAGAA GGAGGTCGCC GAGGCCGAGG CGCAGGCCGG TGCCAAGGCC AGCGGTGAGC AGGCCGAGGC CAAGTCGCCG GAGCAGCGCG AGCTCAGCTG GGACGACGTA CCGCCGGTGG ACGTGGTGGG CCTGGAGGTG GGTTACGGCC TGATCCCGAT GGTGGACCGC AACCAGGGCG GCCAGTTGCT CACCCGGATC AAGGGGGTGC GCAAGAAGCT CTCGCAGGAC CTCGGCTTTC TCATCCAGCC GGTGCACATC CGCGACAACC TGGACCTGGG CCCCAACACC TACCGGATCT CCATCAAGGG GGTGCCGGTG GCCGAGGCAG AGGTGCAGCC CGGGCGCGAA CTGGCCATCA ACCCCGGCAG CGTTCAGGGC CAGGTCAAGG GCATCCCCAC CAAGGACCCG GCCTTCGGCC TGGAGGCGGT CTGGATCGAG CCCTCGCAGC GGGATTACGC CCAGACCCTC GGCTACACGG TGGTGGACAC CAGCACGGTG ATCGCCACCC ACCTGTCGCA ACTGGTGCAG CAGAACGCCC ACCGCCTGCT GGGGCACGAG GAGGTGCAGA AGCTGCTGGA TATCCTCGCC CAGAGCCAGC CCAAGCTGGT GGAGGAGGTG GTGCCCAACA CCCTGCCGTT GTCGGTGGTG GTGAAGGTGC TGCAGAACCT GCTGGAGGAG CGCATCCCGG TGCGCGACAT GCGCAGCATT GTCGAGATCC TGGCCGAGTA CGGGGCGCGC AGCAAGGACC CGGCCCAGCT CACCCAGGCG GCGCGCGAGG CACTGGGCCG CCAGATCGTG CAGAACATCG CCGGCATGAC CCGGGAGCTG CCGGTGATGA CCCTGGATCC GCAACTGGAA CAGATGTTGC TAAATGCAAT ACAGGGCGGC GGGGAAACGT CGGGCGGCTT CGAGCCGGGC CTGGCCGAGC GCCTCCAGCA GTCGTTGGCG GACACCACCC GCAAGCAGGA GATGGCGGGT CAGCCGGCGG TCCTGCTCAC CTCGCCGCAA CTGAGGAGTT GGCTGCAGCG GCTTATCCGC CACAGCGTGC CCAGCCTGCA CGTGCTCTCG TACAACGAGA TCCCGGATGA CCGCCAGGTT CGCATCGTAG CCACCGTAGG CCAGCAACAG CAGGTCGGCG GCGGCCAATA A
|
Protein sequence | MNATLMDNLR SVGPLGLGTP LLVVALLMML VVPLPPIGLD LLFTFNIVLS LSILLVTVYA MRPLDFAVFP TILLIATLLR LALNVASTRI ILLDGHTGTA AAGRVIESFG EFVIGGNYAV GLVVFTILII INFVVVTKGA GRVAEVSARF TLDALPGKQM AIDADLNAGI IDQKQAKERR EEVAREADFY GSMDGASKFV RGDAIAGILI LVINLVGGLS IGMLQHDMIF GDAVRNYSLL TIGDGLVAQI PSLVLSTATA IIVTRVADSQ KMSDAVTGQL FSNPRVLWVV GGVIFFLGLI PGMPLLPFAF FGTVCLTAAY LITQRQKEVA EAEAQAGAKA SGEQAEAKSP EQRELSWDDV PPVDVVGLEV GYGLIPMVDR NQGGQLLTRI KGVRKKLSQD LGFLIQPVHI RDNLDLGPNT YRISIKGVPV AEAEVQPGRE LAINPGSVQG QVKGIPTKDP AFGLEAVWIE PSQRDYAQTL GYTVVDTSTV IATHLSQLVQ QNAHRLLGHE EVQKLLDILA QSQPKLVEEV VPNTLPLSVV VKVLQNLLEE RIPVRDMRSI VEILAEYGAR SKDPAQLTQA AREALGRQIV QNIAGMTREL PVMTLDPQLE QMLLNAIQGG GETSGGFEPG LAERLQQSLA DTTRKQEMAG QPAVLLTSPQ LRSWLQRLIR HSVPSLHVLS YNEIPDDRQV RIVATVGQQQ QVGGGQ
|
| |