Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0703 |
Symbol | |
ID | 4268092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 784745 |
End bp | 786178 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125452 |
Product | flagellar hook-associated 2 domain-containing protein |
Protein accession | YP_741547 |
Protein GI | 114319864 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0307046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.26193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGTA TCAGTTCCCT GGGTGTGGGG TCGGGTCTGG ATATCCGCGA TCTGGTCGAC CAGCTGGTGG CGGCCGAGCG CCAGCCCGGT CAGAACCGGC TGGACCGCCA GGAGTCGCGG CTGGAGGCCC AGATCTCCGG GCTGGGCCGG CTGCAGCAGG CCATCACCGA CTTCGGCGGT GCCCTAGGCA ACGTCTCCGC TGAGAGCGAC TTCCGCAGCG TGGTGGCCAC CAGCAACAAC GAGTCGGCGG TCAGCGTCTC CGCCGGCCGC GACGCCCCAC CCGGCAGCTA CGACGTCAAC GTCACTCAAC TGGCCCAGGC CCAGCGGCTC GCCACCAATT CGGACCTGTT CGAGGACGTG GAGGACTTCT CGGCGGGCAC CACCTCGCTG GGCACCGGCA GCTTCACTAT CGAGTACCAG GACGGCAGCG CCGAGACCTT TCAACTGGAG GAAGGGGCTG ACACCCTGCA GGACGTGCGC GCCGCCATCA ACAACCAGAG CGAGAACGTG CGTGCCTCGG TGGTGGACGA CGGCGAGGGC CCGCGGCTGG TGATCACCAG CCGGGAGACC GGCGACCAGA ACGCCGTCTC GGCGATCACC GTCGATCCCG ACGACCCGGA CAGCGACCCG CTGCTGGAGC GGCTGGCATT CGATGCTGAG AACCTGGCCG AGCCGGACGA GGATGGTGTG CGCGCCGGCG ACAACTTTTC CCAGTTGCGC GCTGCGCAGG ACGCCGAACT GTTCGTGGAC GGACTGCGAA TCACCCGGCC CGGCAATGAG ATCAGCGGCG TGATCGATGG CGTCACCCTC ACGCTCAGCG CGGTGGACAG CGCCCGGATC AACGTGGCGG AGGAGCCCGG GTCGGCCGCC TCCGCGGTGA GCGACTTTGT CGGGGCCTAC AACAGCCTGC AGCGGACCCT GGGTGAGTTG AACGCCTTCG ACCCGGAGTC CGGCGAGGCG GGTGAGCTCA AGGGCGACTC CACCCTCCGC TCGGTGCAGG CCCGCCTGCG CCAAATGATC AGCGAGCCGC TGCCGGGGGC GGGCGGGCCG GTGCAGACCC TCGCGGACCT GGGCATCACC ACCCGCCGGG ACGGTACCCT GGAGATCAAC GACGCCCGGC TGGATGATGC GCTGAGCGAG AACCGGCTGG ATGTCATCCG TCTGTTCACG GACGAGGAGA ACGGGCTGGC CGCGCGCCTG CAAGGCGCCG TGGATGAGTT CACCGGCCGG GACAGCGTGA TCAACAGCCG CACCGAGTCA CTGCAGGACC GGCTCGCCGC CCTGGCGCCA CAGCAGGAGC GCCTGGACCG GCGGATGGAT CAGCTGGAGG CGAGGCTGAT CCGCCAGTTT TCCGCCATGG ACTCGATGAT CGCGCAGATG AACCAGACCA GCGAGTTCCT GGATAACCAG CTTAACCTGT TGAATCAGCA ATAG
|
Protein sequence | MASISSLGVG SGLDIRDLVD QLVAAERQPG QNRLDRQESR LEAQISGLGR LQQAITDFGG ALGNVSAESD FRSVVATSNN ESAVSVSAGR DAPPGSYDVN VTQLAQAQRL ATNSDLFEDV EDFSAGTTSL GTGSFTIEYQ DGSAETFQLE EGADTLQDVR AAINNQSENV RASVVDDGEG PRLVITSRET GDQNAVSAIT VDPDDPDSDP LLERLAFDAE NLAEPDEDGV RAGDNFSQLR AAQDAELFVD GLRITRPGNE ISGVIDGVTL TLSAVDSARI NVAEEPGSAA SAVSDFVGAY NSLQRTLGEL NAFDPESGEA GELKGDSTLR SVQARLRQMI SEPLPGAGGP VQTLADLGIT TRRDGTLEIN DARLDDALSE NRLDVIRLFT DEENGLAARL QGAVDEFTGR DSVINSRTES LQDRLAALAP QQERLDRRMD QLEARLIRQF SAMDSMIAQM NQTSEFLDNQ LNLLNQQ
|
| |