Gene Information |
Locus tag | Mlg_0700 |
Symbol | |
ID | 4268859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 781683 |
End bp | 783143 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638125449 |
Product | flagellin domain-containing protein |
Protein accession | YP_741544 |
Protein GI | 114319861 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
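
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon. The record states neither convention, so both are assumptions, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 781683
END_BP = 783143


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1461 486
```

Under these assumptions the result agrees with the listed gene length of 1461 bp and protein length of 486 aa.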
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.255578 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGG TAATCAACAC CAACATCGCT TCTCTTAATG CGCAGCGGAA TCTGAACGCA TCGCAGAACC AACTGAATGT GTCGCTGGAG CGGCTCTCGT CCGGGCTGAG GATCAACAGC GCCAAGGATG ATGCCGCGGG TCTCGCCATC TCCGAGCGAT TCACCGCTCA GATCAACGGC AACAACCAAG CGGTCCGGAA CGCCAACGAT GGCATCTCCC TCTCTCAGAC CGCCGAAGGG GCCTTGGAAG AGATCGGGAA CATCGGGCAG CGCATTCGTG AGCTGGCCGT CCAGGCAGCC AATGACACCA ACTCCGCGTC GGACCGCCAG GCGTTGAACA ACGAGGTCCA GCAGCTGATT GCGGAGGCCG GGCGTATCGC CCAGGCGACC CAGTTCAACG ATCAGAACGT GTTGGACGGC AGCCTGGAGG AGATCCTGTT CCAAGTGGGT GCCAATCGTG GTCAAACCAT TTCGGTGGAT GGCGTGGATG CCCGCAGTGA CCAACTGGGT GCCCAGCTCT TTGCCGGTGA TGACATTGAC TTCACGGAAC TGGTGGATAA CGCGAATGCC ACCACCGCGA GCTTTAACAT CACCGATGAT CTGACCATCA ACGGTGAAGC GGTGGATCTG TCCGGCATCG AGGCCGAGGA CGGTTTCGCG GATGTGGATG ACATCGTTGC TGCGATCAAC GCCGTCTCTG GTGATACCGG CGTCGAGGCC GATAGGGCGC TGGAGGTCGA AGCCACCATT GATGCCAGTG GGCACACTGC CAGTAATACC GGGCTGGCCT TCTCGCTCAA TGGGGTTGAT ATCACCGTGG ACGGCACGGC GAGCCAGACC GAGGCCGCTA CCAGGCTGGC CGATGCAATC AATGAGGCCG GAGTGAGTGG GGTCAGTGCT CAGATCGATC CGGATGATGA CACCCAAGTG GTTCTCAGTG GGGACGGGGC TGATCTCCAA TTCACCCAAG GCGCGGATTC GCAGGGGATC ACCTTTGTCG ACGGGGCCGA TCTCGGTGCA AACGAAGATG ATATCGCAGG GTTTGCCCGG GCCATTGAGT TGACTGGGCA GCTGGGTGAG TCGGTCGACG TACAGGGTAC GGATGTCGGA AACCTGGGTC TGGATAACCT GGGTGACGGG GAGGAGAAAA CACTGGCCAG CGTAGATATC TCCACCCGTG ATAATGCCTC CGACGCCATC CGGTCCATGG ACTTTGTCCT CGCCCAGGTA AACAGCCTTC GGGCCGACCT CGGTGCGGTC CAGAACCGCT TCGAGTCCAC GGTCGCCAAC CTGTCGGTGA CCTCGGAGAA CCTGGAGGCC TCCCGCAGCC GGATCCTGGA CGCGGACTTC GCCGCTGAGA CCGCGGCCCT GACCCGCAGC CAGATCCTGC AGCAGGCTGG CACCTCGGTC CTGGCGCAGG CCAACCAGCT CCCGAACAAC GTGCTGGCCC TGCTCCAGTA A
|
Protein sequence | MAQVINTNIA SLNAQRNLNA SQNQLNVSLE RLSSGLRINS AKDDAAGLAI SERFTAQING NNQAVRNAND GISLSQTAEG ALEEIGNIGQ RIRELAVQAA NDTNSASDRQ ALNNEVQQLI AEAGRIAQAT QFNDQNVLDG SLEEILFQVG ANRGQTISVD GVDARSDQLG AQLFAGDDID FTELVDNANA TTASFNITDD LTINGEAVDL SGIEAEDGFA DVDDIVAAIN AVSGDTGVEA DRALEVEATI DASGHTASNT GLAFSLNGVD ITVDGTASQT EAATRLADAI NEAGVSGVSA QIDPDDDTQV VLSGDGADLQ FTQGADSQGI TFVDGADLGA NEDDIAGFAR AIELTGQLGE SVDVQGTDVG NLGLDNLGDG EEKTLASVDI STRDNASDAI RSMDFVLAQV NSLRADLGAV QNRFESTVAN LSVTSENLEA SRSRILDADF AAETAALTRS QILQQAGTSV LAQANQLPNN VLALLQ
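
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is a minimal illustration using only the Python standard library, not the database's own method. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that table 11 assigns codons to amino acids exactly as the standard code does and differs only in which codons may serve as starts. This gene begins with ATG, so alternative start codons are not handled.

```python
# Standard codon assignments listed in TCAG order. NCBI translation table 11
# shares this codon-to-amino-acid mapping with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
fragment = "ATGGCACAGG TAATCAACAC CAACATCGCT"
print(translate(fragment))  # MAQVINTNIA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the 61% GC content field.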