Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0698 |
Symbol | |
ID | 4268857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 777079 |
End bp | 778545 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638125447 |
Product | flagellin domain-containing protein |
Protein accession | YP_741542 |
Protein GI | 114319859 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.911513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.224965 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGG TCATCAACAC CAACATCGCT TCCCTGAACG CCCAGCGCAA TCTCAACGCC AGTCAGGGCC AGTTGGGCGT GGCCCTGGAG CGGCTGAGCT CGGGACTGCG GATCAACAGC GCCAAGGACG ACGCGGCGGG GCTGGCCATC AGCAGCCGCT TTACATCGCA GATCAACGGC CTGGACCAGG CCGTGCGGAA TGCCAACGAC GGCATCTCCT TTGCCCAGAC CGCGGAGGGG GCACTGGACG AGAGCACCAA CCTGCTGCAG CGGATCCGCG AGCTGGCGGT GCAGGCGTCC AATGACACCA ACTCCGCCTC CGATCGCGAT GCCCTGGACC AGGAGGTGCA GCAGGCCATC CGCGAGATCA GCCGCATTGC CGCCTCCACC CAGTTCAACA ACCAGAACAT CCTGGACGGC TCGCTCAGCG ACCTGATCTT CCAGGTGGGC GCCAACCGGG GGCAGATCAT CTCGGTGGAC GGGGTGGATG CCCGGGCCCA GTCGCTGGGG GCGCAGGTGG CAGACTCCGG TGCGGTGGCC ACCAACACCT TGTCCGAGGG TGGAGAGCTC TCCATTGCCG GGGTGACCAT CGATATGGAT GGTGCCGAGA ACATCAGCGA TGTGATCAGC CGAATCAACG ACAACTTCTC CGAGACGGGG GTGCAGGCCT TCCAGACCTC CGGTGGGTCC ATCGTTGCGG AAACGGGGCT AAGTGAGGAT GAGATCTCGT TTACCGCGGA TCCCGACCCG GTGCAGCTGA TGACCATCAA TGGGGTCAAT GTCTTCTCGG AGGCCGGGGC GGGCATCGAC AATCCGCAGG CGTTGGCGGA GCGGATCAAT GCCTATACGC CGTTGACCGG CGTGTCGGCG GTGGACGTGG ACGGGGAACT CGTCCTTGAG AGCCAGGCGG ATCAGGAGTC GATCCGGGTT ACGGATGTCA ATGCCGACCT GTTTGAGGCA ACCACCACGG ATTTCGGTGA GGAAACCGTC TTTTTCGATG AGAATGACGA CCTCCGGACC GAGTTGACCT TCGAGCGTGG CTTCGACCTA CGGGTGCCCC TGGAGGCCGA CCCTCCGGTG ATTGCCGATG ACAGCGATGG GGATCTCCTT CAGCGCCTGG GTCTGGTCGG TGGCGCCGAT CGCTTCGAGA CCTTCAGCGC GGACACCGTC AGTGTCGCCA CCCGCGGCGA GGCGCAGGAC GCCATCCGCA CGGTGGACAT CGCCCTGCAG GAGATCAACG GCATCCGCGC CGACCTGGGT GCGGTACAGA ACCGCTTCGA GGCCACCACC GCCAACCTGA CCATCACCTC GGAGAACCTC AGCGCCTCGC GCAGCCGCAT CATGGATGCC GACTTCGCCG CCGAGACCGC CGCACTGACC CGCGGCCAGA TCCTGCAGCA GGCCGGTACC TCGGTGCTGG CGCAGGCCAA CCAGCTACCC AACAACGTCC TCAACCTGCT GCAGTAA
|
Protein sequence | MAQVINTNIA SLNAQRNLNA SQGQLGVALE RLSSGLRINS AKDDAAGLAI SSRFTSQING LDQAVRNAND GISFAQTAEG ALDESTNLLQ RIRELAVQAS NDTNSASDRD ALDQEVQQAI REISRIAAST QFNNQNILDG SLSDLIFQVG ANRGQIISVD GVDARAQSLG AQVADSGAVA TNTLSEGGEL SIAGVTIDMD GAENISDVIS RINDNFSETG VQAFQTSGGS IVAETGLSED EISFTADPDP VQLMTINGVN VFSEAGAGID NPQALAERIN AYTPLTGVSA VDVDGELVLE SQADQESIRV TDVNADLFEA TTTDFGEETV FFDENDDLRT ELTFERGFDL RVPLEADPPV IADDSDGDLL QRLGLVGGAD RFETFSADTV SVATRGEAQD AIRTVDIALQ EINGIRADLG AVQNRFEATT ANLTITSENL SASRSRIMDA DFAAETAALT RGQILQQAGT SVLAQANQLP NNVLNLLQ
|
| |