Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2582 |
Symbol | |
ID | 4270291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2923506 |
End bp | 2924585 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638127341 |
Product | Sel1 domain-containing protein |
Protein accession | YP_743412 |
Protein GI | 114321729 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.805298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0195888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAA CACCCCCCAC CGACGCCGCC GACGCCGGCC GCCACGCCTG GGAGCGCGGC GACCCCGCGG AGGCGGCCCG TCTCTGGCGC CCCGCGGCGG AACAGGGCGA CCCCGATGCC CAGGTGGGCC TGGGGCTGTT GCTGGCCCAC GGCGAGGGGC TGGAGCAGGA CCTGGCCGCC GCCCGCCGCT GGTGGGAGCA GGCCGCCGAG AAGGACCACG CGGATGCCTG GTTCAACCTC GGCCAGATCA CCGAATACGG CCTGGACGGG ACCCCGGACC CGGCCCAGGC GGCGGCACTC TACCGCCGGG CCGCCGACCA GGGCCACCCC CAGGGCCTGC ACGCCCTGGC CGCCCTACTG TTCCAGGGCC AGGGCGTGCC GGAGGACCCG GCCCAGGCGG TGGCCCTGTG GCGCCGGGCC GCCGAGGCCG GCCTGCCCGA TGCGGAGAAC AGCCTGGGGG TGGCCCACCA GATGGGCCGC GGCGTCGAGG AGGACTTCAG CGCCGCCGTG CGCCACTACC GCCGGGCCGC CGAACAGGGC CACCCGCAGG CCGCCGCCAA TCTGGCCGGC CTGCTGGCCA TGGGCCTGGG TGTGGCCCAG GACCCGACCG AGGCGGCGCG GTGGTGGCGC CTGGCGGCCG AGGCCGGCGA CCCGGACGCC CAGGTCCAAC TCGGCAACTG TTATCGCGAC GGTCGCGGTG TGGCCCAGGA CGACCAGGCG GCCGTGGACT GGTACTGGCG CGCCGCCCGT CAGGGCCATC CGGAGGGGCA GACCAACGTG GGCGTCATGC ACGATCAGGG CCGCGGGGTG TTCAAGGACC CGGCCAAGGC CGTCAAGTGG TACCGGCTCG CCGCCGAGCA GGGCTTCCCG CCGGCGCAGT ACAACCTGGC CATCATGTAC TCCGAGGGAC ACGGGGTGGA GGAGGACAAG ATCGAAGCCT GGTGCTGGTT CAGCCTGGCC GACCGCCAGG GCTACGCCCC CGCCCGCGAT GCCGTGACCT GGCTCGACGA GGTCATGGAC CCCATCAGCC GCGCCCGCGC CGAGGAACGT CTGCGAGTGC TGGCGGGTCA ATCAGGGTAG
|
Protein sequence | MATTPPTDAA DAGRHAWERG DPAEAARLWR PAAEQGDPDA QVGLGLLLAH GEGLEQDLAA ARRWWEQAAE KDHADAWFNL GQITEYGLDG TPDPAQAAAL YRRAADQGHP QGLHALAALL FQGQGVPEDP AQAVALWRRA AEAGLPDAEN SLGVAHQMGR GVEEDFSAAV RHYRRAAEQG HPQAAANLAG LLAMGLGVAQ DPTEAARWWR LAAEAGDPDA QVQLGNCYRD GRGVAQDDQA AVDWYWRAAR QGHPEGQTNV GVMHDQGRGV FKDPAKAVKW YRLAAEQGFP PAQYNLAIMY SEGHGVEEDK IEAWCWFSLA DRQGYAPARD AVTWLDEVMD PISRARAEER LRVLAGQSG
|
| |