Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2298 |
Symbol | |
ID | 4268396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2609445 |
End bp | 2610575 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638127058 |
Product | putative transcriptional regulator |
Protein accession | YP_743130 |
Protein GI | 114321447 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0550808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0348046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTTCA AACCCTCCAC GGCGCAAAAT ACCGCGATCC GGAGGGCCAT CTGTGCATTC GCCAACGATC TTGATGGGCA GGGCGAGATC GGCGTGATCT TCGTGGGGCT CAACGATGAC GGGACCTGCA AAGGGATCGA ACATCCAGAC CGCGACCAGA AGAAGCTATC CGACTGGGCC TTGGGGGGCG ACATCCTCCC CCTTCCTGAC GTGGAAATCG GTTATCGCGA GCTGGACGGT TGCCCTGTCA TAGTCGCCGA GATACGCCCC CACCAAGAAC CCCCGGTACG CTACCAGGGC CAGGCGTGGG TCAGGGTTGG CACCACCAAC CGCCGCGCTA CTCCGGAACA GGAGAGACGT TTGGCGGAGC GACGGCGGGG CGGAGACCTT CCCTTCGACC ATCGGCCGGC CCCCGGCGCC AGTCTTGATG ACCTCGATCT GAGGTACTTC GAGCGCGAGT ACCTGCCCAA TGCCGTCGCA CCGGAGGTCC TGGAAGAGAA CAACCGTAGC ATCGAAGATC AACTCAACGC CCTCAGGTTT CTCACCCAGG GCCAAGTCAA CCACGGTGCT CTGCTCGTCC TGGGGCACGA CCCGACCGCC TATATACCGG GGGCCTACGT GCAGTTCCTA CGCATCGACG GAACCGAGCT GGGCGACCCC ATCAAGGACG AAAAGAGACT GACCGGCAAC CTGCCCGAGG TCATGGCGCA GCTGGACGAG TTGTTGGAGG TGCACATCCA GGTTGCTGTC GACATCGAGG CCGGATCACG GGAGCAGCGC CAGCCCGACT ACCCCGTTGT TGCCTTGCAG CAACTCACCC GGAACGCCCT CATCCATCGC GCCTACGAAG GCAGCCACGC GCCGGTCCGC GTCTATTGGT TCAGAGACCG GGTGGAAATT CAGAACCCCG GCGGACTCTA CGGCCAAGTC ACCACCGACA ACTTCGGTCA GGGTGTCACC GATTACCGCA ACCCCCTGAT AGCAGAAGCC ATGCATATTC TCGGCTACGT ACAGAGATTC GGGTTCGGCA TCCCATTGGC ACAGCGCCAC CTGCGGGAAA ACGGCAACCC CGACCCCGAT TTTCAGTTCG CGCCAGAGTT CCTGGCCGTC ACCGTGAGGG CAGTGACGTG A
|
Protein sequence | MEFKPSTAQN TAIRRAICAF ANDLDGQGEI GVIFVGLNDD GTCKGIEHPD RDQKKLSDWA LGGDILPLPD VEIGYRELDG CPVIVAEIRP HQEPPVRYQG QAWVRVGTTN RRATPEQERR LAERRRGGDL PFDHRPAPGA SLDDLDLRYF EREYLPNAVA PEVLEENNRS IEDQLNALRF LTQGQVNHGA LLVLGHDPTA YIPGAYVQFL RIDGTELGDP IKDEKRLTGN LPEVMAQLDE LLEVHIQVAV DIEAGSREQR QPDYPVVALQ QLTRNALIHR AYEGSHAPVR VYWFRDRVEI QNPGGLYGQV TTDNFGQGVT DYRNPLIAEA MHILGYVQRF GFGIPLAQRH LRENGNPDPD FQFAPEFLAV TVRAVT
|
| |