Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1028 |
Symbol | |
ID | 4269769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1173506 |
End bp | 1175434 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125780 |
Product | Fis family GAF modulated sigma54 specific transcriptional regulator |
Protein accession | YP_741871 |
Protein GI | 114320188 |
COG category | [K] Transcription [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3284] Transcriptional activator of acetoin/glycerol metabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.833434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGATC GCCGCATTTA TCAGGCCTGG GAGAGCTTCC TCAGTCAGGG GGAAACGCCG ACAGGGGTGC GTGACGAAGT CCTCGCCTCC TGGCAGCGCT CCCTGGACAA CAACGTACCG GTGGACCGCT CCCAGACCCA AGCGCTCAGC GATGGCGAGT TCCTGCGCGT CCGGCAGCAA AGCGGCCCCT TCCTGACGGC GGCACGCCCG GCACTGGAAC AGGGGCGCCG GTTTCTCAGC GAGGCCCGGG CCATGGTCAT GTTGAGCAAC GCCCATGGCA CCGTCCTGGA GACCGTCGGC GACCGGCGGG TGATCGAACA TGGCCAGGAC ATCGGCCTGT GCCGGGGCGG GTTGTGGGAC GAGGGCCACA TCGGCACGAA CGCCATCGGC ACCGCGCTGG CCAGCCAGCA ACCGGTCCAG ATCCATGGCT ACGAACACTA TTGCTGCCGG GTACAGCGCT GGACTTGCGC CGCCGCGCCG GTGTTCAGCC CAACGACCCG GCGCATTCTC GGTGTCGTGG ACCTCTCCGG ACCCGCGGAG AGCTTTAACC CACAAAGCCT GGCCTACGTG GTGGCAGTGG CCCGCCAGAT TGAGGGTGGG CTGATACAGG CCACCGAGGC CGACCACCGG CGGCTGATAG ACCGGTTTCT GGGGATGGGA CGGCGCTGGA AGCACCGCGA CGTCCTGGTG GTCAGTCGCA GCGGCGTCAT CGTGCATGGC AACGAACAGG TGCGACGCCA GATCAGCCGC GCCTCCCGTA ACCTGTTTTT CGAGAACACC ATCCCGCTAC TGCGGGACAC CCCTGCGGAA GAATGGCTGG ACAAGCTCCA CGCCCAGCTG CCCACCGCGG ACATTGAACC GGTGACCGTG GATGGCGAGC ACCTGGGGGT TATCCTGGCA CCGCGTCAGG GTCGGACCGG CATCCGGCCG CGTAATCGGG ACATGGGCGA GCACCCGTCG GACGGCTTCT CGCTGGACAC CCTGATCGGG GACAGCCCGG CCATGCGCGC CGCCTGCGAT AAGGCCCGCC GACTCGCGGC CACCGACGCC CCGATCCTGA TCGAGGGCGA GACCGGGGTC GGCAAAGAGC TCTTCGCACA GGGCATCCAC GCCCTGAGCA TGCTGACCGG GCCCTTTATC CCGGTCAATT GCGGGGCGCT TCCCAAGGAC CTGATCGGCG GCGAAATGTT TGGTTACGTG GGCGGGGCCT TCACCGGCGC CAGTCAGGAG GGCCGGCCCG GAAAACTGGA GGCCGCCGAC GGCGGCACCC TTTGCCTGGA CGAGGTCAGC GAGATGCCGC TGGACCTGCA ACCCACCCTG TTGCGCATCC TGGAGGATGG GGTCGTCTAT CGCATCGGCA GCCACCAGCC CCGACGGGTC CGGGCCCGGC TGCTGTCCAT GACCAACCGC AATCTGCCGG AGGAGATCGA GTCCGGACGC TTCCGTCAGG ATCTTTTCTA CCGGATCGCC GCCCTGCGCC TGCGCATCCC CCCGCTGCGG GAACGCGGAG ACGATATCGC CCTGCTGGCG GAGTACTACC TGCGGCAACA GGCCACCCGC AGTGGGCGAA CGCCCCAGTC ACTGTCCGCG GAGGCCATGG ACGCCCTGCT GCGCTACCAC TGGCCCGGGA ATGTGCGCCA ACTGCGCAAT GCTATCACCA CCACTGCCGC TCTGACTGAC GCCGCCACGA TCGACGTGGA GGCGTTGCCG GAAGAGATCC TCACCCCCGC CCCAGCCCCC ACGCCCGGCG AGGACGGCAA TCTGCAACTA GCCACGGTGG AGCGGGCCGC CATCGAGCAG GCCCTGCGCC GGTGTGAGGG CAATGTCTCC CGGGCTGCCC GGCAATTGGG CATCGCCCGC TCCACGCTCT ACTGCCGCAT CCAGGAACAG CACATCCCCA TCCCCCGGCG CCGACGCACA GCGCCGTGA
|
Protein sequence | MQDRRIYQAW ESFLSQGETP TGVRDEVLAS WQRSLDNNVP VDRSQTQALS DGEFLRVRQQ SGPFLTAARP ALEQGRRFLS EARAMVMLSN AHGTVLETVG DRRVIEHGQD IGLCRGGLWD EGHIGTNAIG TALASQQPVQ IHGYEHYCCR VQRWTCAAAP VFSPTTRRIL GVVDLSGPAE SFNPQSLAYV VAVARQIEGG LIQATEADHR RLIDRFLGMG RRWKHRDVLV VSRSGVIVHG NEQVRRQISR ASRNLFFENT IPLLRDTPAE EWLDKLHAQL PTADIEPVTV DGEHLGVILA PRQGRTGIRP RNRDMGEHPS DGFSLDTLIG DSPAMRAACD KARRLAATDA PILIEGETGV GKELFAQGIH ALSMLTGPFI PVNCGALPKD LIGGEMFGYV GGAFTGASQE GRPGKLEAAD GGTLCLDEVS EMPLDLQPTL LRILEDGVVY RIGSHQPRRV RARLLSMTNR NLPEEIESGR FRQDLFYRIA ALRLRIPPLR ERGDDIALLA EYYLRQQATR SGRTPQSLSA EAMDALLRYH WPGNVRQLRN AITTTAALTD AATIDVEALP EEILTPAPAP TPGEDGNLQL ATVERAAIEQ ALRRCEGNVS RAARQLGIAR STLYCRIQEQ HIPIPRRRRT AP
|
| |