Gene Mlg_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1028 
Symbol 
ID4269769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1173506 
End bp1175434 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content68% 
IMG OID638125780 
ProductFis family GAF modulated sigma54 specific transcriptional regulator 
Protein accessionYP_741871 
Protein GI114320188 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.833434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGATC GCCGCATTTA TCAGGCCTGG GAGAGCTTCC TCAGTCAGGG GGAAACGCCG 
ACAGGGGTGC GTGACGAAGT CCTCGCCTCC TGGCAGCGCT CCCTGGACAA CAACGTACCG
GTGGACCGCT CCCAGACCCA AGCGCTCAGC GATGGCGAGT TCCTGCGCGT CCGGCAGCAA
AGCGGCCCCT TCCTGACGGC GGCACGCCCG GCACTGGAAC AGGGGCGCCG GTTTCTCAGC
GAGGCCCGGG CCATGGTCAT GTTGAGCAAC GCCCATGGCA CCGTCCTGGA GACCGTCGGC
GACCGGCGGG TGATCGAACA TGGCCAGGAC ATCGGCCTGT GCCGGGGCGG GTTGTGGGAC
GAGGGCCACA TCGGCACGAA CGCCATCGGC ACCGCGCTGG CCAGCCAGCA ACCGGTCCAG
ATCCATGGCT ACGAACACTA TTGCTGCCGG GTACAGCGCT GGACTTGCGC CGCCGCGCCG
GTGTTCAGCC CAACGACCCG GCGCATTCTC GGTGTCGTGG ACCTCTCCGG ACCCGCGGAG
AGCTTTAACC CACAAAGCCT GGCCTACGTG GTGGCAGTGG CCCGCCAGAT TGAGGGTGGG
CTGATACAGG CCACCGAGGC CGACCACCGG CGGCTGATAG ACCGGTTTCT GGGGATGGGA
CGGCGCTGGA AGCACCGCGA CGTCCTGGTG GTCAGTCGCA GCGGCGTCAT CGTGCATGGC
AACGAACAGG TGCGACGCCA GATCAGCCGC GCCTCCCGTA ACCTGTTTTT CGAGAACACC
ATCCCGCTAC TGCGGGACAC CCCTGCGGAA GAATGGCTGG ACAAGCTCCA CGCCCAGCTG
CCCACCGCGG ACATTGAACC GGTGACCGTG GATGGCGAGC ACCTGGGGGT TATCCTGGCA
CCGCGTCAGG GTCGGACCGG CATCCGGCCG CGTAATCGGG ACATGGGCGA GCACCCGTCG
GACGGCTTCT CGCTGGACAC CCTGATCGGG GACAGCCCGG CCATGCGCGC CGCCTGCGAT
AAGGCCCGCC GACTCGCGGC CACCGACGCC CCGATCCTGA TCGAGGGCGA GACCGGGGTC
GGCAAAGAGC TCTTCGCACA GGGCATCCAC GCCCTGAGCA TGCTGACCGG GCCCTTTATC
CCGGTCAATT GCGGGGCGCT TCCCAAGGAC CTGATCGGCG GCGAAATGTT TGGTTACGTG
GGCGGGGCCT TCACCGGCGC CAGTCAGGAG GGCCGGCCCG GAAAACTGGA GGCCGCCGAC
GGCGGCACCC TTTGCCTGGA CGAGGTCAGC GAGATGCCGC TGGACCTGCA ACCCACCCTG
TTGCGCATCC TGGAGGATGG GGTCGTCTAT CGCATCGGCA GCCACCAGCC CCGACGGGTC
CGGGCCCGGC TGCTGTCCAT GACCAACCGC AATCTGCCGG AGGAGATCGA GTCCGGACGC
TTCCGTCAGG ATCTTTTCTA CCGGATCGCC GCCCTGCGCC TGCGCATCCC CCCGCTGCGG
GAACGCGGAG ACGATATCGC CCTGCTGGCG GAGTACTACC TGCGGCAACA GGCCACCCGC
AGTGGGCGAA CGCCCCAGTC ACTGTCCGCG GAGGCCATGG ACGCCCTGCT GCGCTACCAC
TGGCCCGGGA ATGTGCGCCA ACTGCGCAAT GCTATCACCA CCACTGCCGC TCTGACTGAC
GCCGCCACGA TCGACGTGGA GGCGTTGCCG GAAGAGATCC TCACCCCCGC CCCAGCCCCC
ACGCCCGGCG AGGACGGCAA TCTGCAACTA GCCACGGTGG AGCGGGCCGC CATCGAGCAG
GCCCTGCGCC GGTGTGAGGG CAATGTCTCC CGGGCTGCCC GGCAATTGGG CATCGCCCGC
TCCACGCTCT ACTGCCGCAT CCAGGAACAG CACATCCCCA TCCCCCGGCG CCGACGCACA
GCGCCGTGA
 
Protein sequence
MQDRRIYQAW ESFLSQGETP TGVRDEVLAS WQRSLDNNVP VDRSQTQALS DGEFLRVRQQ 
SGPFLTAARP ALEQGRRFLS EARAMVMLSN AHGTVLETVG DRRVIEHGQD IGLCRGGLWD
EGHIGTNAIG TALASQQPVQ IHGYEHYCCR VQRWTCAAAP VFSPTTRRIL GVVDLSGPAE
SFNPQSLAYV VAVARQIEGG LIQATEADHR RLIDRFLGMG RRWKHRDVLV VSRSGVIVHG
NEQVRRQISR ASRNLFFENT IPLLRDTPAE EWLDKLHAQL PTADIEPVTV DGEHLGVILA
PRQGRTGIRP RNRDMGEHPS DGFSLDTLIG DSPAMRAACD KARRLAATDA PILIEGETGV
GKELFAQGIH ALSMLTGPFI PVNCGALPKD LIGGEMFGYV GGAFTGASQE GRPGKLEAAD
GGTLCLDEVS EMPLDLQPTL LRILEDGVVY RIGSHQPRRV RARLLSMTNR NLPEEIESGR
FRQDLFYRIA ALRLRIPPLR ERGDDIALLA EYYLRQQATR SGRTPQSLSA EAMDALLRYH
WPGNVRQLRN AITTTAALTD AATIDVEALP EEILTPAPAP TPGEDGNLQL ATVERAAIEQ
ALRRCEGNVS RAARQLGIAR STLYCRIQEQ HIPIPRRRRT AP