Gene Information |
Locus tag | Mlg_2043 |
Symbol | |
ID | 4270177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2313740 |
End bp | 2315059 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126799 |
Product | sigma-54 dependent transcriptional regulator |
Protein accession | YP_742875 |
Protein GI | 114321192 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
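The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values in this record), the arithmetic can be checked as follows. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2313740, 2315059)
print(length, protein_length_aa(length))  # prints: 1320 439
```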
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.339367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCACGG CAGGGGGCAC CGGGCGCTGG CAGCCCGATG CGCGGGCGAT GCTGGAGGCC TTTGACGAGG CGGCCATCCT GATCGGCCTC GACTACGTGA TCGTCGCGGC CAACGACGCC TACCAACGCC TCTACGGCGA TGCCCCCATC GACCGGGGGC AGCGCTGTTT CGAGGTCTCC CACCACTACA ACGTCCCCTG TGACCAGGCC GGTGAGGCCT GTCCGCTGCG CGAGTCCTTG CGCTCCGGGC GGCGGCACCG GGTGCTGCAC GTCCATCATG GCCCTGAGGG TCAGGAGCAC GTGGATGTGG AGACCTGGCC GGTCCCCGGG CCGGACGGGC AGGTGCAGTT CTTTATTGAG GTGTTGCGCC GGCTGCCGGT GGCCAGTGCC CAGGCCAACG GCGGTGAGCT GACCGGCCGC TGCCGCGCCT TCAACCGGAT GCTTGGGCTG ATCAGCCGGG CCGCCCCCAG CGATGCCTCT GTGCTGCTGC AGGGCGAGAC CGGGACCGGC AAGGAGTTGG TGGCCCGTGC CGTGCACCAG GCCAGCCTGC GCCAGGAGGC GCCTTTCGTG CCGGTGGACT GCTCGGGGCT CACCGAGACC CTGTTCGAGA GCGAGCTGTT CGGTCATGAG AAGGGCTCGT TCACCGGTGC GGTCTCCTCG CGGATCGGCC TGGTGGAGGC GGCCTCGGGC GGGACGCTCT TCCTCGACGA GGTGGGGGAT ATCCCCCTGG CCCTCCAGGT CAAGCTGCTG CGCCTGCTGG AGACTCGCAG CTTCCGCCGG GTGGGCAGTG TCGAGCCGCG GCGGGCCGAT TTCCGCCTCG TGGCGGCCAC CCACCGCGAC TTGCGGCGGA TGGTGGAGCA GGGCCAGTTC CGCGAAGACC TCTACTACCG GATCTCCACC TTTCCCATCG ATGTGCCGCC ACTGCGCGCG CGGCTGGATG ACCTGCCGCT GCTCTGTCAG GCCATCCTGC GCCGGCTGCA GTACGGCCAG GGGCGGTATC TCTCCGACGA GGCATTGGCG GAGCTCCAGG GCTATGACTT TCCGGGCAAT GTCCGCGAGC TGCGCAATGT CCTGGAGCGT GCGAGCCTGC TGGCGGACGG CCCGGAGATC CTGCCAGAGC ACCTGTCACC AGAGGTCAGG GGGCGGCAGG GGCTTGCGGT GCTGCCGGAG GACGAGCTGC TGAGCCTGGA GGAGGCGGAG GCGCACTATC TGGGCCGGGC ACTGGCCCTG CATCAGGGCG ACCGGGCCAG CCTGGCCCGC CTGCTCGGGG TGAGTGAGCG GACCCTTTAC CGCAAGCTGC GGCGCCACGG ACTGGGTTGA
|
Protein sequence | MGTAGGTGRW QPDARAMLEA FDEAAILIGL DYVIVAANDA YQRLYGDAPI DRGQRCFEVS HHYNVPCDQA GEACPLRESL RSGRRHRVLH VHHGPEGQEH VDVETWPVPG PDGQVQFFIE VLRRLPVASA QANGGELTGR CRAFNRMLGL ISRAAPSDAS VLLQGETGTG KELVARAVHQ ASLRQEAPFV PVDCSGLTET LFESELFGHE KGSFTGAVSS RIGLVEAASG GTLFLDEVGD IPLALQVKLL RLLETRSFRR VGSVEPRRAD FRLVAATHRD LRRMVEQGQF REDLYYRIST FPIDVPPLRA RLDDLPLLCQ AILRRLQYGQ GRYLSDEALA ELQGYDFPGN VRELRNVLER ASLLADGPEI LPEHLSPEVR GRQGLAVLPE DELLSLEEAE AHYLGRALAL HQGDRASLAR LLGVSERTLY RKLRRHGLG
|
| |
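The GC content and the protein sequence can in principle be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, under these assumptions: the gene sequence is given 5' to 3' on the coding strand (it begins with a GTG start codon and ends with a TGA stop codon, as shown above), whitespace between the 10-base groups is ignored, and the first codon is a valid start codon, which under translation table 11 is read as methionine. Table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons. The names `gc_fraction` and `translate_cds` are hypothetical helpers introduced here for illustration.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG ordering (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a valid start codon, so it is emitted
    as 'M' even when it is GTG or TTG, as is usual for table 11.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGGCACGG CAGGGGGCAC CGGGCGCTGG"))  # prints: MGTAGGTGRW
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` gives values that can be compared with the GC content and protein sequence fields above.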