Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0707 |
Symbol | |
ID | 4268096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 789573 |
End bp | 790982 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125456 |
Product | two component, sigma54 specific, Fis family transcriptional regulator |
Protein accession | YP_741551 |
Protein GI | 114319868 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.112148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.322603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAAC CCAGTGTGCT CATCGTGACC ACGGACCAGG AGTTCAGCCG GCGCCGCTGC CTGCTCCTGC GCGAGGCCGG GTTCCAGCCC ATCGGCGTTT CCGACTACAC CCACGCCCAG CGGCTGCTGG AGCGGCGCAG CCTGGCGCTG GTGATCGTGC AGGGGGGTGC CGGGGCCTGC CAGTGGCGGC AGCTCGCCCA GTTGCCCTCG GCGCCGCCGC TGGTGCTGCT GGCCCCCCGG CCTTCGGTGG AGGAGGCCAA GCAGGCGCTG CGTGCCGGTG CCGCCGAGTA CCTGGCCGAG GGCTGCTCCT CCCTGGAGCT GGTCAACACG GTGCGCCGGC TGGTCGGGCG CGAGCAGGAC CTGGTGGCCC GCGACCCCCG TTCCCGCGAG GTCTACCGCA TGGCCCGACG GGTGGCCCAG AAGGACGTGA GCGTGCTGAT TACCGGCGAG TCGGGCACCG GCAAGGAGAT GCTGGCCCGG CACATCCATG AGCACTCCGG GCGCGCCGAT GGGCCCTTTG TGGCGGTCAA TTGCGCGGCC ATCCCCGAAC AGATGATCGA GGCGGTGCTG TTCGGATTCG AGAAGGGGGC CTTCACCGGC GCCCACCGCA GCCACGCCGG TAAATTCGAG CAGGCCCAGG GGGGCACGCT GCTGTTGGAC GAGGTCTCGG AGATTGATAC CGGGCTGCAG GCCAAGCTCC TGCGGGTGCT GCAGGAGCGG GAGATCGAGC GCCTCTGCGG CAGCGAGAGC ATCCCGCTGG ACGTCCGGGT GCTGGCCAGC AGCAACCGCG ACCTGCGTGA ACAGGTGGCC GCCGGCCGCT TCCGCGAGGA CCTCTACTAC CGGCTTCACG TCTTCCCGCT GCACCTGCCG CCACTGCGTG AGCGCCTGGA CGATGTGTTG CCGCTGGCGG AGGCCTTCAT CGCCAAACAC GCCGGATTGG GCCCCGAGGG TGCCCGGCTG GACGACGCCG CCCGCAGCCG CCTGCTGGGT CACGACTGGC CCGGCAATGT GCGCGAGCTG GAGAACGTCA TCCAGCGCGC CATGATCCTG GCGGACGAGG CCCGGATCAG CGCCGACGAC CTGGTGATCG AACCGGTGCC GGCCGCCCCC GCCCGCCAGG CCGCCCCGGT GCAGGGGGGC GCCAATGAGC CGGCGGCACC CGCGGACAGG GGGAACGGCG ATGGCCGGCC GAAGGATGGG GCCTCCCCCA CCCTGGGCGA TAACCTGCGC GAGCGGGAGT TCCGCCTGAT CATGGACACC CTCCGGGCCT GCCGTGGCAA CCGTAAGCAG GCCGCGGAGC AGCTGGGCAT CTCCGATCGC ACCCTGCGTT ACCGGGTGGC CCGGTTGCGC AAGGAGGGCT TTCACGTGCC CTCGAAGGCG GGCGCGGAAT ATGCGTACGG CCAGGCGTGA
|
Protein sequence | MSQPSVLIVT TDQEFSRRRC LLLREAGFQP IGVSDYTHAQ RLLERRSLAL VIVQGGAGAC QWRQLAQLPS APPLVLLAPR PSVEEAKQAL RAGAAEYLAE GCSSLELVNT VRRLVGREQD LVARDPRSRE VYRMARRVAQ KDVSVLITGE SGTGKEMLAR HIHEHSGRAD GPFVAVNCAA IPEQMIEAVL FGFEKGAFTG AHRSHAGKFE QAQGGTLLLD EVSEIDTGLQ AKLLRVLQER EIERLCGSES IPLDVRVLAS SNRDLREQVA AGRFREDLYY RLHVFPLHLP PLRERLDDVL PLAEAFIAKH AGLGPEGARL DDAARSRLLG HDWPGNVREL ENVIQRAMIL ADEARISADD LVIEPVPAAP ARQAAPVQGG ANEPAAPADR GNGDGRPKDG ASPTLGDNLR EREFRLIMDT LRACRGNRKQ AAEQLGISDR TLRYRVARLR KEGFHVPSKA GAEYAYGQA
|
| |