Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0675 |
Symbol | |
ID | 4268470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 744538 |
End bp | 747465 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125424 |
Product | two-component sensor kinase CbrA |
Protein accession | YP_741519 |
Protein GI | 114319836 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTTA ACCTCAGCAC CCTCTTCTTC AGTGCCGCGG CCTACCTGCT GCTGCTGTTC GTCATCGCCC ACGGCACCGA ACGCCGTTGG CTGCCACGGG CGGTGGCCCA GCATCCGCTG TTCTACGTCC TGGCCCTGGG CGTCTACGCC ACCACCTGGA GCTTCTACGG CAGTGTCGGG TTTGCCGAGC GTTACGGTAT CGCCTACCTG CCCATCTACC TGGGCGCGAC CATTGCCTTT CTGCTCACGC CGGTCCTGCT CATGCCGCTG CTGCGATTGA CCCGCGACCA GCAGCTCACC TCGCTCTCGG ACGTCTTTGC CTTCCGCTTC CACAGCCAGG CCGCCGGGGT GCTGGTGACG CTATTGATGC TCACCGGCAT CCTGCCCTAC ATCGCCCTGC AGATCCGCGC CGTGGCCGAG TCCACCCAGT GGCTGGTGGG TGGCGAGACC TCGCCGCACG TGCTCGCCGT GGTGTTTTGC ATGACCGTGG CGGTGTTCGC CATCATCTTC GGCGCCCGCC ACGTCACCCC GCGGGAGAAA CACACCGGTC TGGTGGTGGC CATTGCCTTC GAATCGGTGG TCAAGCTGGC CGCGCTGATG GTGATCGCCT GGGTGGCCAT GAGCCTGGCC TTCGACGGAC CCATCGGCCT GAACCGCTGG TTGGCGGACA ATCCGGAGCG CCTGCAGGCC TTTTACCAGC CGGCCCTGGA CGGCCCCTGG ATCAGCCTGC TCTTTCTGGC CTTCTCGGCG GCGTTCCTAT TGCCCCGCCA GTTCCACATG ATCTTCACCG AGAACCTCAA ACCACGCAGC CTGCTCAAGG CCAGCTGGGG CTTCCCCGCC TACCTGCTGG CCCTGGCCAT CTGCATTCCC CCGATCCTCT GGGCCGGACA GGTGCTGCAA CCGGGCACGG GGCCGGAATA CTACGTGCTG GGGATGGCGG TGCTCAGCGA ATCCCCCGTG CTGGTGATCG TGACCTACCT GGGGGGGATC TCGGCCGCCA GCGCGATGAT CATCATCTCC ACCCTCGCCC TGTCCTCGAT GACCCTCAAT CACTTGGTGC TGCCCCTGAC CCGGGCCCGC CCACGCCAGG ACCTGGACCT GTACGCCACC CTGCGCTGGA CCCGGCGCGC ACTGATCCTG TTGATGATCG CCGCCGGCTA TTACTTCTTC CTGATGCTGG ACCCGGGCGA GGGGCTGGTG GACTGGGGCC TGATCTCCTT CCTGGCCATG GCCCAGTTCA TCCCCGGCAT CGTCGGCGTG CTCTGGTGGA CCCGGGCCAA CGTCTGGGGC TTCATCGCCG GGCTTTTGGG TGGGGCGCTG GTCTGGCTCG ACGCGCTCTT CCTGCCCGCG CTGGTGGGCA CCGAGCCCTT TTTCCTGTTG GGCTTCCCCA CCGCACCGGA GTCGGCCTCC GCCATTTACG GCCTGGCCAC CTTCTGGTCG ATCGCCTTCA ATTCGCTGCT GTTCGCCGCG GTGTCTGTCC TGACCCCGCA AACCGGGCCG GAGCGCCAGG CCGCCGAGGC CTGTCGCGAC CTGGGCCACC CCATGACCTT CGGCACCCTG GTGGCCGACT CACCGGCCCA ATTCGTGGTG CAACTGGCGC CGGTGACCGG CGACGAGGCG GCGCGCGCCG AGGTGGGGAA GGCCCTGGAG GACCTGGGAC TGGACTGGAC GGAGAACCGG CCGGACCGGC TCAAGCACCT GCGCGACCAG ATCGAGCGCA ATCTCTCGGG CATGATGGGC CCGGTCCTGG CCCGTATGAT CGTGGACGAG CGCCTGGAAC TGGACCACTC CGCCCATCAG GCCCATACCC AGAGCATCCG CCAGATCGAG GAACGGCTGG AGTCCTCCAG CAGCCGCTTC CGCGGGCTGA CCGCCGAGCT GGACCGCCTG CGCCGCTACC ACCGCCAGAT CCTTGAGGAC CTGCCGCTGG GCGTGATCAC GGTCACGGCA AACCAGCGGA TCGTGCGCTG GAACAGTGCC ATGCAGCAGC TCACCGGTAT TTGTGCCCGG GATGCCCTGG GCAACCGGCT GGAGGATCTG GCCCACCCCT GGGGCCGGCT ACTGGGACGG TTCATGGCGC TGGGCCAGGC CCACGCCCAC GACCAGGCCC AGGTTCCCGG CGACGGCACC CGCTGGCTGA GCCTGCACCG CACCAGTATC GGGGAGCCGG ACAAGGGCCG CACCAACGAC AGTCTGCTGC TGGTGGAGGA CGTCACCGAG TTGCGGGTGT TGGAGCGGGA GCTGGCTCAC AGCGAGCGGC TGGCCTCCAT CGGCCGGCTC GCCGCCGGCG TCGCCCACGA GATTGGCAAC CCGGTCACCG GCGTGGCCTG CCTGGCGCAG AACCTGCGCG ACGAGGATGA CCCGGAGCTG ATCCGCGAGA GCATGGAACA GATCCTGGAA CAGACCCACC GGATCAGTAA CATCGTGCAC ACCCTGGTCA GCTACGCCCA TGCCGGCAGC ACCGAGGAGT CGCCGCCGGA GCCGGTGCGG CTGCACGACG CGGCGGAGGA GGCCCGCCAG GTGATGGTCC TGAGCCGCCG GGGCAAGGAG ATGGAGTTCG ACAACCGGAT CCCGCCGCAC CTGGAGGTGG CCGGCGACAG CCAGCGGTTG GTCCAAGTGT TCGTCAATCT CTTCTCCAAC GCCGCCGACG CCTGCGCCCA GCAGGGCCGG CTGGTGCTCA CCGCCCGGGA GCGGGGCGAC CGGATCATCG TGCGGGTGGC GGACAACGGG CCGGGCATCC CCGCGTCGGC CCTGAAGAAG GTGCTGGACC CGTTCTACAC CACCAAACCG GCGGGACAGG GCACAGGGCT GGGCCTGCCG CTGGTGTACA ACATCATCAC CGATCACGGG GGCACCCTGG ACATTGAATC GGACGTGGGC GGCACTACAG TGACGCTGGA ACTGCCCGCC CTGGAGGGAG TGGCATAA
|
Protein sequence | MDFNLSTLFF SAAAYLLLLF VIAHGTERRW LPRAVAQHPL FYVLALGVYA TTWSFYGSVG FAERYGIAYL PIYLGATIAF LLTPVLLMPL LRLTRDQQLT SLSDVFAFRF HSQAAGVLVT LLMLTGILPY IALQIRAVAE STQWLVGGET SPHVLAVVFC MTVAVFAIIF GARHVTPREK HTGLVVAIAF ESVVKLAALM VIAWVAMSLA FDGPIGLNRW LADNPERLQA FYQPALDGPW ISLLFLAFSA AFLLPRQFHM IFTENLKPRS LLKASWGFPA YLLALAICIP PILWAGQVLQ PGTGPEYYVL GMAVLSESPV LVIVTYLGGI SAASAMIIIS TLALSSMTLN HLVLPLTRAR PRQDLDLYAT LRWTRRALIL LMIAAGYYFF LMLDPGEGLV DWGLISFLAM AQFIPGIVGV LWWTRANVWG FIAGLLGGAL VWLDALFLPA LVGTEPFFLL GFPTAPESAS AIYGLATFWS IAFNSLLFAA VSVLTPQTGP ERQAAEACRD LGHPMTFGTL VADSPAQFVV QLAPVTGDEA ARAEVGKALE DLGLDWTENR PDRLKHLRDQ IERNLSGMMG PVLARMIVDE RLELDHSAHQ AHTQSIRQIE ERLESSSSRF RGLTAELDRL RRYHRQILED LPLGVITVTA NQRIVRWNSA MQQLTGICAR DALGNRLEDL AHPWGRLLGR FMALGQAHAH DQAQVPGDGT RWLSLHRTSI GEPDKGRTND SLLLVEDVTE LRVLERELAH SERLASIGRL AAGVAHEIGN PVTGVACLAQ NLRDEDDPEL IRESMEQILE QTHRISNIVH TLVSYAHAGS TEESPPEPVR LHDAAEEARQ VMVLSRRGKE MEFDNRIPPH LEVAGDSQRL VQVFVNLFSN AADACAQQGR LVLTARERGD RIIVRVADNG PGIPASALKK VLDPFYTTKP AGQGTGLGLP LVYNIITDHG GTLDIESDVG GTTVTLELPA LEGVA
|
| |