Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0666 |
Symbol | |
ID | 4268278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 730123 |
End bp | 732876 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638125415 |
Product | periplasmic sensor signal transduction histidine kinase |
Protein accession | YP_741510 |
Protein GI | 114319827 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.220304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGT GGCTGCTCGT TTTCACCTCC TTCGGCTATG TGGGGCTATT GTTCGCCATC GCCTACTCGG CGGACAAGCT GGGCGATACC GGGCGCTCGC TGGTCAACAA CCCTTATGTC TACACCCTGT CCATCGCGGT CTACTGCACC GCCTGGACGT TCTATGGCAG TGTCGGTCGG GCCACCGAGG CCGGGGTGTC GTTCCTGACG ATCTACCTCG GCCCCACTCT GATGGCGGTC CTCTGGTGGC TGATCCTCCG CCGGATCATC CGGATTAGTA AGCTCTACCG CATCACCTCC ATCGCCGACT TCATCGCCTC GCGCTACGGG AAATCCATGG CGCTGGGCGG ACTGGCCACC TTCATCGCGG TGGTGGGCAT CACCCCCTAT ATCGCCCTGC AGCTCAAGGC GGTCTCCGAC AGCTTCGTGG TCATGCTGAG CTTCCCGGAA CCGGAGGTGG CGGCCACGCC GTTGCGCTTC TGGGACGACA AGGCCTTCTG GGTGGCCGGG CTGCTGGCGA TCTTCGCCAT CCTGTTCGGG ACCCGGCACA TCGACGCCAC CGAGCGGCTG GAGGGCATGG TGGTTGCGGT GGCCTTCGAG TCGGTGATCA AGCTGCTGGC CTTCCTGGCG GCGGGCTTCT TCGTCACCTT CATGCTCTTC GACGGGCCGC GGGCGCTGTT CGAGCAGGCC GCCGAGGTGG ACCAGATCCG CGACCTGATG CTGCTCTCCA GCCTGCCGGG TGGCTACGTG AACTGGCTCA GCCTCACCCT GCTCTCCATG CTGGCCATCA TCTTCCTGCC CCGGCAATGG CAGGTCACGG TGGTGGAGAA CGTCAACGAG GACCACCTGC GCACCGCCGC CTGGTTGTTC CCACTCTACC TGCTGGTCAT AAACCTGTTC GTACTGCCCA TCGCCTTCGC GGGCATGCTC TATTTCTCCG AGGGCACCGT CAATCCGGAC AGCTTCGTGC TGGCACTGCC GCTGGCCGAG GGGTATAACT GGCTGGCCCT GTTTATCTAC ATCGGCGGTT TCTCGGCGGC CACCGGCATG GTCATCGTGG CCACTATCGC CCTGTCGATC ATGGTCAGCA ACGACTTGGT GGTACCGGCG CTCCTGCGCA TGCGGGGTTT TCGCCTGTCC GCCGACGCCG ATCTCTCGCG CTTCATCGTC AACCTGCGCC GCGGGATCAT CTGCCTGATC CTGTTATTGG GTTATTTCTA TTACCTGCTC ATTGCGGATA CCTACGCCCT GGTCTCCATC GGCCTGATCT CCTTCGTGGC CGTGGCCCAG TTCGCCCCAC CGATCCTGAT CGGGATCTTC TGGAAGGGTG CCAGCCGCAA GGGTGCCATG GCCGGGCTGA TCGCCGGCTT CGTCATCTGG GCCTATACCC TGTTGCTGCC CTCCTTCGTT GACTCCGGGC TCATCCCCAG GGCCCTGCTG GAGCAGGGCC CCGCCGGGAT TGCCTGGCTC AACCCCTACG GCCTGTTCGG CCTCACCGAC CTGGACCCGA TCACGCATAC GGTGTTCTGG AGCCTGCTGG CCAACATCGG GCTGCTGGTG GGGGTCTCGT TGTTCGCCCG CCAGTCGGAT ATCGAACGGA TCCAGGGCGC GCTGTTCGTC GATGTCTTCC TGCGCTCCGA GCGCGACACC CGCTTCTGGG AAGGCACCGC GACCGTGGGC GATCTGCAGG ACCTGCTGGG CCGTTTCATC GGGCAGGAGC GGTCACGGGC CGCCTTTACC AGTTATGCCG AGGAACACGG CCTCCAGCTT CACGAGCGCG ACCATGCCCG TCCGGAACTG GTGGGCTACG CCGAGCGCCT GCTCTCCGGC AGTATCGGCT CCGCCTCGGC GCGGGTGATG GTCTCGTCCA TCATCAAGGG CGAGGCGCTC TCCTTCGAGG GGGTGATGGA GATCCTCGAC GCCACCTCAC GAGCCATCGA ATACTCCCGT CGCCTGGAGC AGAAATCCCG CGAGTTGGAA ATGGCCACCG ACGAACTGCG CCGGGCCAAT GAGCGGTTGA AGGAACTCGA CCACCTCAAG GACGAATTCG TCTCCATGGT CAGCCATGAG CTGCGCACCC CGCTGACCTC GATCCGCGCC TTCGGCGAGA TCCTGCTCAG CAATCCGGAG CTGGACGCGG AGCAGCGCAA GGAGTTCCTG CAGGTGGTGG TCAAGGAGAG CGAGCGGCTC ACCCGGTTGA TCAACCAGGT ACTGGATCTC TCCAAGATCG AGAGCGGCGC CGCGGAGTGG CACCTGGAGA CCCTGGACCT GAACCAGGTG GTTCAAGAGG CCGCTGACGC CACCCGGCAG ATCTTCCACG ACACCCGGGT GGACCTCCAG GTCAAGGCCC CCGAGCAGCC CACGATCATC ACCGGCGACC ACGACCGGCT CATTCAGCTG GTGATCAACC TGCTCTCCAA TGCCGCCAAG TTCACCGACC CGGACAACGG GCGGGTGGAG GTGTCCGTGG TCCCGGTCTC CGCGCGCAAG CTGGAACTGC GGGTACAGGA CAATGGCCCC GGCATCAGCG AGGCGGAGCA GCGCAAGATC TTCGACAAAT TCCATCAGGT GAGCAGCCAG CAGGCGGGCA AGCCCAAGGG CAGCGGCCTG GGGTTGGCCA TCTGTAAGCT CATCATGGAC GCGCACTCCG GCGATATCCG CGTCGAAAGC GAGCCCGGTG CCGGTGCCAC CTTTATCTGC GAGTTCCCCG TCGGCGATCG CCGCGCCGCC ACCGGCGAGG ACGAATCCCC CTGA
|
Protein sequence | MSEWLLVFTS FGYVGLLFAI AYSADKLGDT GRSLVNNPYV YTLSIAVYCT AWTFYGSVGR ATEAGVSFLT IYLGPTLMAV LWWLILRRII RISKLYRITS IADFIASRYG KSMALGGLAT FIAVVGITPY IALQLKAVSD SFVVMLSFPE PEVAATPLRF WDDKAFWVAG LLAIFAILFG TRHIDATERL EGMVVAVAFE SVIKLLAFLA AGFFVTFMLF DGPRALFEQA AEVDQIRDLM LLSSLPGGYV NWLSLTLLSM LAIIFLPRQW QVTVVENVNE DHLRTAAWLF PLYLLVINLF VLPIAFAGML YFSEGTVNPD SFVLALPLAE GYNWLALFIY IGGFSAATGM VIVATIALSI MVSNDLVVPA LLRMRGFRLS ADADLSRFIV NLRRGIICLI LLLGYFYYLL IADTYALVSI GLISFVAVAQ FAPPILIGIF WKGASRKGAM AGLIAGFVIW AYTLLLPSFV DSGLIPRALL EQGPAGIAWL NPYGLFGLTD LDPITHTVFW SLLANIGLLV GVSLFARQSD IERIQGALFV DVFLRSERDT RFWEGTATVG DLQDLLGRFI GQERSRAAFT SYAEEHGLQL HERDHARPEL VGYAERLLSG SIGSASARVM VSSIIKGEAL SFEGVMEILD ATSRAIEYSR RLEQKSRELE MATDELRRAN ERLKELDHLK DEFVSMVSHE LRTPLTSIRA FGEILLSNPE LDAEQRKEFL QVVVKESERL TRLINQVLDL SKIESGAAEW HLETLDLNQV VQEAADATRQ IFHDTRVDLQ VKAPEQPTII TGDHDRLIQL VINLLSNAAK FTDPDNGRVE VSVVPVSARK LELRVQDNGP GISEAEQRKI FDKFHQVSSQ QAGKPKGSGL GLAICKLIMD AHSGDIRVES EPGAGATFIC EFPVGDRRAA TGEDESP
|
| |