Gene Mlg_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0666 
Symbol 
ID4268278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp730123 
End bp732876 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content64% 
IMG OID638125415 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_741510 
Protein GI114319827 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0591] Na+/proline symporter
[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGT GGCTGCTCGT TTTCACCTCC TTCGGCTATG TGGGGCTATT GTTCGCCATC 
GCCTACTCGG CGGACAAGCT GGGCGATACC GGGCGCTCGC TGGTCAACAA CCCTTATGTC
TACACCCTGT CCATCGCGGT CTACTGCACC GCCTGGACGT TCTATGGCAG TGTCGGTCGG
GCCACCGAGG CCGGGGTGTC GTTCCTGACG ATCTACCTCG GCCCCACTCT GATGGCGGTC
CTCTGGTGGC TGATCCTCCG CCGGATCATC CGGATTAGTA AGCTCTACCG CATCACCTCC
ATCGCCGACT TCATCGCCTC GCGCTACGGG AAATCCATGG CGCTGGGCGG ACTGGCCACC
TTCATCGCGG TGGTGGGCAT CACCCCCTAT ATCGCCCTGC AGCTCAAGGC GGTCTCCGAC
AGCTTCGTGG TCATGCTGAG CTTCCCGGAA CCGGAGGTGG CGGCCACGCC GTTGCGCTTC
TGGGACGACA AGGCCTTCTG GGTGGCCGGG CTGCTGGCGA TCTTCGCCAT CCTGTTCGGG
ACCCGGCACA TCGACGCCAC CGAGCGGCTG GAGGGCATGG TGGTTGCGGT GGCCTTCGAG
TCGGTGATCA AGCTGCTGGC CTTCCTGGCG GCGGGCTTCT TCGTCACCTT CATGCTCTTC
GACGGGCCGC GGGCGCTGTT CGAGCAGGCC GCCGAGGTGG ACCAGATCCG CGACCTGATG
CTGCTCTCCA GCCTGCCGGG TGGCTACGTG AACTGGCTCA GCCTCACCCT GCTCTCCATG
CTGGCCATCA TCTTCCTGCC CCGGCAATGG CAGGTCACGG TGGTGGAGAA CGTCAACGAG
GACCACCTGC GCACCGCCGC CTGGTTGTTC CCACTCTACC TGCTGGTCAT AAACCTGTTC
GTACTGCCCA TCGCCTTCGC GGGCATGCTC TATTTCTCCG AGGGCACCGT CAATCCGGAC
AGCTTCGTGC TGGCACTGCC GCTGGCCGAG GGGTATAACT GGCTGGCCCT GTTTATCTAC
ATCGGCGGTT TCTCGGCGGC CACCGGCATG GTCATCGTGG CCACTATCGC CCTGTCGATC
ATGGTCAGCA ACGACTTGGT GGTACCGGCG CTCCTGCGCA TGCGGGGTTT TCGCCTGTCC
GCCGACGCCG ATCTCTCGCG CTTCATCGTC AACCTGCGCC GCGGGATCAT CTGCCTGATC
CTGTTATTGG GTTATTTCTA TTACCTGCTC ATTGCGGATA CCTACGCCCT GGTCTCCATC
GGCCTGATCT CCTTCGTGGC CGTGGCCCAG TTCGCCCCAC CGATCCTGAT CGGGATCTTC
TGGAAGGGTG CCAGCCGCAA GGGTGCCATG GCCGGGCTGA TCGCCGGCTT CGTCATCTGG
GCCTATACCC TGTTGCTGCC CTCCTTCGTT GACTCCGGGC TCATCCCCAG GGCCCTGCTG
GAGCAGGGCC CCGCCGGGAT TGCCTGGCTC AACCCCTACG GCCTGTTCGG CCTCACCGAC
CTGGACCCGA TCACGCATAC GGTGTTCTGG AGCCTGCTGG CCAACATCGG GCTGCTGGTG
GGGGTCTCGT TGTTCGCCCG CCAGTCGGAT ATCGAACGGA TCCAGGGCGC GCTGTTCGTC
GATGTCTTCC TGCGCTCCGA GCGCGACACC CGCTTCTGGG AAGGCACCGC GACCGTGGGC
GATCTGCAGG ACCTGCTGGG CCGTTTCATC GGGCAGGAGC GGTCACGGGC CGCCTTTACC
AGTTATGCCG AGGAACACGG CCTCCAGCTT CACGAGCGCG ACCATGCCCG TCCGGAACTG
GTGGGCTACG CCGAGCGCCT GCTCTCCGGC AGTATCGGCT CCGCCTCGGC GCGGGTGATG
GTCTCGTCCA TCATCAAGGG CGAGGCGCTC TCCTTCGAGG GGGTGATGGA GATCCTCGAC
GCCACCTCAC GAGCCATCGA ATACTCCCGT CGCCTGGAGC AGAAATCCCG CGAGTTGGAA
ATGGCCACCG ACGAACTGCG CCGGGCCAAT GAGCGGTTGA AGGAACTCGA CCACCTCAAG
GACGAATTCG TCTCCATGGT CAGCCATGAG CTGCGCACCC CGCTGACCTC GATCCGCGCC
TTCGGCGAGA TCCTGCTCAG CAATCCGGAG CTGGACGCGG AGCAGCGCAA GGAGTTCCTG
CAGGTGGTGG TCAAGGAGAG CGAGCGGCTC ACCCGGTTGA TCAACCAGGT ACTGGATCTC
TCCAAGATCG AGAGCGGCGC CGCGGAGTGG CACCTGGAGA CCCTGGACCT GAACCAGGTG
GTTCAAGAGG CCGCTGACGC CACCCGGCAG ATCTTCCACG ACACCCGGGT GGACCTCCAG
GTCAAGGCCC CCGAGCAGCC CACGATCATC ACCGGCGACC ACGACCGGCT CATTCAGCTG
GTGATCAACC TGCTCTCCAA TGCCGCCAAG TTCACCGACC CGGACAACGG GCGGGTGGAG
GTGTCCGTGG TCCCGGTCTC CGCGCGCAAG CTGGAACTGC GGGTACAGGA CAATGGCCCC
GGCATCAGCG AGGCGGAGCA GCGCAAGATC TTCGACAAAT TCCATCAGGT GAGCAGCCAG
CAGGCGGGCA AGCCCAAGGG CAGCGGCCTG GGGTTGGCCA TCTGTAAGCT CATCATGGAC
GCGCACTCCG GCGATATCCG CGTCGAAAGC GAGCCCGGTG CCGGTGCCAC CTTTATCTGC
GAGTTCCCCG TCGGCGATCG CCGCGCCGCC ACCGGCGAGG ACGAATCCCC CTGA
 
Protein sequence
MSEWLLVFTS FGYVGLLFAI AYSADKLGDT GRSLVNNPYV YTLSIAVYCT AWTFYGSVGR 
ATEAGVSFLT IYLGPTLMAV LWWLILRRII RISKLYRITS IADFIASRYG KSMALGGLAT
FIAVVGITPY IALQLKAVSD SFVVMLSFPE PEVAATPLRF WDDKAFWVAG LLAIFAILFG
TRHIDATERL EGMVVAVAFE SVIKLLAFLA AGFFVTFMLF DGPRALFEQA AEVDQIRDLM
LLSSLPGGYV NWLSLTLLSM LAIIFLPRQW QVTVVENVNE DHLRTAAWLF PLYLLVINLF
VLPIAFAGML YFSEGTVNPD SFVLALPLAE GYNWLALFIY IGGFSAATGM VIVATIALSI
MVSNDLVVPA LLRMRGFRLS ADADLSRFIV NLRRGIICLI LLLGYFYYLL IADTYALVSI
GLISFVAVAQ FAPPILIGIF WKGASRKGAM AGLIAGFVIW AYTLLLPSFV DSGLIPRALL
EQGPAGIAWL NPYGLFGLTD LDPITHTVFW SLLANIGLLV GVSLFARQSD IERIQGALFV
DVFLRSERDT RFWEGTATVG DLQDLLGRFI GQERSRAAFT SYAEEHGLQL HERDHARPEL
VGYAERLLSG SIGSASARVM VSSIIKGEAL SFEGVMEILD ATSRAIEYSR RLEQKSRELE
MATDELRRAN ERLKELDHLK DEFVSMVSHE LRTPLTSIRA FGEILLSNPE LDAEQRKEFL
QVVVKESERL TRLINQVLDL SKIESGAAEW HLETLDLNQV VQEAADATRQ IFHDTRVDLQ
VKAPEQPTII TGDHDRLIQL VINLLSNAAK FTDPDNGRVE VSVVPVSARK LELRVQDNGP
GISEAEQRKI FDKFHQVSSQ QAGKPKGSGL GLAICKLIMD AHSGDIRVES EPGAGATFIC
EFPVGDRRAA TGEDESP