Gene Mlg_2624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2624 
Symbol 
ID4269533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2972808 
End bp2974997 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content69% 
IMG OID638127383 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_743454 
Protein GI114321771 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.608847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTC GAGCCCTGAA CGGCCTGAGA GGCGAGGGCT GGCTGCGGCT GGTCCTCTGC 
GTCGTGCTGC TGGCCTCCCT GTACCTGCTC TCGGTGGCCA CCGAGAACAC GGAGCGCTTC
GGCCGGCTCT ACGTCTGGCT GATTGCCGTC AACAGCATCG GCCTGGTGTT GCTGCTCAGC
GTGATCGCCG CCAATCTCTG GCGCCTCTAC CGCCAGCGGC GCCGGGGGCA GGTGGGCAGC
CGCTTGACGG TGCGCCTGGT GGCCGTGTTT GTGTTGCTGG CGGTGGTTCC GGTGTCGGTG
GTCTATTACT TCTCCATGCA GTTTCTGCGG GCCGGCATCG ACAGTTGGTT CGACGTCCGG
GTGGAACACG CCCTGGAGGA CGCCCTGACC CTCTCCCAGG CCTCGTTGGA TCTGCGCACC
CGCGATCTGC TGCGCCGGGT GGAGGCGGCC GGGCGGGAGC TCACCGACAC CCCCGAGTCA
CTGGCCGCCC TCACCCTGAA TGACCTCCGG CAACAGTTAC GCGCCACCGA GGTCACCCTG
CTGCGGTCCA GTGGCCAGAT TATCGCGACC AGCAGTGCAG AACCTTCCGC CACCCTGCCG
CACCGCCCGG AGGAGGAGAT CCTCCTGCAG CTGCGCCAGG GGCTGCCCTA TGTGGCCCTG
GATCCGATGG ACGACGGTGA GCTGCAGGCC CGGGTGGTGG TGCCCACCCG CGGGCCAGCC
GGCCCCGGCG ACACCCGCTT TCTCGAGGCC TATTTCCCCA TCCCCTCGCG CCTGGGGGCG
CTGGCCGAGG AGGTGCAGAC GGCCTACGGT GAGTACCGTG AGATCGCCTT CCTGCGCCAG
CCGCTCAAAG ACAGCTTCAT CCTCACCCTC TCGCTGGTGC TGTTGCTCAG CCTGCTGTTC
GCTGTCTGGA CCGCCTTCTT CCTCGCCCGT CGCATGGTGG CACCCATCCG CAATCTAGCG
GAGGGCACGC GGGCGGTGGC CGCCGGTGAT TACGGCACCC AGCTGCGTGC GGGTAGCCGG
GACGAGCTGG GCTTCCTGGT CGAGTCGTTC AACGATATGA GCCGCCGGAT CGCCCGCACC
CGGGACAGCG CCCGGCGCAG CCAGGCGCAG GTGGAGCGGC AGCGGGCCTA CCTGGAGACC
GTGCTGGGCC GGCTGTCGTC GGGGGTGCTG GCGCTGGATG CCCAGGGCCG GTTCCGTACC
AGCAACCGGG CGCTGGAGGA GATCCTGGGG GTCCGGCTCA CCCGCTATAC CGGGGGCAAC
CTCCAGCAGC TGGCGACCGC CGCACCGCGG CTGGCGGGGC TCGCCGAGGT GGTGGTGCGT
CACCTGGACA GTGGCGATAC CGAGTGGCGC GAACAGGTCA CCCTGCCGGC GGAGGAGGGC
GAACGGGTGC TGATGCTGAG CGGGGCCACT CTGCCGGGGC ATCGCCAGAG CGGCGGCCAC
GTGATCGTGA TCGACGACAT CACCACGCTC ATCCAGGCCC AGCGCGACGC CGCCTGGGGG
GAGGTGGCGC GGCGCCTGGC CCACGAGATC AAGAACCCGC TTACCCCCAT TCAGTTGTCC
GCCGAACGCC TGCGCCATAA GCTCTCCGGA CGCCTGGAGG GGCGCGACGC CGAGTTGCTG
GAGCGTTCCA CCGGTACCAT CGTGCGCCAG GTCAGCGCCA TGAAGGAGAT GGTCAACGCC
TTTTCGGAGT ACGCCCGGCC ACCGCGGCTG CGCCTGGAGC GGGTGGATCT CAACACCCTG
GTGGCGGAGG TGGCCGAGCT CTACCGCGGG GAGTCGGGGC TGGTGCTGGA AATGGCGCCG
GCTGAGGGGT TGCCTGCGAT CCGCGCCGAT GCCGGCCGGC TGCGGCAGCT GCTGCACAAT
CTTATCAAGA ACGCCCAGGA GGCGGCCGAG GGCGAGACCC GCGTGCGCCT GGAGACCGAT
TGGGAGGATG TGCCCGGTGG CCGCAAGGTC CGGCTGCGGG TCTGCGACAA CGGGCCGGGT
TTCAATGCCG AGATGCTGGC CAGTCTGTTC GAGCCCTATG TCACCACCAA GGCCCGGGGG
ACCGGCCTGG GGCTGCCCAT CGTCAAGAAG ATCGTCGAGG AGCATGGCGG CAGCATCAGT
GCCCGTAACA GCGACGGCGG GGGGGCCTGC ATCAGCATGC GCTTCCCTCT GCCTCAGGAA
CAGGGCGCGC TGAGTGCGCC CGGGGAGTAA
 
Protein sequence
MARRALNGLR GEGWLRLVLC VVLLASLYLL SVATENTERF GRLYVWLIAV NSIGLVLLLS 
VIAANLWRLY RQRRRGQVGS RLTVRLVAVF VLLAVVPVSV VYYFSMQFLR AGIDSWFDVR
VEHALEDALT LSQASLDLRT RDLLRRVEAA GRELTDTPES LAALTLNDLR QQLRATEVTL
LRSSGQIIAT SSAEPSATLP HRPEEEILLQ LRQGLPYVAL DPMDDGELQA RVVVPTRGPA
GPGDTRFLEA YFPIPSRLGA LAEEVQTAYG EYREIAFLRQ PLKDSFILTL SLVLLLSLLF
AVWTAFFLAR RMVAPIRNLA EGTRAVAAGD YGTQLRAGSR DELGFLVESF NDMSRRIART
RDSARRSQAQ VERQRAYLET VLGRLSSGVL ALDAQGRFRT SNRALEEILG VRLTRYTGGN
LQQLATAAPR LAGLAEVVVR HLDSGDTEWR EQVTLPAEEG ERVLMLSGAT LPGHRQSGGH
VIVIDDITTL IQAQRDAAWG EVARRLAHEI KNPLTPIQLS AERLRHKLSG RLEGRDAELL
ERSTGTIVRQ VSAMKEMVNA FSEYARPPRL RLERVDLNTL VAEVAELYRG ESGLVLEMAP
AEGLPAIRAD AGRLRQLLHN LIKNAQEAAE GETRVRLETD WEDVPGGRKV RLRVCDNGPG
FNAEMLASLF EPYVTTKARG TGLGLPIVKK IVEEHGGSIS ARNSDGGGAC ISMRFPLPQE
QGALSAPGE