Gene Mlg_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1106 
Symbol 
ID4269813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1292300 
End bp1293631 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content66% 
IMG OID638125858 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_741948 
Protein GI114320265 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR02966] phosphate regulon sensor kinase PhoR 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAGG GAAACCCCTG GCCCCGGGTG CTGTCCCGCC TGTCCGCATT GTTCCTGCTG 
GCGGCCCTGG TCGGTTGGTG GGCGGAGGCG CTGAGCGAGG CGCTGTTGCT GGCGGCCCTG
ACGATGCTCG CCTGGGAAGG GTATAACCTC TACCGTTTTG AGGCCTGGCT GCGTAAGGGG
CGGCAATTGG CACCCCCCGC CGCCCATGGG CTCTGGCGGC ACCTCTTCGA CGCCCTGTAC
CAGCGCCAGC AGCGCCAGCG CCAACGTCGG CGTAATCTGC AGCGCCTGAT GGCCCGCTAC
CGGGACTCGG CCCGCGCCAT GCCCGACGCC CTGGTGGTGC TTAGCGGCGA CTACCGCATC
GATTGGTGGA ACCCGGCCGC CGCACGTTTG CTGGGGCTGC GCTGGCCCGG CGACAGCCAC
CAGCGGATCG CCAACATCTA CCGCCATCCC GATTTCGTGG CGTTCCTCTC CTCCCGCAGT
CCGGTGACCC GCGAGGTCAG CCTCCCGTCA CCGTTGGACA GCCAGGTCTG GCTGGAAATC
CGGCTGGTGC CCTATGGCAC CGACCGCTAC CTGCTGCTGG CGCGTGATGT CACCCACCTG
CACCGGCTGG AGACCATGCG TCGGGACTTT GTGGGCAATG TCTCCCACGA GTTGCGGACG
CCGTTGACGG TCATCTACGG CGTGGCGGAA ACCTTGAACG AGGAGATGGC GGACGACCCC
GAAGTTGGCG ACATGCTGCG TTTGTTGCAG GAGCAGTCCG AGCGCATGCG CCGGCTGGTG
GATGACCTGC TGCTGCTCTC CCGCCTGGAG ACCGGTGCCA CGCCCAGCCA CCCGGAGTGG
GTGGACATGC CGCGGTTACT GGAGGAACTC GTCGAGGACG GCAAGGCACT TTCGGGCAGC
CGCCATCACC GTTTTGAGTT GACGTGCGAG CCGGGGCTGT TGCTGGAGGG GTGTGAGAGC
GAGCTGCGCA GCGCCTTCTC CAACCTTATT TTTAATGCCG TTAAATACAC CCCGGGGGAC
GGCTATATCG GCGTTCGGTG GTATGCCGAC AAGGCGGGTG CCCATTTATC TGTCACCGAC
AATGGCATTG GGATACCGGC AGCTCATATC CCAAGGCTCA CCGAGCGTTT TTACCGGGTG
GACAGCGCCC GTTCCAAGGC CAGTGGCGGT ACCGGTCTGG GGCTCGCCAT CGTCAAGCAT
GTGCTCAATC GCCACCGGGC GCAGCTTACC GTGCGCAGTC AGCCGGGGCA GGGCAGCACC
TTCATTTGTA CCTTTCCCAC CGCATTGCTG CGCCGGGCCG GCACCCGGAC GGCCCGGCCT
CCCGCAAGCT AG
 
Protein sequence
MMQGNPWPRV LSRLSALFLL AALVGWWAEA LSEALLLAAL TMLAWEGYNL YRFEAWLRKG 
RQLAPPAAHG LWRHLFDALY QRQQRQRQRR RNLQRLMARY RDSARAMPDA LVVLSGDYRI
DWWNPAAARL LGLRWPGDSH QRIANIYRHP DFVAFLSSRS PVTREVSLPS PLDSQVWLEI
RLVPYGTDRY LLLARDVTHL HRLETMRRDF VGNVSHELRT PLTVIYGVAE TLNEEMADDP
EVGDMLRLLQ EQSERMRRLV DDLLLLSRLE TGATPSHPEW VDMPRLLEEL VEDGKALSGS
RHHRFELTCE PGLLLEGCES ELRSAFSNLI FNAVKYTPGD GYIGVRWYAD KAGAHLSVTD
NGIGIPAAHI PRLTERFYRV DSARSKASGG TGLGLAIVKH VLNRHRAQLT VRSQPGQGST
FICTFPTALL RRAGTRTARP PAS