Gene Mlg_1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1449 
Symbol 
ID4270230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1654151 
End bp1655161 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content66% 
IMG OID638126205 
ProductLysR family transcriptional regulator 
Protein accessionYP_742288 
Protein GI114320605 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.319275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGGCA GTGGCAGGCT CTACTATAAA CACAACCGAC TCAAGCAGTT GCGGGCGTTT 
TGTCATGCCG CCCAGGAGGG CAGCATCTCC CGGGCGGCGC AGCGACTGGA GTTGAGCCAA
CCCTCGGTGT CGCTCCAGAT CCAGGCCCTG GAGCGGGAAC TGGGCATCGC GCTGTTCGAG
CGCCGGGGCC CGCGCATCCG GCTGACCCCG GATGGCGAGA CCCTGTACGA ACTGGCCCAG
CCTTTGGTGG AGGGCGTGGA CGCCTTGCCC GAGCGGTTCG CCGCCCGCCA CCAGCGGCTG
CAGACGGGAC GGCTGGACAT CGCCGCCGGC GAGTCAACCA CGCTCTACAT CCTGCCCGAT
CTGCTCAAAC AGTTCATGGC CCGCTACCCC GGGGTCCATG TCAAGCTGCA CAACCTGATC
GGTCGCGACA TGATCTCCGC CCTGCAGCAC GACGAGGTGG ACCTGGCCGT GGGCTCCACC
CTGGATCTGC CCGAGGACCT GAGCTACCGC GCCATCTATA CGTATGACCT GCGCCTGATC
CTCCCGCTGG GGCACCCGCT GGCGGAGAAA TCCGAGCTCA CCCTGGCCGA TCTCGCCTCC
GGCGAACTGA TCCTGCCTCC CCGCCACCTG ACCACCTGGC GGCTGGTGAA CCTGGTCTTC
CAACAGCACA ACATTCCCTA CCGGGTGCGC CTGGAGGTGG GTGGCTGGGA GATCATCAAG
CGCTACGTGG AGTTGGGGTT TGGCATCGGC ATTGCCAGCA GTATCTGTCT CAGTGGCTCG
GAACGCCTGC ACGTCAGGTC CCTGCCCGAG GTCTTCCCGC AGCGCACCTA CGGGGTCATG
CTGCGCAGGG GCCGCTATCT TTCGCCCCAG GCCAAGCGTT TCCTGGAGGT GATGGCGCCG
GATATCTTCT CCGGCGAGGC GGACCTGGAC GGCGAACGCC GCAAGGATGT CGCCAGCGAA
TCGGTCTTCA TCCCCCGGGC CAGCGAGCAT GCCCCGGACA AGGAGCGCTG A
 
Protein sequence
MAGSGRLYYK HNRLKQLRAF CHAAQEGSIS RAAQRLELSQ PSVSLQIQAL ERELGIALFE 
RRGPRIRLTP DGETLYELAQ PLVEGVDALP ERFAARHQRL QTGRLDIAAG ESTTLYILPD
LLKQFMARYP GVHVKLHNLI GRDMISALQH DEVDLAVGST LDLPEDLSYR AIYTYDLRLI
LPLGHPLAEK SELTLADLAS GELILPPRHL TTWRLVNLVF QQHNIPYRVR LEVGGWEIIK
RYVELGFGIG IASSICLSGS ERLHVRSLPE VFPQRTYGVM LRRGRYLSPQ AKRFLEVMAP
DIFSGEADLD GERRKDVASE SVFIPRASEH APDKER