Gene Mlg_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1541 
Symbol 
ID4270546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1761224 
End bp1762495 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content58% 
IMG OID638126299 
Productputative transcriptional regulator 
Protein accessionYP_742380 
Protein GI114320697 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.509332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATA CCGCGGCTGC GCTCTTGGAG AAGATACAGC TTGGGGAAGA TTCCTTCCTT 
GAGCTGAAGG AGGTTCGGAT CGCCGGGAAG CGAGTGACCG CACCACATCG CAACTCACTG
GTGGACGAAT TGGCAGCGTT TGCGAATGCC AAGGGCGGTG TCTGTGTGCT CGGTGTGGAT
GATGCAACGC GAGAAATTCT CGGGATTCCG CGAGACAAGC TGGACCTTGT GACTGACTAT
GTCCGGCAGA CCTGCCTGGA CTCAGTGACA CCACCGTTGA CCCCAGTTAT TGAGCGGCTA
CTTCTGCCTA CGACCACCGG TGACGAGGTG GCCGTATTGA AGGTGGAAAT CGGTCGCAGC
TTGTGGGTGC ATCGCAGCCC CGGCGGCTAT ATGCATCGGG TGGGTGATGA AAAGCGTGAA
ATGGCGCCGG ACTTCCTCGC CCGACTGTTT CAGCAGCGCA GTCAGGCCCG AATCATTCGC
TTTGACGAGC AGCCAGTGCC GAATGCCACC CTGGATGACT TGAATGAGGC GCTTTGGCAA
CGATTCGCAA CCGCACGTAC CCGGGATAAC CGCGATGACC TGCTGCGAAA GCTCGGCATG
GCCCGCATGG ATGACGATGT CTTGCGCCCG TCGGTGGCGG GTATCCTTCT GGCTTCCGAT
GATCCTCGCC ATTGGCTGCC CAATGCGTTT ATTCAGGCGG TCGCCTACCG GGGTACTGAA
ATTCGACCGG TAGGCGACCA AGCGTACCAG CTTGATGCAG CAGATCTCAC CGGGCCTTTG
GATCAACAGG TGCTTTCTGC CTGCCATTTT GTCAGCAAAA ATATGCGAGT CGCCGCGTCC
AAAAGCGTGG GCCGAGAGGA TGTTCCTCAG TTTGATATGA CAGCCGTATT CGAGGCCATC
GTCAACGCGG TCGCGCATCG TGACTACGCT ATGCAGGACG CCAAGATCCG GTTGCGCGTA
TTCGCAGACC GCATGGAACT GTACTCCCCC GGAGCCATCC CTAACACCAT GACGGTGGAT
AGCCTACCGT ACCGCCAGGC TGCACGGAAT GAAACGATTA CCAGTTTGCT GGCGAAATGT
CGGGTGCCGG ATGAAGGCGG GCTGGGAACA GGCCGATCCA CCATGATGGA TAAGCGGGGC
GAAGGCGTGT CCATCATCCT TCAAAATAGT GAAATGCTGT CCGGTCGTGT CCCGGAATAC
AGCCTGGTTG ATGATAGTGA GCTCCGCCTG GTGATTTACG CACCAGCGGA AACGGATGGA
GGGGAAGACT GA
 
Protein sequence
MFDTAAALLE KIQLGEDSFL ELKEVRIAGK RVTAPHRNSL VDELAAFANA KGGVCVLGVD 
DATREILGIP RDKLDLVTDY VRQTCLDSVT PPLTPVIERL LLPTTTGDEV AVLKVEIGRS
LWVHRSPGGY MHRVGDEKRE MAPDFLARLF QQRSQARIIR FDEQPVPNAT LDDLNEALWQ
RFATARTRDN RDDLLRKLGM ARMDDDVLRP SVAGILLASD DPRHWLPNAF IQAVAYRGTE
IRPVGDQAYQ LDAADLTGPL DQQVLSACHF VSKNMRVAAS KSVGREDVPQ FDMTAVFEAI
VNAVAHRDYA MQDAKIRLRV FADRMELYSP GAIPNTMTVD SLPYRQAARN ETITSLLAKC
RVPDEGGLGT GRSTMMDKRG EGVSIILQNS EMLSGRVPEY SLVDDSELRL VIYAPAETDG
GED