Gene Mlg_1538 details


Gene Information

Locus tag: Mlg_1538
Symbol:
ID: 4270543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1752206
End bp: 1755097
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 62%
IMG OID: 638126296
Product: helicase domain-containing protein
Protein accession: YP_742377
Protein GI: 114320694
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
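
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is counted in the gene length but not in the protein length.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record above.
print(cds_lengths(1752206, 1755097))  # (2892, 963)
```

Under these assumptions the result agrees with the Gene Length (2892 bp) and Protein Length (963 aa) fields of the record.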


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCA CGGCCCACCA CGCCAAATAC TATGCCTATG ACCTGACGCG TCAGGCCCCT 
CCCGGCGCGG CGGACCAGCT CTCCATGTCG CTGTTCGATG CGTCGGTCGA TCTGAACCCC
CATCAGATCG AGGCAGCGAT GTTCGCCCTG CAGTCCCCGC TCTCCGAGGG AGTCGTGCTT
GCGGACGAGG TGGGCCTGGG CAAGACCATC GAGGCGGGCA TCGTGCTGTG CCAGCGGTGG
GCGGAGCGCC GCCGGCGCCT GCTGGTGATT TGTCCCGCTG CGATACGCAA GCAGTGGGCC
ACCGAGTTGC AGGAGAAATT CAACCTGCCC GCTGTGGTAA TGGATTCCCG CGCCTACCGG
CAATTCCAGC AACAGGGCCA ACCACGGCCA TTCGAGCAGC GGGCCGTGAT CCTGGTCTCC
TACCACTATG CGGCTCGTAT GCAGGACGAC CTGCGTATGG TGCGCTGGGA GCTGGTGGTC
ATTGATGAAG CCCACAAGCT GCGCAATGCC TACCGGCCCA GCAACAAAGT GGGCCAAGCC
ATTCGCTGGG CCACCGAAGG CCGCCGCAAA CTGCTTTTGA CCGCTACGCC ATTGCAGAAC
AGCCTGATGG AGCTGTACGG GCTGGCGAGC CTGATCGATG ATCACATCTT CGGCGATCCC
AACGCCTTCC GTTCCCAGTA CGTCAATGCC GGTGGCGATC TGGCCGGGCT GCGCCAGCGT
CTGGCGGGCT TTTGCAAACG CACCTTGCGC CAGGAGGTGC TTGAGTATGT GCGCTACACT
GAGCGCCGGG CGCTGACCCA ACCCTTCCGG CCAACTGATG CCGAGCAGAA ACTCTACGAC
CGCATCTCCG AGTTTCTGCA GCGGGAAAAC ACCTACGCCA TTCCCGACCG TCAGCGCCAC
CTGATCGTGC TGATCCTGCG CAAGCTGTTG GCCTCCAGCT CAAACGCGAT CGCCGGCACT
CTCGAGAGCC TTCGCGACCG GCTGATCGAC CTGCGCGACA GTGAGCAACT GGAGCTGGAC
TGGAGCAGCC AACTGATCGA AAGCGAGGAA ATGGAAACCG AGCTGCTGGA CGAGTGGCTG
CCGCCGGAGG AGAACGGCGA GGCCAATGGT AAGACCGAAG ACAATCGTCC GGAACCGAAG
AAGCTGCGCG ATGAAATCAG CGAGATCGAC AGTCTGGCCA AGGCTGCCCG CGCCATTGGT
GTGGATGCCA AATCCTGCGC ACTGCTCACC GCGCTCGATG TCGGTTTCCA GGAGATGGGT
CGAATGGGGG CGCGGGAAAA GGCGTTGATC TTCACCGAAT CCCGCCGAAC CCAGGACTAT
CTGAAAGACT ATCTGGAGCA GAACGGTTAT GCCGGGGAGG TGCTGCTGTT CAACGGCACC
AATGCCGGCC CTGAAGCCAA GGCCATCTAC GAGAACTGGG TCGGCAAAAA CCAGGAAAGC
GGCCGCGCCA CCGGTTCCCG CGCCATCGAT GCCCGGCTGG CGCTGGTGGA GCATTTCCGC
GACAACGCCA AAATCATGAT TGCCACGGAA GCAGCGGCGG AAGGCGTCAA CCTGCAGTTC
TGCTCGCTGG TGATCAATTA CGACCTGCCC TGGAACCCCC AGCGTATCGA ACAGCGCATT
GGTCGCTGTC ACCGTTACGG CCAGCAGCAC GATGTGATCG TGGTCAACTT CCTCAACGAG
CGCAACGAGG CAGACCAGCG CGTGCATGAG TTGTTGACGG AGAAGTTCAG TCTCTTCAAC
GGTCTGTTTG GTGCATCCGA TGAAGTGCTG GGCCAGCTTG AGTCTGGCGT CGATTTCGAG
AAGCGCATCC TCGCCATCTA CCAGCAGTGC CGCACCTCGG ATGAAATCGA AACCGCCTTT
CGCCAGCTCC AAGAAGTACT GGAGGAGCAG ATCAAGAGCC GCATCCAGGA AACCCGCCAG
ACCCTGCTGG AGCATTTCGA CGAAGACGTG CACGCCCGGC TGCGCTTGCG GCTGGAAGAC
GCCCAGGCCC AGCTCGACCG CATCGGCCAG CGTTTCTGGC AAGTCACCGG GCACATACTG
GATGGCCGCG CACGCTTTGA TGATGCCAGG CTGGTCTTCG ATCTCCACGA GCCGCCAAAG
GCCGACATCC GGCCTGGCCG CTACCACCTG ATCTCCAAGA CGCGCAGTGG TGAGCAGGGC
GAATCGCCGG ACTACGGCTA CTACCTGTAC CGCCTCTCCC ATCCGCTGGG CGAGCACGTC
ATCGAATCGG CCAAGCAGCA GCCGACGCCC ACAGCGCATC TGCGCTTCAA CATCACCGTC
CATCCGGGGC GCATTGCCCT GGTGGAAGCA CTCAAGGGTA AGTCCGGTTG GCTGACATTG
GCTCGGCTGA CCATCGAATC TTTCGAAACC GAGGAATACC TGCTGTTCTC CGGCATCGAC
GACGAAGGCC GCTCGCTGGA CAATGAAACC TGCGCCAAGC TGTTTAACGT CGCCGCTGAC
AATCGTGGCC CGGCGGACTT GCCCGCCGAG GTGGAGCAGC GCCTTGCCGC TGAGCTTGCC
CAACACCAGC GCGCCACTGC GAACCGCTCG CTGGAGGCCA ACAACCAGCA CTTCAACGCC
GCCCGCGAAC GCCTGGAACA ATGGGCGGAA GACAAGATGC TGGCCACCGA GAAAACGCTC
AAGGACACAA AAGAACAGAT CAAGGCCCTG CGCCGTCAGG CCCGACAGGC CGAAACCCTG
GAAGATCAGC ACGACATCCA GGAACGTCTG AAACAACTCG AGCAGCGCCA GCGCAAGCAG
CGCCGGGACA TCTTCAAGGT GGAAGACGAA ATCGAAGAAC AGCGCGACAG CCTCATTGCC
GCGCTGGAAA AACGCCTTGC CCGAGACCAG CGTAGTGAGC GCCTGTTTAC GCTGCGCTGG
TCCGTCGTAT GA
 
Protein sequence
MTITAHHAKY YAYDLTRQAP PGAADQLSMS LFDASVDLNP HQIEAAMFAL QSPLSEGVVL 
ADEVGLGKTI EAGIVLCQRW AERRRRLLVI CPAAIRKQWA TELQEKFNLP AVVMDSRAYR
QFQQQGQPRP FEQRAVILVS YHYAARMQDD LRMVRWELVV IDEAHKLRNA YRPSNKVGQA
IRWATEGRRK LLLTATPLQN SLMELYGLAS LIDDHIFGDP NAFRSQYVNA GGDLAGLRQR
LAGFCKRTLR QEVLEYVRYT ERRALTQPFR PTDAEQKLYD RISEFLQREN TYAIPDRQRH
LIVLILRKLL ASSSNAIAGT LESLRDRLID LRDSEQLELD WSSQLIESEE METELLDEWL
PPEENGEANG KTEDNRPEPK KLRDEISEID SLAKAARAIG VDAKSCALLT ALDVGFQEMG
RMGAREKALI FTESRRTQDY LKDYLEQNGY AGEVLLFNGT NAGPEAKAIY ENWVGKNQES
GRATGSRAID ARLALVEHFR DNAKIMIATE AAAEGVNLQF CSLVINYDLP WNPQRIEQRI
GRCHRYGQQH DVIVVNFLNE RNEADQRVHE LLTEKFSLFN GLFGASDEVL GQLESGVDFE
KRILAIYQQC RTSDEIETAF RQLQEVLEEQ IKSRIQETRQ TLLEHFDEDV HARLRLRLED
AQAQLDRIGQ RFWQVTGHIL DGRARFDDAR LVFDLHEPPK ADIRPGRYHL ISKTRSGEQG
ESPDYGYYLY RLSHPLGEHV IESAKQQPTP TAHLRFNITV HPGRIALVEA LKGKSGWLTL
ARLTIESFET EEYLLFSGID DEGRSLDNET CAKLFNVAAD NRGPADLPAE VEQRLAAELA
QHQRATANRS LEANNQHFNA ARERLEQWAE DKMLATEKTL KDTKEQIKAL RRQARQAETL
EDQHDIQERL KQLEQRQRKQ RRDIFKVEDE IEEQRDSLIA ALEKRLARDQ RSERLFTLRW
SVV
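
The gene and protein sequences above are printed in blocks of ten characters, and the record gives translation table 11. The following is a minimal sketch of how the two sequences relate: it strips the block spacing, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid string is the standard assignment, which table 11 shares with table 1; table 11's alternative start codons are not handled here, which should not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGACGATCA CGGCCCACCA CGCCAAATAC"))  # MTITAHHAKY
print(translate("TCCGTCGTAT GA"))                     # SVV
```

The two printed fragments match the first ten and the last three residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.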