Gene Mlg_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0129 
Symbol 
ID4269822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp146358 
End bp148046 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content64% 
IMG OID638124853 
Producttransposase, IS4 family protein 
Protein accessionYP_740974 
Protein GI114319291 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACACGC GCATCACCAA GTCGGGTCCA CGCCGCTACC TGCAGTTGGT GCAGGGGTAC 
CGGGACGACA ATGGCAAGGT AAAGCAGCGG GTCGTTGCTA ATCTCGGCCG CCTTGATCAG
TTGGAAGCGT CAGATCTCGA CGCCCTCATC AAAGGGCTTC AGCGGGCGGT GGGGCGCCCC
GAGTCACTGC CCGAGGCCCC GCGCTTCGAC ACGGCCAAGG CCTTTGGTGA TGTCTGGGCA
CTGCACCAGC TCTGGCAGGA GTTGGGGCTG GGTGAGGCCC TGAAACGCGC GCTGCGCTCT
TCGCGCCGCC AGTTCGATGC CGAGGCCCTT ATCCGAGCGA TGGTCTTCAA TCGTCTAGCT
GCCCCGTGCA GCAAACTCGG CGTACTGGAA TGGCTCCGTG AGGAAGCCAG TGTCCCTGGG
CTTGATAGCG AAGCCATCCA TCATAAACAG CTCCTGCGCG CCATGGATGC GTTGGAAGAG
CACAAGGAAG CCGTCGAACG GTCGGTGGCG GCCCAGTTGC GCCCGCTTCT GGACCAAGAG
CTCAGCGTCA TTTTCTATGA CCTAACCACG GTGCGCATTC ACGGCACCAC CAGGGTGGCC
GATGACATCC GCGACTACGG CCTGAGCAAG GAGACCGGCG GTATTGGCCG CCAATTCGCC
CTCGGCGTTG TCCAGACCGC CGAGGGCCTG CCCATCGCCC ATGAGGTCTT CGAGGGTAAC
GTCGCCGAGA CGCGTACTTT GGCGCCAATG ATTGAGCGCC TGCTGGAGCG CTTTGCCCTC
ACGCGGGTAG TGGTCGTCGC CGACCGGGGC CTGCTGTCGT TCGACAACAT CGACACCCTG
GATGCCTTGG GCGCCGAGCA AGGGCTGGCG GTGGATTACA TCCTGGCCGT CCCCGGTCGC
CGCTACCGCG ATTTCGCCAA GTTGATGAGC ACGCTGCATC CACAACTGGC GGCGGCGGTC
ACCGAGCCGA GTGCCGACGT GGTGACCGAG ACCACGTGGG AGGGTCGACG GCTGGTGGTA
GCACACAATG CCGAGCGGGC GGCAGAGCAG ACCGTCCAGC GCCGCGAAAC CATCGAGGAA
CTGGATGCAC TGGGGGCACA ACTCGCGGAA CGCCTCGACA ACCAGGATGC TGGCCAGCCG
GGTCGGGGTA GACGTTCCAC TGATCGCAGC GCCTACCAGC GCTTCCACAA GGCAGTACTG
GACAAGCGCA TGGGCGCCAT TGTTAAAACG GACCTCGGGG CGCCCCGGTT CAGTTACAGC
ATCGACACCG AGGCGTGGGC TCGGGCCGAG CAGCTTGATG GCAAACTGCT GCTGGTCACC
AGCCTCAGCG ACATGGAGGC CGAGGCTGTG GTGGAGCGTT ACCGCTCCCT GGCCGACATC
GAACGCGGCT TCCGGGTGCT CAAAAGCGAG ATTGAAATTG CCCCGGTTTA CCACCGCCTG
CCAGAACGTA TCCGGGCACA CGCGATGATT TGTTTCCTGG CCCTGGTGCT CTACCGCGTC
CTGCGTGGGC GGCTGAAGGC TGCCGGAAGC CCCCACTCAC CGGAGAGGCT GCTGCGCGGT
CTGAGGCAGA TCCAACGCCA CACCGTCCAT GTGGGCAGCC AATCCTACGA GGGCCTGACA
CGGCCCAGCC AGGAGCAACT GGACCTATTC GAGGTCGCCG GCGTGGAGGT GCCGAAGGAG
CCCTGCTGA
 
Protein sequence
MYTRITKSGP RRYLQLVQGY RDDNGKVKQR VVANLGRLDQ LEASDLDALI KGLQRAVGRP 
ESLPEAPRFD TAKAFGDVWA LHQLWQELGL GEALKRALRS SRRQFDAEAL IRAMVFNRLA
APCSKLGVLE WLREEASVPG LDSEAIHHKQ LLRAMDALEE HKEAVERSVA AQLRPLLDQE
LSVIFYDLTT VRIHGTTRVA DDIRDYGLSK ETGGIGRQFA LGVVQTAEGL PIAHEVFEGN
VAETRTLAPM IERLLERFAL TRVVVVADRG LLSFDNIDTL DALGAEQGLA VDYILAVPGR
RYRDFAKLMS TLHPQLAAAV TEPSADVVTE TTWEGRRLVV AHNAERAAEQ TVQRRETIEE
LDALGAQLAE RLDNQDAGQP GRGRRSTDRS AYQRFHKAVL DKRMGAIVKT DLGAPRFSYS
IDTEAWARAE QLDGKLLLVT SLSDMEAEAV VERYRSLADI ERGFRVLKSE IEIAPVYHRL
PERIRAHAMI CFLALVLYRV LRGRLKAAGS PHSPERLLRG LRQIQRHTVH VGSQSYEGLT
RPSQEQLDLF EVAGVEVPKE PC