Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0129 |
Symbol | |
ID | 4269822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 146358 |
End bp | 148046 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638124853 |
Product | transposase, IS4 family protein |
Protein accession | YP_740974 |
Protein GI | 114319291 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACACGC GCATCACCAA GTCGGGTCCA CGCCGCTACC TGCAGTTGGT GCAGGGGTAC CGGGACGACA ATGGCAAGGT AAAGCAGCGG GTCGTTGCTA ATCTCGGCCG CCTTGATCAG TTGGAAGCGT CAGATCTCGA CGCCCTCATC AAAGGGCTTC AGCGGGCGGT GGGGCGCCCC GAGTCACTGC CCGAGGCCCC GCGCTTCGAC ACGGCCAAGG CCTTTGGTGA TGTCTGGGCA CTGCACCAGC TCTGGCAGGA GTTGGGGCTG GGTGAGGCCC TGAAACGCGC GCTGCGCTCT TCGCGCCGCC AGTTCGATGC CGAGGCCCTT ATCCGAGCGA TGGTCTTCAA TCGTCTAGCT GCCCCGTGCA GCAAACTCGG CGTACTGGAA TGGCTCCGTG AGGAAGCCAG TGTCCCTGGG CTTGATAGCG AAGCCATCCA TCATAAACAG CTCCTGCGCG CCATGGATGC GTTGGAAGAG CACAAGGAAG CCGTCGAACG GTCGGTGGCG GCCCAGTTGC GCCCGCTTCT GGACCAAGAG CTCAGCGTCA TTTTCTATGA CCTAACCACG GTGCGCATTC ACGGCACCAC CAGGGTGGCC GATGACATCC GCGACTACGG CCTGAGCAAG GAGACCGGCG GTATTGGCCG CCAATTCGCC CTCGGCGTTG TCCAGACCGC CGAGGGCCTG CCCATCGCCC ATGAGGTCTT CGAGGGTAAC GTCGCCGAGA CGCGTACTTT GGCGCCAATG ATTGAGCGCC TGCTGGAGCG CTTTGCCCTC ACGCGGGTAG TGGTCGTCGC CGACCGGGGC CTGCTGTCGT TCGACAACAT CGACACCCTG GATGCCTTGG GCGCCGAGCA AGGGCTGGCG GTGGATTACA TCCTGGCCGT CCCCGGTCGC CGCTACCGCG ATTTCGCCAA GTTGATGAGC ACGCTGCATC CACAACTGGC GGCGGCGGTC ACCGAGCCGA GTGCCGACGT GGTGACCGAG ACCACGTGGG AGGGTCGACG GCTGGTGGTA GCACACAATG CCGAGCGGGC GGCAGAGCAG ACCGTCCAGC GCCGCGAAAC CATCGAGGAA CTGGATGCAC TGGGGGCACA ACTCGCGGAA CGCCTCGACA ACCAGGATGC TGGCCAGCCG GGTCGGGGTA GACGTTCCAC TGATCGCAGC GCCTACCAGC GCTTCCACAA GGCAGTACTG GACAAGCGCA TGGGCGCCAT TGTTAAAACG GACCTCGGGG CGCCCCGGTT CAGTTACAGC ATCGACACCG AGGCGTGGGC TCGGGCCGAG CAGCTTGATG GCAAACTGCT GCTGGTCACC AGCCTCAGCG ACATGGAGGC CGAGGCTGTG GTGGAGCGTT ACCGCTCCCT GGCCGACATC GAACGCGGCT TCCGGGTGCT CAAAAGCGAG ATTGAAATTG CCCCGGTTTA CCACCGCCTG CCAGAACGTA TCCGGGCACA CGCGATGATT TGTTTCCTGG CCCTGGTGCT CTACCGCGTC CTGCGTGGGC GGCTGAAGGC TGCCGGAAGC CCCCACTCAC CGGAGAGGCT GCTGCGCGGT CTGAGGCAGA TCCAACGCCA CACCGTCCAT GTGGGCAGCC AATCCTACGA GGGCCTGACA CGGCCCAGCC AGGAGCAACT GGACCTATTC GAGGTCGCCG GCGTGGAGGT GCCGAAGGAG CCCTGCTGA
|
Protein sequence | MYTRITKSGP RRYLQLVQGY RDDNGKVKQR VVANLGRLDQ LEASDLDALI KGLQRAVGRP ESLPEAPRFD TAKAFGDVWA LHQLWQELGL GEALKRALRS SRRQFDAEAL IRAMVFNRLA APCSKLGVLE WLREEASVPG LDSEAIHHKQ LLRAMDALEE HKEAVERSVA AQLRPLLDQE LSVIFYDLTT VRIHGTTRVA DDIRDYGLSK ETGGIGRQFA LGVVQTAEGL PIAHEVFEGN VAETRTLAPM IERLLERFAL TRVVVVADRG LLSFDNIDTL DALGAEQGLA VDYILAVPGR RYRDFAKLMS TLHPQLAAAV TEPSADVVTE TTWEGRRLVV AHNAERAAEQ TVQRRETIEE LDALGAQLAE RLDNQDAGQP GRGRRSTDRS AYQRFHKAVL DKRMGAIVKT DLGAPRFSYS IDTEAWARAE QLDGKLLLVT SLSDMEAEAV VERYRSLADI ERGFRVLKSE IEIAPVYHRL PERIRAHAMI CFLALVLYRV LRGRLKAAGS PHSPERLLRG LRQIQRHTVH VGSQSYEGLT RPSQEQLDLF EVAGVEVPKE PC
|
| |