Gene Information |
Locus tag | Mlg_1129 |
Symbol | |
ID | 4269624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1319824 |
End bp | 1321512 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638125879 |
Product | transposase, IS4 family protein |
Protein accession | YP_741969 |
Protein GI | 114320286 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
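
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length_bp(1319824, 1321512)
    print(length, protein_length_aa(length))
```

With the values from this record, the script gives 1689 bp and 562 aa, which agrees with the Gene Length and Protein Length rows.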
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.981252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACACGC GCATCACAAA GTCCGGCCCG CGCCGCTATT TGCAGTTGGT CGAGGGCTAC CGAGACGCCA ACGGCAAGGT TAAACAGCGT GTGGTAGCCA GCCTGGGTCG CCTTGATCAG CTTGAGGCAG CTGACCTGGA GCCGCTCATC AAGGGACTCC AACGGGCGGT GGGGCGTCCT GAATCATTGC CTGAAGCGCC TCAGTTCGAG ACCGCCCGCG CCTTTGGCGA TGTGTGGACC CTCCATCAGC TCTGGCAGGA GTTAGGCCTG GATGGGGCCT TGAAACGGGC TCTGCGCTCC TCCCGACGAC AGTTCGATGC CGAGGCACTC ATCCGAGCCA TGGTTTTCAA CCGCCTCGCT GAGCCGAGCA GCAAACGCGG CACGCTGGAG TGGTTGCGCG AGAGCACCAG CATGCCCGGG CTCGATGCCT CGACCGTGCA CCACGAGCAA CTCCTGCGTG CCATGGATGC GTTGGAGGAC CACAAGGAGG CCGTGGAGCG TGCGGTAGCC GGCCAGTTGC GGCCGCTGCT TGACCAGGAT CTCAGCGTCA TCTTTTACGA TCTCACCACG GTACGCATCC ACGGCACCCA TGCGATGGAA GACGATATCC GTCAGCATGG CCTCAGCAAG GACACCGCCG GTATTGGCCG CCAATTTGCG CTGGGGGTGG TCCAGACTGC CGAAGGCCTG CCGATCGCCC ACGAGGTCTT CGAGGGGAAT GTCGCCGAGA CGCGCACCCT GGCCCCAATG ATTGAACGTC TGCTGGAGCG TTTCGCGCTC GCCCGCGTGG TAGTGGTCGC CGACCGCGGC CTGCTCAGCT TCGATAACAT CGACACTCTG GAGGCACTGG GCGTCGAGCA GGGGCTGACG GTCGACTACA TTCTGGCTGT ACCCGCCCGG CGTTACGGGG ACTTCACCGA GGTGATGGAG TCTCTGCATA CGCAACTGGA AACCGCCGTC ACCGAGCCTA GTCAGGACGT GGTGACCGAA ACCACCTGGC AGGGCCGACG CCTGGTGGTG GCCCACAATG CCGAACGGGC CGCCGAGCAG AGCGCGCAGC GTCGGCAGAC CATCGAAGAG CTTGACGCCC TTGGTGCCCA ACTCGCCGAG CGACTCGATA ATCAGGATGC TGGCCAGCCC GGTCGTGGGC GGCGTTCCAC CGACCGTAGT GCCTACCAGC GCTTCCACAA GGCCGTGCTG GAGAGGCGCA TGAGCGCGAT CATCAAGGCC GATCTCGGCT CGCCGCAATT CAGCTACACC ATCGATGAGC CCGCCTGGGC GGCTGCCGAA CGTCTGGACG GCAAGCTACT GCTGGTTACC AGCCTCACCG ACATGGGGGC GGAAGCTGTG GTAGAGCGTT ACCGCTCCCT GGCCGATATC GAGCGTGGAT TCCGGGTACT CAAGAGCGAG ATTGAAATCG CGCCGGTCTA CCACCGGCTG CCGGAGCGCA TTCGGGCGCA CGCGATGATT TGCTTCCTCG CGCTGGTGCT CTACCGGGTC TTGCGCGGGC GCCTCAAGGC CGCCGGCAGC GCCTGCTCAC CGGAAAAGCT GCTACGGAGC CTGCGGCAAA TCCAGCGGCA CACGGTGCGT GTGGGTAGCC GCGCCTACGA AGGGCTTACC CGACCCAACC AGGAACAGCT GGACCTTTTC GAAGCCGCCG GCGTGGAGGT TCCGAAGGCC CCGCGTTGA
|
Protein sequence | MYTRITKSGP RRYLQLVEGY RDANGKVKQR VVASLGRLDQ LEAADLEPLI KGLQRAVGRP ESLPEAPQFE TARAFGDVWT LHQLWQELGL DGALKRALRS SRRQFDAEAL IRAMVFNRLA EPSSKRGTLE WLRESTSMPG LDASTVHHEQ LLRAMDALED HKEAVERAVA GQLRPLLDQD LSVIFYDLTT VRIHGTHAME DDIRQHGLSK DTAGIGRQFA LGVVQTAEGL PIAHEVFEGN VAETRTLAPM IERLLERFAL ARVVVVADRG LLSFDNIDTL EALGVEQGLT VDYILAVPAR RYGDFTEVME SLHTQLETAV TEPSQDVVTE TTWQGRRLVV AHNAERAAEQ SAQRRQTIEE LDALGAQLAE RLDNQDAGQP GRGRRSTDRS AYQRFHKAVL ERRMSAIIKA DLGSPQFSYT IDEPAWAAAE RLDGKLLLVT SLTDMGAEAV VERYRSLADI ERGFRVLKSE IEIAPVYHRL PERIRAHAMI CFLALVLYRV LRGRLKAAGS ACSPEKLLRS LRQIQRHTVR VGSRAYEGLT RPNQEQLDLF EAAGVEVPKA PR
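
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned and sanity-checked before further use is given below. The helper names are hypothetical, the GC figure is computed on whatever string is passed in, and the completeness check assumes the three stop codons of translation table 11 (TAA, TAG, TGA).

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str,
                            stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in stop_codons


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record, used as a toy input.
    fragment = clean_sequence("ATGTACACGC GCATCACAAA GTCCGGCCCG")
    print(len(fragment), gc_percent(fragment))
```

Passing the full Gene sequence field through clean_sequence should yield a string whose length matches the Gene Length row; the fragment used here is only for demonstration.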