Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1536 |
Symbol | |
ID | 4270541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1750241 |
End bp | 1751833 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126294 |
Product | transposase IS66 |
Protein accession | YP_742375 |
Protein GI | 114320692 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.881033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.856435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAG CGGCCATTCA ACCAGATAGG GATGTCTCCC GCCTACAGCG TCAGGTCGCT GAGCTGGAGA AAAAGCTCGC CGAAAAGGAC GCCCTGTTGG CCACCAAGGA AGCCCACTGG GCTGCCCGCG AGTGCTCCAT GTTCGAGCAG ATCCGGCTGC TGCTCGACAG CCGTTTCGGC CCCTCCACCG AACGCTACCA CGTCGATCAG CAGCAACTGC AGTTCGACGA GGCCGAGCAG TATGCCGATG CACCGGTCAC CGAACCGGAG GCAGAGGCCG CTCAAGCCGG CGAGACGGCC CCGAGCGTGC CGGCCAAGCG CCGGAACCGT GGCGGCCGCG TGCGGCTGCC CGCGGAACTG CCACGGGTCG AGGTGGTGCA CGATATCCCC GAGGCACAGC GCTACTGCCC GCATGGCGGC AGCGAGCTGA CCTGTATCGG CGAAGAGGTC ACCGAGCAAC TGGATGTCAT CCCCGCCCGG GTGCAGGTCC GCCGCCACAT CCGCCGCAAG TACGCCTGCA GATGCTGCGA AGAAGGCGTG CACACCGCAA GCATGCCGCC GCAACCGCTG CCCTGGAGCA TGGCCAGCCC CGGATTGCTG GCCTACATCG CCACCGCCAA GTATGAATAC GGGCTGCCGC TCTACCGCCA GGCCAAGGGC TTCGAGCGCA AGGGCATCCC GCTGCCGCGT AACACCCTGG CGCGCTGGAT GGTGGGCATC GGCGAGCTGC TCACCCCGCT GGGGCAGGCC CTGCAGGACC ATCTACTGGC CCAGCCGCTC ATCCACATGG ATGAGACCAC GGTCCAGGTG AACACCGAGC CGGGGCGAAC GGCCTCCAGC ACCTCCTACA TGTGGGTCCA GCGCGGTGGC CCGCCCGGTG AGCAGGTGGT GCGCTACGAC TACGACACCA GCCGCTCCGG CCGGGTCCCC CGCCGCCTGC TCGGCGACTA TGCCGGCGTG CTGGTCACCG ACGGCTACGA GGGCTATGCC CAGGTGGTGC GGGAGAATGG CATCACCCAT GCCGGCTGCT GGGCGCATGC CCGGCGGAAG TTTGTCGAGG CCCAGAAGGT CCAGCCCAAG GGCAAGACCG GCAAGGCCGA CTGGGCGCTG AGCCTGATCG GCAAGCTTTA CCGCGTCGAG CGCGAAGGCA AAACCCTGGA CCCGGAGGAT CGCCTGGTGC TGCGTCAGCG CCAGAGCCGG CCGCTGATCG ACAAACTCCA GCGCTGGCTG GAGAAGTCCA TCACCCAGGT GCCGCCGAAG ACCGCCATCG GCAAGGCCCT ACGCTATCTC CAGACCCAGT GGCCCCGGCT GACCCGCTTT CTCGATGATG GGCGAATCCC ACTGGATAAC AATCCGGCGG AGAACGCCAT CCGACCCTTC GTAGTGGGTC GAAAGAACTG GCTATTCAGT CACACCACCC AGGGCGCGTC GGCCAGCGCG ATGATCTATA GCGTGATAGA GACGGCCAAG GCCAACGGGC TGGAGCCCTA CGAGTACCTG GAAGATGTCC TTACCCGCCT GCCGGCTGCG GACACCGACC AGGCGATTCA CGCCCTGCTG CCCTGGAACT GGGGGAAGAC CATACAGGCC TGA
|
Protein sequence | MKSAAIQPDR DVSRLQRQVA ELEKKLAEKD ALLATKEAHW AARECSMFEQ IRLLLDSRFG PSTERYHVDQ QQLQFDEAEQ YADAPVTEPE AEAAQAGETA PSVPAKRRNR GGRVRLPAEL PRVEVVHDIP EAQRYCPHGG SELTCIGEEV TEQLDVIPAR VQVRRHIRRK YACRCCEEGV HTASMPPQPL PWSMASPGLL AYIATAKYEY GLPLYRQAKG FERKGIPLPR NTLARWMVGI GELLTPLGQA LQDHLLAQPL IHMDETTVQV NTEPGRTASS TSYMWVQRGG PPGEQVVRYD YDTSRSGRVP RRLLGDYAGV LVTDGYEGYA QVVRENGITH AGCWAHARRK FVEAQKVQPK GKTGKADWAL SLIGKLYRVE REGKTLDPED RLVLRQRQSR PLIDKLQRWL EKSITQVPPK TAIGKALRYL QTQWPRLTRF LDDGRIPLDN NPAENAIRPF VVGRKNWLFS HTTQGASASA MIYSVIETAK ANGLEPYEYL EDVLTRLPAA DTDQAIHALL PWNWGKTIQA
|
| |