Gene Mlg_1536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1536 
Symbol 
ID4270541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1750241 
End bp1751833 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID638126294 
Producttransposase IS66 
Protein accessionYP_742375 
Protein GI114320692 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.881033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.856435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAG CGGCCATTCA ACCAGATAGG GATGTCTCCC GCCTACAGCG TCAGGTCGCT 
GAGCTGGAGA AAAAGCTCGC CGAAAAGGAC GCCCTGTTGG CCACCAAGGA AGCCCACTGG
GCTGCCCGCG AGTGCTCCAT GTTCGAGCAG ATCCGGCTGC TGCTCGACAG CCGTTTCGGC
CCCTCCACCG AACGCTACCA CGTCGATCAG CAGCAACTGC AGTTCGACGA GGCCGAGCAG
TATGCCGATG CACCGGTCAC CGAACCGGAG GCAGAGGCCG CTCAAGCCGG CGAGACGGCC
CCGAGCGTGC CGGCCAAGCG CCGGAACCGT GGCGGCCGCG TGCGGCTGCC CGCGGAACTG
CCACGGGTCG AGGTGGTGCA CGATATCCCC GAGGCACAGC GCTACTGCCC GCATGGCGGC
AGCGAGCTGA CCTGTATCGG CGAAGAGGTC ACCGAGCAAC TGGATGTCAT CCCCGCCCGG
GTGCAGGTCC GCCGCCACAT CCGCCGCAAG TACGCCTGCA GATGCTGCGA AGAAGGCGTG
CACACCGCAA GCATGCCGCC GCAACCGCTG CCCTGGAGCA TGGCCAGCCC CGGATTGCTG
GCCTACATCG CCACCGCCAA GTATGAATAC GGGCTGCCGC TCTACCGCCA GGCCAAGGGC
TTCGAGCGCA AGGGCATCCC GCTGCCGCGT AACACCCTGG CGCGCTGGAT GGTGGGCATC
GGCGAGCTGC TCACCCCGCT GGGGCAGGCC CTGCAGGACC ATCTACTGGC CCAGCCGCTC
ATCCACATGG ATGAGACCAC GGTCCAGGTG AACACCGAGC CGGGGCGAAC GGCCTCCAGC
ACCTCCTACA TGTGGGTCCA GCGCGGTGGC CCGCCCGGTG AGCAGGTGGT GCGCTACGAC
TACGACACCA GCCGCTCCGG CCGGGTCCCC CGCCGCCTGC TCGGCGACTA TGCCGGCGTG
CTGGTCACCG ACGGCTACGA GGGCTATGCC CAGGTGGTGC GGGAGAATGG CATCACCCAT
GCCGGCTGCT GGGCGCATGC CCGGCGGAAG TTTGTCGAGG CCCAGAAGGT CCAGCCCAAG
GGCAAGACCG GCAAGGCCGA CTGGGCGCTG AGCCTGATCG GCAAGCTTTA CCGCGTCGAG
CGCGAAGGCA AAACCCTGGA CCCGGAGGAT CGCCTGGTGC TGCGTCAGCG CCAGAGCCGG
CCGCTGATCG ACAAACTCCA GCGCTGGCTG GAGAAGTCCA TCACCCAGGT GCCGCCGAAG
ACCGCCATCG GCAAGGCCCT ACGCTATCTC CAGACCCAGT GGCCCCGGCT GACCCGCTTT
CTCGATGATG GGCGAATCCC ACTGGATAAC AATCCGGCGG AGAACGCCAT CCGACCCTTC
GTAGTGGGTC GAAAGAACTG GCTATTCAGT CACACCACCC AGGGCGCGTC GGCCAGCGCG
ATGATCTATA GCGTGATAGA GACGGCCAAG GCCAACGGGC TGGAGCCCTA CGAGTACCTG
GAAGATGTCC TTACCCGCCT GCCGGCTGCG GACACCGACC AGGCGATTCA CGCCCTGCTG
CCCTGGAACT GGGGGAAGAC CATACAGGCC TGA
 
Protein sequence
MKSAAIQPDR DVSRLQRQVA ELEKKLAEKD ALLATKEAHW AARECSMFEQ IRLLLDSRFG 
PSTERYHVDQ QQLQFDEAEQ YADAPVTEPE AEAAQAGETA PSVPAKRRNR GGRVRLPAEL
PRVEVVHDIP EAQRYCPHGG SELTCIGEEV TEQLDVIPAR VQVRRHIRRK YACRCCEEGV
HTASMPPQPL PWSMASPGLL AYIATAKYEY GLPLYRQAKG FERKGIPLPR NTLARWMVGI
GELLTPLGQA LQDHLLAQPL IHMDETTVQV NTEPGRTASS TSYMWVQRGG PPGEQVVRYD
YDTSRSGRVP RRLLGDYAGV LVTDGYEGYA QVVRENGITH AGCWAHARRK FVEAQKVQPK
GKTGKADWAL SLIGKLYRVE REGKTLDPED RLVLRQRQSR PLIDKLQRWL EKSITQVPPK
TAIGKALRYL QTQWPRLTRF LDDGRIPLDN NPAENAIRPF VVGRKNWLFS HTTQGASASA
MIYSVIETAK ANGLEPYEYL EDVLTRLPAA DTDQAIHALL PWNWGKTIQA