Gene Mlg_1783 details

Gene Information

Locus tag: Mlg_1783
Symbol:
ID: 4269169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2036728
End bp: 2038320
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 67%
IMG OID: 638126539
Product: transposase IS66
Protein accession: YP_742617
Protein GI: 114320934
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
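
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS includes a single terminating stop codon (the gene sequence in this record ends in TGA).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the last codon is a stop codon that is
    not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2036728, 2038320)
print(gene_len, protein_len)  # 1593 530
```

Both values agree with the Gene Length and Protein Length fields of this record.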


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCAG CGGCCATTCA ACCATATAGG GATGTCTCCC GCCTACAGCG GCAGGTCGCT 
GAGCTGGAGA AAAAGCTCGC CGAAACAGAC GCCCTGTTGG CCACCAAGGA GGCCCACTGG
GCTGCCCGCG AGCGCTCCAT GTTCGAGCAG ATCCGGCTGC TGCTCGACAG CCGCTTCGGC
CCCTCCACCG AACGCTACCA CGTCGATCAG CAGCAACTGC AGTTCGACGA GGCCGAGCAG
TATGCCGATG CACCGGTCAC CGAACCGGAG GCAGAGGCCG CTCAAGCCGG CGAGACGGCC
CCGAGCGTGC CGGCCAAGCG CCGGAACCGT GGCGGCCGCG TGCGGCTGCC CGCGGAACTG
CCGCGGGTCG AGGTGGTGCA CGATATCCCC GAGGCACAGC GCTACTGCCC GCATGACGGC
AGCGAGCTGA CCTGCATCGG TGAAGAGGTC ACCGAGCAAC TGGATGTCAT CCCCGCCCGG
GTGCAGGTCC GCCGCCACAT CCGGCGCAAG TACGCCTGCA GATGCTGCGA AGAAGGCGTG
CACACCGCAA GCATGCCGCC GCAACCGCTG CCCCGGAGCA TGGCCAGCCC CGGATTGCTG
GCCTACATCG CCACCGCCAA GTACGAATTC GGCCTGCCGC TCTACCGCCA GGCCAAGGGC
TTCGAGCGCA AGGGCATCCC GCTGCCGCGT AACACCCTGG CGCGCTGGAT GGTGGGCATC
GGCGAGCTGC TCACCCCGCT GGGGCAGGCC CTGCAGGACC ATCTACTGGC CCAGCCGCTC
ATCCACATGG ATGAGACCAC GGTCCAGGTG AACACCGAGC CGGGGCGAAC GGCCTCCAGC
ACCTCCTACA TGTGGGTCCA GCGCGGTGGC CCGCCCGGTG AGCAGGTGGT GCGCTACGAC
TACGACACCA GCCGCTCCGG CCGGGTCCCC CAGCGCCTGC TCGGCGACTA TGCCGGCGTG
CTGGTCACCG ACGGCTACGA GGGCTATGCC CAGGTGGTGC GGGAGAATGG CATCACCCAT
GCCGGCTGCT GGGCGCATGC CCGGCGGAAG TTTGTCGAGG CCCAGAAGGT CCAGCCCAAG
GGCAAGACCG GCAAGGCCGA CTGGGCGCTG AGCCTGATCG GCAAGCTTTA CCGCGTCGAG
CGCGAAGGCA AAACCCTGGA CCCGGAGGCT CGTCTGGTGC TGCGTCAGCG CCAGAGCCGG
CCGCTGATCG ACAAACTCCA GCGCTGGCTG GAGAAGTCCA TCACCCAGGT GCCGCCGAAG
ACCGCCATCG GCAAAGCGCT GCGCTATCTT CAGGGCCAGT GGTCCCGGCT GACCCGCTTT
CTCGATGATG GGCGCATCCC GCTGGATAAC AATCCGGCGG AGAACGCCAT CCGACCCTTC
GTGGTGGGCC GAAAGAACTG GCTATTCAGT CACACCACCC AGGGCGCGGC GGCCAGCGCG
ATGATCTACA GCGTGATAGA GACGGCCAAG GCCAACGGGC TGGAGCCCTA CGAGTATCTG
GAAGATGTCC TCACCCGCCT GCCGGCTGCG GACACCAACC AGGCGGTTCA CGCTCTGCTG
CCCTGGAACT GGGGTAAGAC CATACAGGCC TGA
 
Protein sequence
MKSAAIQPYR DVSRLQRQVA ELEKKLAETD ALLATKEAHW AARERSMFEQ IRLLLDSRFG 
PSTERYHVDQ QQLQFDEAEQ YADAPVTEPE AEAAQAGETA PSVPAKRRNR GGRVRLPAEL
PRVEVVHDIP EAQRYCPHDG SELTCIGEEV TEQLDVIPAR VQVRRHIRRK YACRCCEEGV
HTASMPPQPL PRSMASPGLL AYIATAKYEF GLPLYRQAKG FERKGIPLPR NTLARWMVGI
GELLTPLGQA LQDHLLAQPL IHMDETTVQV NTEPGRTASS TSYMWVQRGG PPGEQVVRYD
YDTSRSGRVP QRLLGDYAGV LVTDGYEGYA QVVRENGITH AGCWAHARRK FVEAQKVQPK
GKTGKADWAL SLIGKLYRVE REGKTLDPEA RLVLRQRQSR PLIDKLQRWL EKSITQVPPK
TAIGKALRYL QGQWSRLTRF LDDGRIPLDN NPAENAIRPF VVGRKNWLFS HTTQGAAASA
MIYSVIETAK ANGLEPYEYL EDVLTRLPAA DTNQAVHALL PWNWGKTIQA