Gene Mlg_1129 details

Gene Information

Locus tag: Mlg_1129
Symbol:
ID: 4269624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1319824
End bp: 1321512
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 64%
IMG OID: 638125879
Product: transposase, IS4 family protein
Protein accession: YP_741969
Protein GI: 114320286
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
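
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1319824
END_BP = 1321512
GENE_LENGTH_BP = 1689
PROTEIN_LENGTH_AA = 562


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```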


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.981252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACACGC GCATCACAAA GTCCGGCCCG CGCCGCTATT TGCAGTTGGT CGAGGGCTAC 
CGAGACGCCA ACGGCAAGGT TAAACAGCGT GTGGTAGCCA GCCTGGGTCG CCTTGATCAG
CTTGAGGCAG CTGACCTGGA GCCGCTCATC AAGGGACTCC AACGGGCGGT GGGGCGTCCT
GAATCATTGC CTGAAGCGCC TCAGTTCGAG ACCGCCCGCG CCTTTGGCGA TGTGTGGACC
CTCCATCAGC TCTGGCAGGA GTTAGGCCTG GATGGGGCCT TGAAACGGGC TCTGCGCTCC
TCCCGACGAC AGTTCGATGC CGAGGCACTC ATCCGAGCCA TGGTTTTCAA CCGCCTCGCT
GAGCCGAGCA GCAAACGCGG CACGCTGGAG TGGTTGCGCG AGAGCACCAG CATGCCCGGG
CTCGATGCCT CGACCGTGCA CCACGAGCAA CTCCTGCGTG CCATGGATGC GTTGGAGGAC
CACAAGGAGG CCGTGGAGCG TGCGGTAGCC GGCCAGTTGC GGCCGCTGCT TGACCAGGAT
CTCAGCGTCA TCTTTTACGA TCTCACCACG GTACGCATCC ACGGCACCCA TGCGATGGAA
GACGATATCC GTCAGCATGG CCTCAGCAAG GACACCGCCG GTATTGGCCG CCAATTTGCG
CTGGGGGTGG TCCAGACTGC CGAAGGCCTG CCGATCGCCC ACGAGGTCTT CGAGGGGAAT
GTCGCCGAGA CGCGCACCCT GGCCCCAATG ATTGAACGTC TGCTGGAGCG TTTCGCGCTC
GCCCGCGTGG TAGTGGTCGC CGACCGCGGC CTGCTCAGCT TCGATAACAT CGACACTCTG
GAGGCACTGG GCGTCGAGCA GGGGCTGACG GTCGACTACA TTCTGGCTGT ACCCGCCCGG
CGTTACGGGG ACTTCACCGA GGTGATGGAG TCTCTGCATA CGCAACTGGA AACCGCCGTC
ACCGAGCCTA GTCAGGACGT GGTGACCGAA ACCACCTGGC AGGGCCGACG CCTGGTGGTG
GCCCACAATG CCGAACGGGC CGCCGAGCAG AGCGCGCAGC GTCGGCAGAC CATCGAAGAG
CTTGACGCCC TTGGTGCCCA ACTCGCCGAG CGACTCGATA ATCAGGATGC TGGCCAGCCC
GGTCGTGGGC GGCGTTCCAC CGACCGTAGT GCCTACCAGC GCTTCCACAA GGCCGTGCTG
GAGAGGCGCA TGAGCGCGAT CATCAAGGCC GATCTCGGCT CGCCGCAATT CAGCTACACC
ATCGATGAGC CCGCCTGGGC GGCTGCCGAA CGTCTGGACG GCAAGCTACT GCTGGTTACC
AGCCTCACCG ACATGGGGGC GGAAGCTGTG GTAGAGCGTT ACCGCTCCCT GGCCGATATC
GAGCGTGGAT TCCGGGTACT CAAGAGCGAG ATTGAAATCG CGCCGGTCTA CCACCGGCTG
CCGGAGCGCA TTCGGGCGCA CGCGATGATT TGCTTCCTCG CGCTGGTGCT CTACCGGGTC
TTGCGCGGGC GCCTCAAGGC CGCCGGCAGC GCCTGCTCAC CGGAAAAGCT GCTACGGAGC
CTGCGGCAAA TCCAGCGGCA CACGGTGCGT GTGGGTAGCC GCGCCTACGA AGGGCTTACC
CGACCCAACC AGGAACAGCT GGACCTTTTC GAAGCCGCCG GCGTGGAGGT TCCGAAGGCC
CCGCGTTGA
 
Protein sequence
MYTRITKSGP RRYLQLVEGY RDANGKVKQR VVASLGRLDQ LEAADLEPLI KGLQRAVGRP 
ESLPEAPQFE TARAFGDVWT LHQLWQELGL DGALKRALRS SRRQFDAEAL IRAMVFNRLA
EPSSKRGTLE WLRESTSMPG LDASTVHHEQ LLRAMDALED HKEAVERAVA GQLRPLLDQD
LSVIFYDLTT VRIHGTHAME DDIRQHGLSK DTAGIGRQFA LGVVQTAEGL PIAHEVFEGN
VAETRTLAPM IERLLERFAL ARVVVVADRG LLSFDNIDTL EALGVEQGLT VDYILAVPAR
RYGDFTEVME SLHTQLETAV TEPSQDVVTE TTWQGRRLVV AHNAERAAEQ SAQRRQTIEE
LDALGAQLAE RLDNQDAGQP GRGRRSTDRS AYQRFHKAVL ERRMSAIIKA DLGSPQFSYT
IDEPAWAAAE RLDGKLLLVT SLTDMGAEAV VERYRSLADI ERGFRVLKSE IEIAPVYHRL
PERIRAHAMI CFLALVLYRV LRGRLKAAGS ACSPEKLLRS LRQIQRHTVR VGSRAYEGLT
RPNQEQLDLF EAAGVEVPKA PR
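
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
# Standard codon assignments, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGTACACGC GCATCACAAA GTCCGGCCCG CGCCGCTATT TGCAGTTGGT CGAGGGCTAC"
print(translate(first_line))  # MYTRITKSGPRRYLQLVEGY
```

The printed peptide matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.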