Gene Mlg_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0852 
SymbolileS 
ID4270789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp966977 
End bp969802 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content68% 
IMG OID638125604 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_741696 
Protein GI114320013 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGATT ACAAGCACAC CCTGAACCTG CCGAAGACCG GCTTTCCCAT GCGCGGCAAT 
CTGGCCAAGC GGGAGCCGGA GCGGCTCGCC GGCTGGTATC AGACGGACCT TTACGGTCGC
CTGCGCCGCG AGCGCGCCGG CAAGCCCCGT TTCGTGCTGC ACGACGGCCC GCCCTACGCC
AACGGCGACA TCCACATCGG CCACGCGGTC AACAAGATCC TCAAGGACAT CATCATCAAG
GCGCGCAGCA TGGACGGCTA CGACGTCCCC TATGTGCCGG GCTGGGATTG CCACGGGCTG
CCCATAGAGC TGATGGTGGA GAAAAAGCGG GGCAAGGCGG GCGCCAAGGT GAGCCCGCGG
GCCTTCCGCG ACGCCTGCCG CGAGTTTGCC GCCAGCCAAG TGGACGGCCA GCGCGAGGAT
TTTAAGCGGC TCGGCGTGCT TGGTGACTGG GACAACCCTT ACCTCACCAT GGATTACCGC
ACCGAGGCGG ATATCCTGCG TGCCCTGGGG CGGATCATCC AGCGTGGTCA TGTCACCCGA
GGCTTCAAGC CGGTGCACTG GTGCGCCGAC TGTGGTTCCG CCCTGGCCGA GGCCGAGGTG
GAATACGAGG AGAAGACCTC GCCCGCCATC GACGTGCGCT TTGCCGTGCT GGAGCCCGAG
GAGTTGGACC GCCGGGCCGG ACTCGGGGGC GAGGCGGCCG CGGCCGGCCG GGTCGCCATC
CCCATCTGGA CCACCACCCC CTGGACACTG CCCGCCAACC AGGCGGTGGC TTTGCACCCG
GAGTTGGAAT ACGTCGTGGT CGCTTTCGAC GACGAACTAC TGGTGCTGGC GGCCGAGCTG
GTGGAATCGG CCATGGCCCG TTACGAGGTG GACGACTACC GGGTCGTCGG CCGTTGCGAC
GGCGCGGTGC TCGAGGGCCT GCGACTGGCG CACCCCTTCC TGGAACGCGA GGTGCCGGTG
ATCCTGGGCG GGCACGTCAC CACCGACGGC GGCACCGGGG CGGTGCACAC CGCCCCCGGG
CATGGCCAGG ATGACTACGT GGTCGGCCAG CAGTACGACC TGCCCACCGA CAACCCGGTG
GACGGCAACG GGGTTTTCCT CCCCGACACC CCGTTCTTCG CCGGACAGCA TGTCTTCAAG
GCCAACCCGA AGGTGGTGGA CCTGCTCGCG GAGCGCGGCG CGCTGCTGCA CCACGAGCCC
TACCGCCACA GCTACCCTCA TTGCTGGCGC CACAAGACCC CCATCCTCTT CCGGGCCACG
CCGCAGTGGT TCATCAGCCT GGACAAGGCG GGCATGCGTG AGCACGCCAT GGCCGCCATC
AAGGGGGTGA GCTGGCATCC GGAGTGGGGG CAGGCCCGGA TCGAATCGAT GGTCAACGGC
CGGCCGGACT GGTGCATCTC CCGCCAGCGC AACTGGGGTG TGCCCATCGC CCTGTTCGTG
GACAAGCGCA GCGGTGAGCC CCACCCGGAG AGCGAACGGT TGATCGAGGC CGTGGCCCGG
CGGGTGGAGG AGGCCGGGGT GGACGCCTGG TTCGAGCTGG ACCCCGCCGA GCTGTTGGGC
GCTGACGCCG AGCGCTACGA GAAGGTCACC GATATCCTGG ATGTCTGGTT CGACTCCGGC
GTCACCCACG CCACGGTGCT GGAGCGCCGG GACGAGCTGC AGGTGCCCGC CGACCTCTAC
CTGGAGGGCT CCGATCAGCA CCGCGGCTGG TTCCAGTCCT CGCTGCTCAC CTCGGTGGGG
GTGCGCGAGA CCGCGCCCTA CAAGGGGGTG CTTACCCACG GCTTCACCGT GGACGAGAAG
GGCCACAAGA TGTCCAAGTC CCGGGGCAAC GTGGTGGCCC CGCAGAAGGT CATGGATACC
CTGGGGGCGG ATATCCTGCG CCTGTGGGTG GCCTCCTCGG ACTACTCCGC CGAGATGGCC
GTCTCCGACG GCATCCTCAA GCGCACCGCT GACGCCTACC GGCGCATGCG TAACACGGCC
CGCTTCCTGC TCGCCAACCT CAACGGCTTC GAGCCCGCCG AACACGCCGT GGCCCCCCCG
GATATGCTGC CCCTGGACCG CTGGGCCGTG GACCGCGCCT ACCTGCTCCA GCAGCAGGTG
CGCGAGGCCT ACGAGCGCTA CGAGTTCCAC CGTATCTATC AGATGGTGCA CAACTTCTGC
GTGGTGGACC TGGGCGGTTT CTACCTGGAC GTGATCAAGG ACCGCCAGTA CACCACGAAG
CCCGACAGCC TCGCCCGCCG CTCCTGCCAG ACCGCCCTCT GGCACGTGGC CGAGGGCCTG
GTGCGCTGGC TTGCGCCCAT CATCTCCTTC ACCGCCGAGG AGATCTGGGA GCACCTGCCG
GGCGAACGCA GCGACTCGGT GCTGCTTGAG ACCTGGTACG AGGGCCTGTT CCCGCTCGAT
GACAGCGACC CGTTCGGGCG CGCCTTCTGG GACGACGTGC TGGCGGTGCG CGCCGGTGTC
AACCGGGAGC TGGAGCAGCT TCGCAACGAC AAGGTGATTG GCGCCAGCCT GCAGGCGGAG
GTGCAGCTCT TCTGTCCGCC GGAGCTCAAG GCCAAGCTCG ACCGGCTGGG CGACGAGCTG
CGCTTCGTGC TGATCACCAG CGAGGCGCGC GTGGAGGATC TGGAACGGGC CCCGGTGGAG
TCGGTGGAGG TCCCCGGTGA GAATGGCCAG GGTTTCCGCC TCTTCGCCGC AGCCAGTCAG
CACCCCAAGT GCACCCGCTG CTGGCACCAT CGCCCGGATG TGGGCCACCA TGCGGATCAC
CCGGAGCTGT GCGGCCGTTG CGTGAGCAAC GTCGACGGCG AGGGTGAGAC CCGTCACTAC
GCCTGA
 
Protein sequence
MSDYKHTLNL PKTGFPMRGN LAKREPERLA GWYQTDLYGR LRRERAGKPR FVLHDGPPYA 
NGDIHIGHAV NKILKDIIIK ARSMDGYDVP YVPGWDCHGL PIELMVEKKR GKAGAKVSPR
AFRDACREFA ASQVDGQRED FKRLGVLGDW DNPYLTMDYR TEADILRALG RIIQRGHVTR
GFKPVHWCAD CGSALAEAEV EYEEKTSPAI DVRFAVLEPE ELDRRAGLGG EAAAAGRVAI
PIWTTTPWTL PANQAVALHP ELEYVVVAFD DELLVLAAEL VESAMARYEV DDYRVVGRCD
GAVLEGLRLA HPFLEREVPV ILGGHVTTDG GTGAVHTAPG HGQDDYVVGQ QYDLPTDNPV
DGNGVFLPDT PFFAGQHVFK ANPKVVDLLA ERGALLHHEP YRHSYPHCWR HKTPILFRAT
PQWFISLDKA GMREHAMAAI KGVSWHPEWG QARIESMVNG RPDWCISRQR NWGVPIALFV
DKRSGEPHPE SERLIEAVAR RVEEAGVDAW FELDPAELLG ADAERYEKVT DILDVWFDSG
VTHATVLERR DELQVPADLY LEGSDQHRGW FQSSLLTSVG VRETAPYKGV LTHGFTVDEK
GHKMSKSRGN VVAPQKVMDT LGADILRLWV ASSDYSAEMA VSDGILKRTA DAYRRMRNTA
RFLLANLNGF EPAEHAVAPP DMLPLDRWAV DRAYLLQQQV REAYERYEFH RIYQMVHNFC
VVDLGGFYLD VIKDRQYTTK PDSLARRSCQ TALWHVAEGL VRWLAPIISF TAEEIWEHLP
GERSDSVLLE TWYEGLFPLD DSDPFGRAFW DDVLAVRAGV NRELEQLRND KVIGASLQAE
VQLFCPPELK AKLDRLGDEL RFVLITSEAR VEDLERAPVE SVEVPGENGQ GFRLFAAASQ
HPKCTRCWHH RPDVGHHADH PELCGRCVSN VDGEGETRHY A