Gene Emin_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1144 
Symbol 
ID6263975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1242668 
End bp1245496 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content42% 
IMG OID642611624 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001876033 
Protein GI187251551 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.946837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACA ATACAAAACC AAACAACAAC AAATATTCCA AAACAGTCTT ATTACCAAAA 
ACCGATTTTC CAATGAGGGC GGGTCTTGCT CAGAAAGAAC CAAAATTTGT TGAGTTTTGG
AAAACTATTA AACTTTATGA AAAAATGCAA AAACTGCGCG ACGGTAAAGA AGTTTTTATG
CTTCACGATG GGCCCCCTTA CGCCAATGGC CGCATACATA TCGGCCATGC CATGGATAAA
ACTTTAAAGG ATATGGTTTT AAAAAGCCGC CATATGCAAG GCTTTCAAAC GCCTTATATT
CCCGGTTGGG ACTGCCACGG CCTTCCTATA GAACAGGCTT TAATGAAGGA AATGAAAATC
GACAAAAAGA GCATTAAGGA AGAAGACGTG CCCGGCTTTC GTAAAAAAGC AAGGGAATTT
GCCGATAAAT TTGTTGATAT CCAAATGCAA GGCTTTGAAA GGCTTGGCGT TCAGGGAGAC
TGGGCAAATT ACTACAGCAC CATGGCCAAA AGATATGAAG GTAATGTAAT CGGTGTATTT
TTAGATTTTA TAGAAAAGGG TTTAGCTTAC AGAGGCACAA AAACAATTTT TTGGTGCCCC
ACTTGCGAAA CCGCTTTGGC CGACGCGGAA ACGGAATATA AAGACAAAGT TTCCCAATCA
ATTTACCTGC GTTTTAAACT GGCTGAACCT TTTAAAGGTA AGGCAAATGT TTCCTTGGTT
ATTTGGACCA CCACGCCGTG GACAATACCC GCCAATAAAG CTACCGCCGT TAATAAAGAT
GAAGACTATG TTTTACTTAA AGACAATAAA ACCGGAGAAT ATTATATAGT AGCCGACAAA
CTTGCCGAAA ATTTTAAAAA CAAAAGCGGT TATGACATAA CAAAAGAAGA AAAATTTAGC
GGCAACGATT TAGTGGGTCT TAAATATAAA CACCCTTTAT TAGACAGGCT TAACCCTGTT
ATATGGACGG ACTTCGTTGC TATGGATACC GGCGTAGGCT TGGTACACGT AGCTCCCGCC
CACGGTGAGG ATGACGCCAA AGCCGGACAA ATTTGGAACT TGGAAGTTTT TGGCCCTGTT
GATGAAAAAG GTATTTTTAC TAAAGAAGCG GGCGAGTTTG CCGGGCAGCA TATTTTTAAA
GCCAATGCGG AAATCATAAA GCGTTTAAAT GAACTCGGCA ACTTAATAAA AGAAGAAACT
ATTGATCACA GCTACCCGCA TTGCTGGCGC TGTAAACAGC CCATTATATT TAGAGCTACG
GAGCAGTGGT TTCTTTCCAT TGACGGACAA AATTTGAGAA GTGAACTTAC AAAAGCGGTT
GAGTCTGTTG ATTTTTACCC CAAAGCCGGC GTTTCACGCA TAGGCAACAT GGTTAAAATG
AGGCCCGACT GGTGTTTAAC CAGGCAGAGA TTCTGGGGCA CTCCCGCAAC TATATTTTAT
TGCGACGACT GCAAAACTCC GCAGGTTGAC GCTAAATTAT TTGCGCATAT TAAACAAATG
GCGCTTGATA ACGGCGGCGA TTTTTGGTTT ACCTTTCCCA ATGAAAAAAT ATTGCCCGAA
GGTTACAAGT GCACAAAATG CGGTGGAACG CATTTCGTAA AAGAAAAGGA TATTTTGGAC
GTTTGGTTGG ATTCGGGCTG CTCTTGGAGA GCGGTTTTGA AAGACCGCGG TTTAAAATAC
CCGGCCGACA TGTACCTTGA AGGGGCCGAC CAGCACAGGG GCTGGTTCCA AAGTTCGCTT
ATTCCTTCGG TTGCATTGGA AGGCAAATCG CCTTTTAGGC AAATTTTAAC ACACGGCTTT
GTGCTTGACC AGCACGGGCA CGCCATGCAT AAATCTTTAG GCAACTCCGT AGAACCGCAT
GAAGTTTTTG ATAAATACGG GGCGGACATT TTACGCCTTT GGGTAAGCTT AAGCGATTAC
CAGGACGATA TAAGAATATC TGATGAAATT TTAAGCGGCC CTGTTGACAG TTACAGAAGG
TTAAGAAACA CATTCCGCTA CGCTATGGGC AGTTTGTTTG ACTATGACCC GGAAATTCAT
AAAATGAAAC CTGAAGAAAT GACGGAAATT GACAGATATA TGTTAAGCAA GCTTGATACT
TTGATAAAAG AATCTCTTGA AAATTACGAT ATCTATGAAT TTAGAAAAGT AGTGCGCGGC
CTTATAGATT TTTGTATTTT GGACCTGTCG TCATTTATGC TTGATGCTTC CAAAGACAGG
CTTTATACTT TAGGCACGGA CGCCCAAACA AGGCGCAGCG CCCAAAACGC TTTATATGAA
ATTTTAATAG TTTTATTAAA ACTTTTAGCG CCAGTTTTGT CTTTCACCAC GGAAGAAGCC
TGGCAGGAAT TAAAAAAGAC TCCCGCCGGG GCTAAATTAG AGGAAAGTAT TTTCCTTTCC
GACTTTCCTA AATCAAGCAG TTTTAAACAT GACGCGAAGC TTGAGGAAAA ATGGTCAAAA
ATAAGAACGG TAAGAGAAAA CGTTTTAAAG AAACTTGAGG AAGCCAGAAG CGCGGGGCTT
ATAGGCTCGT CGCTTGAAGC TAATGTCATA TTTAGCACGA CCTCAAAAGA AGAGCTTGCT
TTCTTAGAGG AAAACAAAGA TCTTTGGCCG GAAATCGCCA TTGTTTCCAA AGCGGAAATT
AAAAATGAAG GCGGTGAGGA AATATTAATT ACAATTGAAC ACGCGCCGGG CGCGAAATGC
CCCAGATGCT GGCAATGGAA AGAAGATATA GGCGAAAATT CCGCACATGC TGAAGTTTGC
GTTCGCTGCG CGGGCGTGCT TGAAAAAGAG GGCATTACGG TGAATGAGGA TGTAAACGTA
AATGTCTAA
 
Protein sequence
MSNNTKPNNN KYSKTVLLPK TDFPMRAGLA QKEPKFVEFW KTIKLYEKMQ KLRDGKEVFM 
LHDGPPYANG RIHIGHAMDK TLKDMVLKSR HMQGFQTPYI PGWDCHGLPI EQALMKEMKI
DKKSIKEEDV PGFRKKAREF ADKFVDIQMQ GFERLGVQGD WANYYSTMAK RYEGNVIGVF
LDFIEKGLAY RGTKTIFWCP TCETALADAE TEYKDKVSQS IYLRFKLAEP FKGKANVSLV
IWTTTPWTIP ANKATAVNKD EDYVLLKDNK TGEYYIVADK LAENFKNKSG YDITKEEKFS
GNDLVGLKYK HPLLDRLNPV IWTDFVAMDT GVGLVHVAPA HGEDDAKAGQ IWNLEVFGPV
DEKGIFTKEA GEFAGQHIFK ANAEIIKRLN ELGNLIKEET IDHSYPHCWR CKQPIIFRAT
EQWFLSIDGQ NLRSELTKAV ESVDFYPKAG VSRIGNMVKM RPDWCLTRQR FWGTPATIFY
CDDCKTPQVD AKLFAHIKQM ALDNGGDFWF TFPNEKILPE GYKCTKCGGT HFVKEKDILD
VWLDSGCSWR AVLKDRGLKY PADMYLEGAD QHRGWFQSSL IPSVALEGKS PFRQILTHGF
VLDQHGHAMH KSLGNSVEPH EVFDKYGADI LRLWVSLSDY QDDIRISDEI LSGPVDSYRR
LRNTFRYAMG SLFDYDPEIH KMKPEEMTEI DRYMLSKLDT LIKESLENYD IYEFRKVVRG
LIDFCILDLS SFMLDASKDR LYTLGTDAQT RRSAQNALYE ILIVLLKLLA PVLSFTTEEA
WQELKKTPAG AKLEESIFLS DFPKSSSFKH DAKLEEKWSK IRTVRENVLK KLEEARSAGL
IGSSLEANVI FSTTSKEELA FLEENKDLWP EIAIVSKAEI KNEGGEEILI TIEHAPGAKC
PRCWQWKEDI GENSAHAEVC VRCAGVLEKE GITVNEDVNV NV