Gene Emin_0566 details

Gene Information

Locus tag: Emin_0566
Symbol:
ID: 6262784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 618459
End bp: 620210
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 43%
IMG OID: 642611037
Product: threonyl-tRNA synthetase
Protein accession: YP_001875458
Protein GI: 187250976
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
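
The coordinate and length fields in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 618459
END_BP = 620210


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1752
print(protein_length(length_bp))  # 583
```

Both results match the Gene Length (1752 bp) and Protein Length (583 aa) fields.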


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAATA CTGATTTGCA AACCGCAAGG CACTCGCTTT CCCATATTAT GGCTCAGGCC 
GTGCAGGAAT TATTTCCGGG CACGCGTTTA GGCATAGGTC CCGCGATTGA AAATGGGTTT
TATTATGATT TTGAGTCGGA CCATAAATTT GTGCCCGAGG ATTTAAAAGC CATTGAAGCT
AAAATGAAGG AAATTATTAA AGCCAAACAG CCGTTTGCTT GTAAAAACAT GCCAAAGGCT
GAGGCGGTTA AATTCTTTTC AGACAAGGGC GAACACTTTA AAGTAGAACT TATTAACGAA
CTTGAGGACG GCAGCATTTC CGTTTATACA AACGGGGATT TTGTTGACCT CTGCAAAGGC
CCGCACGTTG AGCACACGGG TAAAATAAAC AATTTTAAAC TTACCCATAT CGCAGGAGCT
TACTGGAAAG GCGATGAAAA ACGCCCCATG CTTCAGCGCA TTTACGGCCT TGCTTTTGAA
ACTAAAGACA AGCTTAACGA ATATATTAAA CAACAAGAAG AAGCCGCCAA ACGCGACCAC
CGCAAATTAG GCATTGAACT TGATTTGTTC AGCATCAGTG AAGACATAGG CCCCGGTTTA
ATTTTAATGC ATCCTAAAGG TGGTATGTTA AGAAAGGTTG TTGAAGACTG GATTAGAGAC
GAAAATATTA AACACGGTTA TGACTTGGTT TATTCGCCCC ATATCGCAAG GCTGCACCTT
TGGCAAAAAA GCGGACACGC CAATTTCTAT TCCGAAAATA TGTTTCAGCC TATAGAAGTT
GACGACCAGC AGTACCAGCT TAAACCTATG AACTGTCCTT TCCATATAGC TATTTATGAG
TCGCACTTAA GGTCATACCG TGATTTACCA GTAAGGCTTG CCGAACTGGG TACGGTTTAC
AGATATGAAC GCAGCGGCGT TGTTCACGGT CTTTTGCGTG TAAGAGGTTT TACGCAAGAC
GACGCGCACA TTTTCTGCAC TCCCGAACAA ATGAACGGCG AAATTGAGGA TTGCTTTAAC
TTTGCCATGT TGGTTATGAA AACATTTGGT TTTGAAAAAT TCTCGGTAGA GCTTTCCACC
TGGGATGAAA CCAAACCTGA AAATTACACC GGTGGCAAAA AAGACTGGGA GGAGGCGCAA
AACGCCCTTG AAAGCGTGTT AAAGAAGAAC AATGTGCCTT TTACCGTACA TGCGGGTGAA
GCTGCTTTTT ACGGTCCCAA AATCGATATT AAAGTTATGG ACGCTATAGG AAGATATTGG
CAGCTTTCCA CAATTCAGTT TGACTTTAAC CTGCCCCAGA AATTTGAGCT TGAATATGTT
TCGCCCGAAG GCAGAAAAAG ACCTCTTATG GTGCACAGGG CTTTACTAGG CTCAATTGAA
AGGTTTTTAG GCGTTTTAAT TGAACATTAC GCGGGCTTAT TTCCGTTATG GCTTGCGCCT
GTGCAGGTAA AACTTTTAAC TCTTACGGAC GACCAGTTTG ACTTTGCCAA AGACGTTGTT
AAACAAATGA AGCTTGCAGG TTTAAGGGCA GAGCTTGACT CCAGGCCTGA AAAACTTGGC
CTTAAAATAA GGGAGGCGCA TGTTGAAAAG ATACCCTACT CCATAGTGAT AGGCGCTAAA
GAAGCGGAAA ACAAAACGCT TACACTTAGG TTAAGAAGCG GCAAAAATGT TGAAGGCCTT
AGCGTTGGGG ATGTTATAGC AAAACTTAAG GAAGAGGCCG ATACAAAGAG TTTAAAACCT
TTGTTTGAAT AA
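
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch; `gc_percent` is a hypothetical helper, and how the database rounds its reported value is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, pasted with its block spacing intact.
first_row = "ATGTCAAATA CTGATTTGCA AACCGCAAGG CACTCGCTTT CCCATATTAT GGCTCAGGCC"
print(round(gc_percent(first_row)))
```

Passing the full gene sequence instead of the first row gives the value to compare against the GC content field.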
 
Protein sequence
MSNTDLQTAR HSLSHIMAQA VQELFPGTRL GIGPAIENGF YYDFESDHKF VPEDLKAIEA 
KMKEIIKAKQ PFACKNMPKA EAVKFFSDKG EHFKVELINE LEDGSISVYT NGDFVDLCKG
PHVEHTGKIN NFKLTHIAGA YWKGDEKRPM LQRIYGLAFE TKDKLNEYIK QQEEAAKRDH
RKLGIELDLF SISEDIGPGL ILMHPKGGML RKVVEDWIRD ENIKHGYDLV YSPHIARLHL
WQKSGHANFY SENMFQPIEV DDQQYQLKPM NCPFHIAIYE SHLRSYRDLP VRLAELGTVY
RYERSGVVHG LLRVRGFTQD DAHIFCTPEQ MNGEIEDCFN FAMLVMKTFG FEKFSVELST
WDETKPENYT GGKKDWEEAQ NALESVLKKN NVPFTVHAGE AAFYGPKIDI KVMDAIGRYW
QLSTIQFDFN LPQKFELEYV SPEGRKRPLM VHRALLGSIE RFLGVLIEHY AGLFPLWLAP
VQVKLLTLTD DQFDFAKDVV KQMKLAGLRA ELDSRPEKLG LKIREAHVEK IPYSIVIGAK
EAENKTLTLR LRSGKNVEGL SVGDVIAKLK EEADTKSLKP LFE