Gene Mlg_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0438 
Symbol 
ID4268291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp486713 
End bp487912 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID638125168 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_741282 
Protein GI114319599 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000725763 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0537593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGG CAAATGATGC CATGGCACTG TTGCGCCGCG GTGCCGAGGA GATCCTGCTG 
GAGAACGAGC TGCGGGACAA GCTCCAGTAT GATCGTCCGC TGCGCGTCAA GGCCGGCTTC
GATCCCACCG CGCCGGACCT TCACCTGGGC CATACGGTGC TGATCAATAA GCTGCGTCAG
TTCCAGGATC TGGGGCATGA GGTCTACTTC CTGATCGGTG ATTTCACCGG GATGATTGGG
GACCCCAGCG GCAAAAGCGC CACCCGCCCC CCACTTACTC GCGAGGAGGT GCGCGACAAC
GCCCGCACCT ACGAGGAACA AATCTTCCAG GTGCTGGATC CGGAGCGCAC CCAGGTGGTG
TTTAACTCCG ACTGGATGAA CGATTTCTCC GCTGCGGACA TGATCCGGCT GGCCTCCCAC
CATACCGTCG CGCGCATGCT GGAGCGCGAC GACTTTCATA AGCGTTATGC CGCCCGCCAG
CCCATCGCCA TCCACGAGTT CCTCTACCCG CTGGTCCAGG GGTACGACTC GGTCGCCCTG
AAGGCGGACG TCGAGCTGGG CGGCACCGAT CAGAAGTTCA ACCTGCTGGT GGGGCGGGAG
CTACAGAAGG CCTACGGCCA GTCTCCGCAG ACGGTGCTGA CCATGCCCTT GCTGGAGGGG
CTGGACGGCG TGCAGAAGAT GTCCAAGTCG CTGGGCAACT ACGTGGGGAT CAAAGAACCG
GCTGAGGAGA TGTTCGGAAA GCTGATGTCC ATCTCCGATG ACCTCATGTG GCGCTACTTC
CTGTTGCTCA GCTTCCGACC GGAGAGCGAG ATCGAGCGGC TCCGCCGTGA CGTCGCCGAA
GGGCGCAATC CCCGGGACGT CAAGTTTGAA CTGGCCGAGG AGATCGTCAC CCGATTTCAC
GACGCGCGCG CGGCGGCGCG CGCGAGAGAG GTGTTCATCG CCCGGTTCCG GAAAGGCGCC
ATGCCGGAGG AGATGCCGGA ACACACCCTG CCCGCCGACG ATGGCGGCCT GGCCCTCGAT
CGGTTGCTCA AGGGGGCCGG CCTGGTGGCC AGCACCTCGG ACGCCCGGCG CATGCTCAAG
CAGGGCGCGG TGCGCATCGA TGGCGAGCGT GTGGAGGATC AACGACTTGT GGTGCCCGCC
GGCGAGACCC ATGTCTATCA GGTGGGCAAG CGCCGTTTCG CCCGTGTGAC CGTGGCGTGA
 
Protein sequence
MTEANDAMAL LRRGAEEILL ENELRDKLQY DRPLRVKAGF DPTAPDLHLG HTVLINKLRQ 
FQDLGHEVYF LIGDFTGMIG DPSGKSATRP PLTREEVRDN ARTYEEQIFQ VLDPERTQVV
FNSDWMNDFS AADMIRLASH HTVARMLERD DFHKRYAARQ PIAIHEFLYP LVQGYDSVAL
KADVELGGTD QKFNLLVGRE LQKAYGQSPQ TVLTMPLLEG LDGVQKMSKS LGNYVGIKEP
AEEMFGKLMS ISDDLMWRYF LLLSFRPESE IERLRRDVAE GRNPRDVKFE LAEEIVTRFH
DARAAARARE VFIARFRKGA MPEEMPEHTL PADDGGLALD RLLKGAGLVA STSDARRMLK
QGAVRIDGER VEDQRLVVPA GETHVYQVGK RRFARVTVA