Gene Sare_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1044 
Symbol 
ID5706543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1168917 
End bp1170932 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content70% 
IMG OID641270560 
Productmethionine--tRNA ligase 
Protein accessionYP_001535944 
Protein GI159036691 
COG category[G] Carbohydrate transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0143] Methionyl-tRNA synthetase
[COG0662] Mannose-6-phosphate isomerase 
TIGRFAM ID[TIGR00398] methionyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTTCA GCAAGTACGA CCCGGATCTC CTGGAACCGG CCTTCGGCAT CGGCATGAGC 
GCCGTCGAGG GGCTCGGGAC AGGTGCCGGC TGGGGGCGGG TCGCGCCGGG TGGGGCGTCC
ACCAGCCACC AGCACGACGA AACCGAGTTC TTCGTTGTCG TCGCCGGTGA GGGAGAGTTC
GTCGTGGATG GTCGCCGCCA TCCGGCCCGG CCCGGCACGC TGGCCCTCTT CGAACCGTTC
GAGTCGCACG TGCTGGAGAA CACCGGCGAT GACGACCTGG TCTTCCTCAC GCAGTACTGG
CGCGACACCG GACGTGCCCT GATCTCGGCC GGGAACAACG AGCGGCTCGC GTTCGGTGAC
CGCCCGGTGT TCGTCTTCTC CACCCCGCCG ACGCCCAACG GCGACCTCCA CCTCGGGCAC
CTTTCCGGCC CGTACCTGGG CGCGGACGCG TTCGTCCGGT TCCAGCGGAT GAACGGTACC
GAGGCATGGC ACCTGACGGG CAGCGACGAC TACCAGAGCT ACGTGGTGAA CACCGCACGG
CGGGAGGGCC GCGCGCCCGC GGAGACCGCC GCACGCTACA GCGCCGAGAT CGCACAGACC
CTGGCCATGA TGGACATCAA CCCTGACCAG TACACGGTCA CTGACACCGA ACCCGGCTAC
CGGCAGGGCC TGCGGAACTT CTTCTCCCAG GTGATCGCCT CCGGGCGAGC CACGGTCACC
GAGCGGGACG CCCTCTTCGA CGGTGAGAGC GGTCGATACC TGTACGAGGC GGATGTCCGG
GGTGGCTGCC CCGGCTGCGG CGAGAGCACG AGCGGCAACA TCTGCGAGGA GTGCGGCGAG
CCCAACACCG TGGTGGACCT CAGGCAGCCG AGGTCGAACG AGTCCGACGC CGAGCCGCGG
CGGGCCCCGC TGGCCCGCTG GTCGCTGCCG CTGCACCAGT TCCGTGACGA AGTCTCCACC
CACCACAGCC TCGGCCGCGT GCCTGCCCGG CTGCGGGAAC TCGGAGACCG CCTCTTCGCC
CGCCCCGTCC TGGACATCCC GCTGTCGCAC CCCGCCGACT GGGGCGTCCC CCCGGCGGAG
AAGGACGTCG ACGACCAGGT CATCTGGGTC TGGCCCGAGA TGTCGTACGG ATTTCTGCAC
GGCATCGAGG CGCTGGGCGC CCGACTGGGC CGCGGTTGGC AGGCCGCCGT ACCCGAGCAG
GACTGGAAGA TCGTCCACTT CTTCGGCTAC GACAACAGCT TCTACCACGC GGTGCTCTAC
CCGGTGCTGT ACCGGCTGGC CCATCCCGGA TGGCAGCCGG ACATCGACTA CCACGTCAAC
GAGTTCTATC TACTGGAGGG CGAAAAGTTC TCGACCAGCC GGCGGCATGC CATCTGGGGC
AAGGAGATCC TCGACGAGGA CACCGTCGAC GCGGTCCGCT ACTTCCTCAG TCGCACCCGG
CCCGAGGCCG AGCGCACCAA CTTCCGGCGC GCCGACTTCC GGTCGGTGCT GCACGACACG
CTGATCGGCA CCTGGCAGCG CTGGCTGAAC GACCTCGGCG CCCGGATCGC CAGGCACTAC
GACGGCAAGG CTCCCGACGC GGGCATCTGG ACGCCGGAGC ACTCGGCGTT CCTGGCCCGG
CTCGGCGGCC GGCTCGACGC GGTCACCGGC TGCCTCCGCG CCGACGGCTT CAGCCTCAAC
CAGGCCGCTG CGGAACTCGA CGCGTTGGTC GCGGAGACCC TACGCTTCGT CGGCCGGGAG
GCCCGTACCG CGCGGAGCGC CGGGTGGCAG GACGAAGCCC GTACCGCGGT CGCCTTGGAA
CTGGCCGCGG CCCGCCTCCT CGCCTCGGTC GCAACGCCGC TGATGCCACG CTTCGCGGGT
CACCTGGCCA CCGCTCTCGG CCTGCCGAAG CCCACCGTAT GGCCACAAGC GGTGGAACTC
GTTCCACCGG GGAGCGCCGT CTGCCTCGCC ACCACCGTGT TCTTCAGGCC CACCACCGAG
CCGGCCGGGA ACGAGGACCG GGGGTCGGAT CGATGA
 
Protein sequence
MIFSKYDPDL LEPAFGIGMS AVEGLGTGAG WGRVAPGGAS TSHQHDETEF FVVVAGEGEF 
VVDGRRHPAR PGTLALFEPF ESHVLENTGD DDLVFLTQYW RDTGRALISA GNNERLAFGD
RPVFVFSTPP TPNGDLHLGH LSGPYLGADA FVRFQRMNGT EAWHLTGSDD YQSYVVNTAR
REGRAPAETA ARYSAEIAQT LAMMDINPDQ YTVTDTEPGY RQGLRNFFSQ VIASGRATVT
ERDALFDGES GRYLYEADVR GGCPGCGEST SGNICEECGE PNTVVDLRQP RSNESDAEPR
RAPLARWSLP LHQFRDEVST HHSLGRVPAR LRELGDRLFA RPVLDIPLSH PADWGVPPAE
KDVDDQVIWV WPEMSYGFLH GIEALGARLG RGWQAAVPEQ DWKIVHFFGY DNSFYHAVLY
PVLYRLAHPG WQPDIDYHVN EFYLLEGEKF STSRRHAIWG KEILDEDTVD AVRYFLSRTR
PEAERTNFRR ADFRSVLHDT LIGTWQRWLN DLGARIARHY DGKAPDAGIW TPEHSAFLAR
LGGRLDAVTG CLRADGFSLN QAAAELDALV AETLRFVGRE ARTARSAGWQ DEARTAVALE
LAAARLLASV ATPLMPRFAG HLATALGLPK PTVWPQAVEL VPPGSAVCLA TTVFFRPTTE
PAGNEDRGSD R