Gene Sare_2149 details

Gene Information

Locus tag: Sare_2149
Symbol:
ID: 5706967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2471776
End bp: 2473386
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 72%
IMG OID: 641271634
Product: methionine--tRNA ligase
Protein accession: YP_001537005
Protein GI: 159037752
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0143] Methionyl-tRNA synthetase
TIGRFAM ID:
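
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes inclusive 1-based coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2471776, 2473386)
print(length)                  # 1611
print(protein_length(length))  # 536
```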


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.147199
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCTG ACCCGCACCG ACCGGCCGTC ATCATCGCCG CGACCCCGAC TCCCAACGGT 
GACCTGCACC TGGGCCACCT CGCCGGGCCG TACCTCGCCG CCGACGTCTA CGCCCGCCAC
CTGCGCATGT CGGGTCGCCC CGTGGTCTAC ACCACGTGCA CCGACGACAG CCAGAGCTAC
GTGCTGACCA CCGCACGCCG ACAGGGGGTG CCACCTCGAC GTCTGGCGGC CACCGCCGCC
ACAGCTATCG CCCGCTCGCT GGACGCCGTC GGGATCTCCA CCGCGGGGCT CCCCCCGACC
GGCGACACCT ACCGCGGCAC GGTGCTCGAC TTCGTTGGCC AACTGCATGC GGCGGGCCGG
TTCCGACAGC GCCGCGTACG GCTGCCGTAC GCCCGCCACG CGGGGATGTA CCTCTACGAC
GGGCTGCTCT CCGGCACCTG CCCGACCTGC CTGTCGGACA GCTCCGGTGG GGTCTGTGAG
GCCTGCGGAC ACCCGAACAC CTTCGACGAC CTGTTGGATC CTCGGTACAG CCTCGATCCG
GACGACCCGG TGGAGCCGCG GGTCGCCGAC GTCCTCGTGC TGCCCGCGGA GGACTACCGG
GGTCGGCTCG CCGAGTACTA CGCGCGGCAC ACGCCGCGGT GGCGCCCGCA CGCGCGGCGG
CTCGTCAACG AACTACTCGC CCGGCCGCTA CCTGACATCC CGGTCACGGT ACCGGGCAGT
TGGGGCATCC CCGCGCCGTT CGCGCAGACC CCGGGGCAGG TCCTCTATCC GTGGATCGAG
GCGATGCCCG CCTCGATCTA CAGCACCTGG TGGTCGCGCT CGCCGCGGGG CGCCACCGGC
GGGAACATCG ACGCCCCGTG GCGGGCCGAG ACGGACACCG AGCTGGTCTA CTTCCACGGA
TACGACAACG TGTACCACTG GGGCCTGGTC GATCTGGTCC TGTTGTTGGC GCACGGTGAT
CGGTACGTGC TGCCCGCGGC GAACGTGTGC AACGAGTTCT ACGAACTGGC CGGAGCCAAG
TTCTCCACCA GTCGCGACCA CCTGGTGCAC GCACCCGAGG TCCTCGCCGA GGTACCCCGG
GATCTGTTGC GCTTCTACCT GGCGCTGACC GCCCCGGAGT ACCAGCGATC CACGTTCGAC
CGGGCGGCCC TGCCCTCGGT GACGCAGACC CGGCTGGTCG AGCCATGGAA CCGTCTCTCC
CGGGCCCTCG ACCGGGCACT CGACGCGTCA TCCATGCCGG CCCGGCTGCC CACGGACGAG
GCTGGCCGGC GCCGCGCCGC CATCGTCGCC GACCGCTTCC GCACCTGGTA CGGACTGCCG
GAGTTCAGTA TCCGCCAAGC AGCCGACACG CTCAGCACGC AGGTCGATCG GCTGGCCCGG
CAGGCAGAGG TGCTCACCGG GGATCCGACC GACACCGGTG GCCTCGTGCT GCAGGTCCGC
GCACTCCTCG CCGGCGCCGC TCCGCTGCTG GTCGATACCG CGGCAGCCGC CGCCGCGTCC
GGTTGGGAGA GCGGCGACGC CACCGCGCCG TCGACCACCG TCGCCGCCCT GCGCCTGCCA
CCACTGGCCG GTGTCCGCGC CCCGCAGGAC GGACGCCTAC CGGTGCGGTG A
 
Protein sequence
MTADPHRPAV IIAATPTPNG DLHLGHLAGP YLAADVYARH LRMSGRPVVY TTCTDDSQSY 
VLTTARRQGV PPRRLAATAA TAIARSLDAV GISTAGLPPT GDTYRGTVLD FVGQLHAAGR
FRQRRVRLPY ARHAGMYLYD GLLSGTCPTC LSDSSGGVCE ACGHPNTFDD LLDPRYSLDP
DDPVEPRVAD VLVLPAEDYR GRLAEYYARH TPRWRPHARR LVNELLARPL PDIPVTVPGS
WGIPAPFAQT PGQVLYPWIE AMPASIYSTW WSRSPRGATG GNIDAPWRAE TDTELVYFHG
YDNVYHWGLV DLVLLLAHGD RYVLPAANVC NEFYELAGAK FSTSRDHLVH APEVLAEVPR
DLLRFYLALT APEYQRSTFD RAALPSVTQT RLVEPWNRLS RALDRALDAS SMPARLPTDE
AGRRRAAIVA DRFRTWYGLP EFSIRQAADT LSTQVDRLAR QAEVLTGDPT DTGGLVLQVR
ALLAGAAPLL VDTAAAAAAS GWESGDATAP STTVAALRLP PLAGVRAPQD GRLPVR
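
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that and to compute a GC fraction. The helper names `parse_blocked_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the gene sequence.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here as a small sample.
sample = parse_blocked_sequence("ATGACGGCTG ACCCGCACCG")
print(len(sample), gc_fraction(sample))  # 20 0.7
```

Applied to the full gene sequence, the result can be compared with the GC content field reported in the Gene Information section.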