Gene Sare_4918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4918 
Symbol 
ID5707406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5583064 
End bp5584293 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content70% 
IMG OID641274312 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001539657 
Protein GI159040404 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.074362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0500799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTCA CCGACGCCGG CTACCGGCAA CGACGACAAC ACCTCGTCGC CCAACTACAC 
ACCACAGGAC ACCTGACCAG CCCCACCGTG GCGGGAGCGT TCGCGTCGGT CGCCCGCCAC
CCGTTCGCCC CCGCCGTCTA CGACGTGAAT TCCCACGGGC AGGTCGGTGA TGTGCTGCAC
GCCACCCGGC CGGAGCACCA GGACGCCTAC CTCGCGGCCG TCTACGGCGA CGAAGCGATC
GTCACCCAGA TCGCCGAAGA TGGGCGGCCG ACCAGCTCAT CCACCCAGCC CGGGGTGATG
GCGGTGATGT TGGAAGCCCT CGACCTGCAA CCCGGCATGA CCGTGTTGGA GATCGGCACC
GGCACCGGCT ACAACGCCGC CCTCCTGGCG CACCTACTCG GGGACGAGGC GGTCACCTCC
GTCGACATCG ACCCGCACCT GGTCACCACC GCCACCACCG CCCTTCACCA CGCCGGCTAC
CGGCCGACCG TGGTCGCCGC GGACGGCCTG GCCGGATACC CGGCGCGGGC ACCCTACGAC
CGGCTGATCG CCACCTGCTC GGTGCGCCGC GTACCAGCAG CCTGGCTACG ACAAGCCAAG
CCAGGTGGGC TGGTCCTGGC CAACCTGTCC TACGGCGTCG TACCGCTGCG AGTCGATGAC
ACCGGGGCAG GTCACGGCCG GTTCCTGCCG CAGGTAGCCG CGTTCATCGA GGCCCGACCC
GCCGACGGGC CGGTCGGACC GACCGTGGCC GACATGGTGT CATCGTGCAT GGGTAGCACC
GGCACCACAT CACCCGGCCA CAACCGCGAT GTCGCGCTGT TCGGCGACCC GTGCGGTGAG
TTCTGGTGGC GGCTTGCCGA GCCGAGCATC TACCACTGCA CCCTGCTCCC CGACGGCGAA
GTGGTCCACT GCCTCGTCGA CGCCGACACC GACTCCTGGG CACGGATCCA CGCCCAGGGC
TCGACTGTCA CCGTCACGCA AGGCGGGCCC CGCCGGATCT GGCACGCGGT GACCACCGCA
TGCCGCCGGT GGGATAGCGC AGGCCGCCCC ACACACGACC ACCTGGGCCT CACCGTTGAC
CGCAACGGCA CCCACACCCT CTGGATCGAC ACACCCGACC GACCACACAC CTGGCCACTC
GACGAGACCA GCCCGCATGT CCGGTGCCAA ACTCCAGGAA CCAGGTCAGC CTCCACGTCC
ACCGACCCGT CGCCGGACCA GGTTGACTGA
 
Protein sequence
MTLTDAGYRQ RRQHLVAQLH TTGHLTSPTV AGAFASVARH PFAPAVYDVN SHGQVGDVLH 
ATRPEHQDAY LAAVYGDEAI VTQIAEDGRP TSSSTQPGVM AVMLEALDLQ PGMTVLEIGT
GTGYNAALLA HLLGDEAVTS VDIDPHLVTT ATTALHHAGY RPTVVAADGL AGYPARAPYD
RLIATCSVRR VPAAWLRQAK PGGLVLANLS YGVVPLRVDD TGAGHGRFLP QVAAFIEARP
ADGPVGPTVA DMVSSCMGST GTTSPGHNRD VALFGDPCGE FWWRLAEPSI YHCTLLPDGE
VVHCLVDADT DSWARIHAQG STVTVTQGGP RRIWHAVTTA CRRWDSAGRP THDHLGLTVD
RNGTHTLWID TPDRPHTWPL DETSPHVRCQ TPGTRSASTS TDPSPDQVD