Gene Sare_4394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4394 
Symbol 
ID5706102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4965779 
End bp4966888 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content67% 
IMG OID641273812 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001539162 
Protein GI159039909 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.138918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00918813 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGACC CGACGGCAGG GCCACGGCAC GAGCTGGCCG CCAAACTCAC CACCGCCGGC 
GCCCTCGGAT CACCGGCATG GATTTCAGCG TTCGAACAGG TACCGCGGCA CCTGTTCGTC
CCCGCCTGCT GGCACCGCAT CGCCGCCGGC CTGGAGTATC TCGACAGCGC GAACCCCGAG
CAGCGCGACC ACTGGCTAGC TGTCTGCTAC TCCGATACCT CCCTGGTGAC CCAGGTCGAC
TCCTCGGGGA CCGCCACCAG CGCATCCAGC CAACCGTCCG TCATGGCCAT CATGCTCGAA
GCGCTCGACG TCGCCGCGGA CAATACTGTC TTGGAGGTCG GCACCGGCAC CGGATACAAC
GCCGCGCTGC TGTGCCACCG TCTCGGCGAT GACCGGGTGC ATACGGTCGA GTACGACCAG
GCCCTGTCCA CCACCGCCAC CGCCGCTCTT GCGCAGGCCG GCTATCACCC CGCGATGCGG
GTAGGTGACG GCGCGGCAGG CTGGCCCGAG CAGGCACCAT ACGACCGGAT CATCGCCACC
TACGGCACCG AGCGAATCCC GCCGACCTGG CTGCGCCAGT GCACACCAGG GGGCGTCATC
GTCGCCAACC TCGGCCTCGG AGTGATCGCC CTGCACGTCG ACCAGCATGG CCACACGGGC
TCAGGCCGTT TCCTGTCCCG AGCGGCCTTC ATGAACTCCC GCGCCGGCGG CGATGCGGCG
ACGGTCCCGC AGGCCGCGTT CGACCCCGCA ATCGTGGGCC TCGGACACCC AGCAGACACA
CCACCGGACT TGAGGGACGA CAACTTCACG GCCTGGCTAC ACTTGCACAG CCCGGAAATC
GTGCAGGTCA CTCTCCCCGG CCCGGACGAC TCACTCAGCC AAGCGGAACA CATTTTCGCC
AATCGCGCGG GCTCCTGGGC GAGAGTCGGC AACGGGCGGA TAACGCAGGT AGGGCCAATC
TGGCGAGACG TACACGACGC ACACACACGC TGGGCGCACG CCGGTCGTCC CGAGGTGGAA
CAGATCGGAC TGACCGTCCG CGACGACGGT CACCACACGC TATGGGTGGA CAACCCGTCC
AGCACACAGC GATGGAATCT CACCCCATGA
 
Protein sequence
MTDPTAGPRH ELAAKLTTAG ALGSPAWISA FEQVPRHLFV PACWHRIAAG LEYLDSANPE 
QRDHWLAVCY SDTSLVTQVD SSGTATSASS QPSVMAIMLE ALDVAADNTV LEVGTGTGYN
AALLCHRLGD DRVHTVEYDQ ALSTTATAAL AQAGYHPAMR VGDGAAGWPE QAPYDRIIAT
YGTERIPPTW LRQCTPGGVI VANLGLGVIA LHVDQHGHTG SGRFLSRAAF MNSRAGGDAA
TVPQAAFDPA IVGLGHPADT PPDLRDDNFT AWLHLHSPEI VQVTLPGPDD SLSQAEHIFA
NRAGSWARVG NGRITQVGPI WRDVHDAHTR WAHAGRPEVE QIGLTVRDDG HHTLWVDNPS
STQRWNLTP