Gene Sare_2078 details

Gene Information

Locus tag: Sare_2078
Symbol:
ID: 5706798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2388605
End bp: 2390086
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 71%
IMG OID: 641271564
Product: methyltransferase type 12
Protein accession: YP_001536935
Protein GI: 159037682
COG category:
COG ID:
TIGRFAM ID:
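
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(2388605, 2390086))  # 1482
print(protein_length(1482))           # 493
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.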


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0386809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGTC CGCCGGTCGA CGGCGGGGTT GGGCACGGAT CGGGTGCGGG CGGTCCGGCA 
TCGGGTGGAG CACACGACGC ATTGGTGGAC CTGCCCGGGG TGGGCGCCTG GCAATGGGTT
CCCGGCCCTG AACGGCCGAC CCTGCTGGTC GCTCCCACTG CGCCCGGCAC CGCCGGACGG
ACGAAACCGG ATGCGGAACA GACGTTCGTC GACGAGGCGA AGCCGGCGTC GGCGGCCGGG
GACGTTGCTG TCGCCGGCGC CGACCTGTCC TCGCTTCCTC AGCTGAACCG ACTGATGGAC
GAGGTCGCCC TGCTCGCGAT GGCTCGGGTG CTGCACCGGG CGCGGCTCTT CCGCGACGGC
GCGAAACACG ACACCGGACA GGTGCTGGAC GTGTTGCGGG TAGCACCGCG ACACGCCTGG
ATCGTCCGCC GTTGGCTGTC CACGCTGGTG ACCGAGGGAA GGCTCCGGTA CGACCCGAGC
ACCGGCCACC ACCACGATCT CGTCGCGCCG GATCGCGTCG GGTACGCCCG TGCCCGGCGG
GAGCTGGACG AGGCGCGGCG CGGCCTTGGC TACCCGCCAT CGATGACCCG GTTCTTCGTG
GCCACGGCCG AGCGGCTGCC GGCCCTGCTG CGGGACGAGG CCGGGGTGCA GGAGCTGCTG
TTCCCGGACG GTTCCACCGA CACCGCCGAG GGTAACTATC GGGACAACCT ACCCAGCCGG
TGGGTCAACC ACGCGGCGGC CGCGTTGATC GCCGGCGGCA CCCACAGCTC GAGTGGTCCA
CTGCGGATCC TGGAGGTGGG CGCCGGGATC GGGGCGACCA CCCGGCCGGT GCTCGACGCG
CTGGCCGGCA CCGAGCTGGA CTACCTGTTC ACCGATGTGT CGCGCTTCTT CCTCACGTCG
GCGCGGACGC TCCTCGGGAA CCGTCCGGGG ATGCGGTTCG GCCTGTTCGA CATCAACCGA
CCGCCGGTCG GCCCGGAGTT CACGCCCGGG TCCCGAGACG TCATCCTGGC CGCGAACGTC
CTCCACAACG CTCGCCACGT GGGCCGGGCG CTGGCCGCAC TGCGGGAGCT GCTGTCCCCG
GACGGGCTGC TGGTGCTCGT CGAGTCCTGT CGCGAGCACT ATCAGGCGCT CACCTCCATG
TACCTGCTCA TGTCTCCATC CGCCGAGGAA CAGCGCTGGT TCACCGACGT GCGATCCGGA
CAGGACCGGG TGTTCCTCAG TACGGCCGAG TGGGTCGACC AGTTGGACGC GGCCGGCTTC
GATCCGCTGC CGGTCCTGCC CGGCAACGGT CATCCGTTGG CTGACGCGGG ACAGCGGGTG
ATCGCCGGCC GGGTCCGATC CAACCTCGCA CGCCCGGATC CAGTGCTCGT CGAAGCGACC
CTCGCGGCCC GGTTGCCGCC AAGTGATCGC CCCGAACGGA TCCACGCCGT GGACCGTGTC
ATATCCGCCC CGCCCACTGT CCCGATGGGA GAACCTCGTT GA
 
Protein sequence
MSGPPVDGGV GHGSGAGGPA SGGAHDALVD LPGVGAWQWV PGPERPTLLV APTAPGTAGR 
TKPDAEQTFV DEAKPASAAG DVAVAGADLS SLPQLNRLMD EVALLAMARV LHRARLFRDG
AKHDTGQVLD VLRVAPRHAW IVRRWLSTLV TEGRLRYDPS TGHHHDLVAP DRVGYARARR
ELDEARRGLG YPPSMTRFFV ATAERLPALL RDEAGVQELL FPDGSTDTAE GNYRDNLPSR
WVNHAAAALI AGGTHSSSGP LRILEVGAGI GATTRPVLDA LAGTELDYLF TDVSRFFLTS
ARTLLGNRPG MRFGLFDINR PPVGPEFTPG SRDVILAANV LHNARHVGRA LAALRELLSP
DGLLVLVESC REHYQALTSM YLLMSPSAEE QRWFTDVRSG QDRVFLSTAE WVDQLDAAGF
DPLPVLPGNG HPLADAGQRV IAGRVRSNLA RPDPVLVEAT LAARLPPSDR PERIHAVDRV
ISAPPTVPMG EPR
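
The relationship between the gene sequence, the GC content field, and the protein sequence can be explored with a short, dependency-free sketch. It assumes that the sequence is given 5' to 3' on the coding strand, that translation table 11 shares the standard codon assignments, and that an initiating GTG or TTG is read as methionine. The helper names (`clean`, `gc_content`, `translate`) and the reduced start-codon set are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G);
# translation table 11 is assumed to use the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 15 bases of the gene sequence above.
print(translate("GTGAGCGGTC CGCCG"))  # MSGPP
```

The translated prefix matches the first five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` is one way to compare the record's protein sequence and GC content field against the nucleotide data.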