Gene Sare_0720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0720 
Symbol 
ID5704525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp801008 
End bp802690 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content71% 
IMG OID641270238 
Productglycosyl transferase family protein 
Protein accessionYP_001535630 
Protein GI159036377 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000257265 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCCAGG CGTCAGCAGC GCACCCCGCC GACCAAGCCG GGCCCGAGCA GCCCGAGCCC 
GCGGACCGGG ACTCGTCCCT CGGCAGGGGT GTCCCCGGCG TCGTCCGCCG CCGCCTGGCC
GCCGTCGACA GTCGGTTCGA CAGGTGGTCC TGGCTGGCCA CCGGAGTGAT CGTCGCGATC
GCGGCGATCC TGCGTTTCGT CAACCTCGCC CACCCGGCTG GCAAGATCTT CGACGAGGTC
TACTACGCCC GGGACGCCTG GGGGCTGGTG GACCGGGGTG TCGAGTGGAA CTACGAGGAC
GGCGGCCCGT CGTACGTGGT CCACCCGCCG CTGGGCAAGT GGCTCATCGG GCTCGGTGAG
TGGGCCTTCG GGTACAGCGA CACCGAACAC GGCGTCTCGG CCGCCGGGCA CCTGTTCACC
ACGAGCCCCG AGGTCGGCTG GCGATTCTCC GCGGCCCTCG TCGGCTCGCT GTCCGTGCTT
CTCCTGGTCC GGATCGCGCG TCGACTGTTC CGCTCCACCG TGCTCGGCTG CGCCGCGGGC
CTGCTGCTCG CCCTGGATGG CTTCCACCTG GTGCTGTCCC GTACCGCGCT GCTCGACATC
TTCCTGCTCT TCTTCGTGCT GGCTACGTTC GGCGCGCTGG TTCTGGACCG GGACGCCCGG
CGGCGGCGCT GGGCGAGTGC CCTCGCGACC GGGCTGGATC CGGGCCGCCC CGGGCCCGCC
GGCCGACCGG TCGGTGGGTG GCGGACCTGG CCGTGGTGGC GGCTGGCCGC CGGGGTCCTG
TTCGGCTGCG CCTGCGCGGT CAAATGGAAC GCGCTGTACT TCCTCCCGGC ATTCGTGCTG
CTGGTGGTGT TCTGGGAGGT CGGCGCCCGC CGCTCCGCCG GGGTACGCCG GCCGTGGCGA
CGCACCCTGC TCGACGAGGT GCCCTGGCTG GCGCTGGCCG GGCTGCTGGT GGTGGTCACC
TACATCGCCG CCTGGTCCGG CTGGCTGCTG GGTGACGAAG GGTACTACCG GGTGGTCGAC
GAGCAGCGAT GGCCGAGTGC CCCGTTGAGC GACACTCCCA TCATCGGGGC GCTACAGAAC
CTCATCGAGT ACCACCGAGC CGCCTACAAC TTCCACGCGC AACTCGACGA CCCCCACACA
TACCAATCCT GGCCCTGGCA GTGGCTGCTG CTGGGGAGAC CGGTCGCCTT CCACTGGTCT
GCCGAGGGCC CCTGCGGCGC GGTGAGCTGT GCCACCGAGG TGCTGCTGTT GGGCACACCG
CTGCTCTGGT GGTCGTTCCT ACCAGCCCTG GTCGCGCTCG CCTGGCTGGG CCTGGCGCGG
CGGGACTGGC GAGCCGGGAC GATCCTGCTC TGTGTGGCGG CCGGGCTGCT GCCCTGGTTC
TGGTACGCGC TGGACAGCGG CCGAACCATG TTCTCCTTCT ACACGGCCCC GTCACTGCCG
TTCCTGGTGC TCGCCGTGAC CTACGTGCTG GGTGTGATCA TCGGGCCGTC GCCGTCGACC
GCGGCGGACA CCGCGCCGGC GACCGGCGAC CGGGATCGAC GGCTCGTCGG AGCCGTCATC
GCCGGAGCGT ACGTCCTACT GGTGGCGTTC AACTTCGCCT ACTTCTATCC GATCTTCGTC
GGCGAGTCCA TCCCGTACGA CAGCTGGTCC AACCGCATGT GGCTGGACGG CCGCTGGATC
TGA
 
Protein sequence
MTQASAAHPA DQAGPEQPEP ADRDSSLGRG VPGVVRRRLA AVDSRFDRWS WLATGVIVAI 
AAILRFVNLA HPAGKIFDEV YYARDAWGLV DRGVEWNYED GGPSYVVHPP LGKWLIGLGE
WAFGYSDTEH GVSAAGHLFT TSPEVGWRFS AALVGSLSVL LLVRIARRLF RSTVLGCAAG
LLLALDGFHL VLSRTALLDI FLLFFVLATF GALVLDRDAR RRRWASALAT GLDPGRPGPA
GRPVGGWRTW PWWRLAAGVL FGCACAVKWN ALYFLPAFVL LVVFWEVGAR RSAGVRRPWR
RTLLDEVPWL ALAGLLVVVT YIAAWSGWLL GDEGYYRVVD EQRWPSAPLS DTPIIGALQN
LIEYHRAAYN FHAQLDDPHT YQSWPWQWLL LGRPVAFHWS AEGPCGAVSC ATEVLLLGTP
LLWWSFLPAL VALAWLGLAR RDWRAGTILL CVAAGLLPWF WYALDSGRTM FSFYTAPSLP
FLVLAVTYVL GVIIGPSPST AADTAPATGD RDRRLVGAVI AGAYVLLVAF NFAYFYPIFV
GESIPYDSWS NRMWLDGRWI