Gene Sare_1690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1690 
Symbol 
ID5705225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1951032 
End bp1952588 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content69% 
IMG OID641271193 
Productglycosyl transferase family protein 
Protein accessionYP_001536568 
Protein GI159037315 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0342764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00101578 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGTCC CCGACGTCAC CGTCATCACG GCGGTCTACA ACACCATGCC GTACCTGACC 
CGCTGCCTGA CGTCGCTGGT GGAGCAGACC ATCGGGCGAG ACCGGTTGGA GGTCATCGCG
GTCGACGACG GGTCCACCGA CGGCAGCGGT CCCGAGCTGG ACCGGTTTGC CCGGCTCTAC
CCGGGCACGG TGAAGGTGGT GCACCAACCC AACTCGGGCG GCCCGGCTGC ACCGAGTAAT
CGCGGGCTGG AGCTGGCGAC CGGCCGCTAC GTCTTCTTTG TCGGCTCCGA CGACTACCTG
GGGCCACAGG CGCTTCAGCG GCTGGTCACC GCCGCCGACC GGTGGGAGTC GGACGTGGTG
CTCGGCCGCC TGGTGGGGGT GAACAGTCGC TACATTCACC AGGCGATCTA CGCCGAAAGC
TCCGCCGACG TCGACCTGTT CGGCTCGGCT CTGCCCTGGT CGCTGTCGAA CACGAAGCTG
TTCCGGCGGG AACTCGTCGA GCGGCACGGG CTGCGCTACC CGGAGGACAT GCCGGTCGGC
AGCGACCAGC CGTTCACCAT CGAGGCCTGC GTCCGGGCCC GCAGGGTCTC AGTGCTCGCC
GACTACGACT ACTACTACGC GGTGCGTCGG TTGAACGCGC GTAACATCAC CTACCGCAGC
CGGCACCTGG AGCGGCTGCG CTGCGCCGAG GAACTGGTCA CCTTCGTGGC CGGGCTGGTC
GAGCCCGGCC CGAACCGCGA CGCGGTGCTG CTGCGACACT TCACCTGGGA GGTCGCCAAG
CTGTTGGAAA ACGACTTCCT GCAGCTCGAT CGCACCGTGC AGGACCAAGT GGTGGCAGGG
GTGCGGACGC TCACCGAGGC GCATCTGACC GACCGCATCC GGGATCGTCT GCCGATCGAG
GCCCGGGTGC GGCTCGCCGC TGCCCGGTAC GGTGACACCG ACCACCTCCT CGCGGTGATC
CGGCAGGACG CCGAGTTGGG TATCCCGCTC GCCGTGATCG AGGGTGAACG CTGGTATGCC
GGCTACCCGG GTTTCCGAGA TCCGCGACTG CGCATTCCGG ACTGCTGGTA CGAGATCACC
GATACCGCCG CCGACTGGGT GGCCCGGCTA GACACCGTCT CGGCGGCCTT CGAAGGATCA
CGGGCGCTGC TGGTGACCGC CCGCAGCCCC CGCCCTGACC TGCCGGAGCT GGCGTCGTCG
GTCCGGCTCG CGGCCGGTGA CGTGACCGGC GAGACGCTGT CGACGGTCGC GGACGCCACC
GGCACGACCG TACGCGCCCG GATTCCGTTG GATCGGTTGC TGGAAGGCGC TGGCCCGGGT
GGGGAACTGC GCACGGTCCA GGCGCTTGCG AACGCGTTCG GCACCACCGG CGCGGCGGCC
CTGCGCGGCG CCCGGCGGCC GGTGCCCCAG CGGGCGGTGC TGCGCCGGGG CGCCCGACTC
CATGTTCTGA CCATTACCAC CAATCACAAG GGCCAGCTTG TCATCGCCGT AGCACCTGTC
ACCCCACGCC GGTTGATGGC CCGCCTGCGG CGCAGGCTTC CACTAGGAGG AAAGTAG
 
Protein sequence
MTVPDVTVIT AVYNTMPYLT RCLTSLVEQT IGRDRLEVIA VDDGSTDGSG PELDRFARLY 
PGTVKVVHQP NSGGPAAPSN RGLELATGRY VFFVGSDDYL GPQALQRLVT AADRWESDVV
LGRLVGVNSR YIHQAIYAES SADVDLFGSA LPWSLSNTKL FRRELVERHG LRYPEDMPVG
SDQPFTIEAC VRARRVSVLA DYDYYYAVRR LNARNITYRS RHLERLRCAE ELVTFVAGLV
EPGPNRDAVL LRHFTWEVAK LLENDFLQLD RTVQDQVVAG VRTLTEAHLT DRIRDRLPIE
ARVRLAAARY GDTDHLLAVI RQDAELGIPL AVIEGERWYA GYPGFRDPRL RIPDCWYEIT
DTAADWVARL DTVSAAFEGS RALLVTARSP RPDLPELASS VRLAAGDVTG ETLSTVADAT
GTTVRARIPL DRLLEGAGPG GELRTVQALA NAFGTTGAAA LRGARRPVPQ RAVLRRGARL
HVLTITTNHK GQLVIAVAPV TPRRLMARLR RRLPLGGK