Gene Sare_1909 details


Gene Information

Locus tag: Sare_1909
Symbol:
ID: 5708118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2203365
End bp: 2204525
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 76%
IMG OID: 641271413
Product: glycosyl transferase group 1
Protein accession: YP_001536785
Protein GI: 159037532
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
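
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2203365, 2204525)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```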


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.653406
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000425018
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACGGAAC CGACCTCGCG GGCCCGCGGC ACGGTCGCCC TGCTGCTCGC CTCCAGCACC 
GGTGGTGTCG GGCAGCATGT CCGCTCACTG GCCGCCGGGC TGACCAGCAT CGGCGTCGCC
GTGCTGGTCT GTGGCCCCGC TGCGACCGAG GAACAGTTCG ACTTCACCGG CGTCGGCGCC
CGCTTCGAGG CGGTGGAGAT CCGAGCCAGC CCGACCCCCG CCGACATGCG GGCGGTGACT
GCGCTTCGCC AGGCCCTCGC CGCCGAGCCG GTCGATGTGC TGCACGCGCA CGGGTTGCGC
GCCGGGCTGG TCGCCGTCGC CGCCCGACCG GCTGTCCCGC TGGTGGTGAC CTGGCACAAC
GCCGTCCTGG CCGGAGGACT GCGCGGCAGC GTGTCCCGCC TGGTCGAGCG GGTCGTCGCC
CGCAATGCCC GAGTGAGCCT CGGTGCCTCG GCCGACCTGG TCCAGCGAGC CACCGAGTTG
GGCGCAGGCG ACGCCCGGCT CGCCCCGGTC GCCGCGCCGC CGCTGTCTGA ACCGCACCGC
CGCCGGGACG CGGTTCGCGC CGAGTTCGGC GTCGGCGGTG AGCAGCCGCT GGTGCTCTCA
GTCGGCAGGC TGCACCCACA GAAGCGGTAC GACGTGTTGG TCGACGCCGC CGCCCGGTGG
CGAAACCGTG CCCCCGTTCC GGCCGTCGTG ATCGCCGGCA GCGGCCCCGC CTACCTGCAA
CTGGCTGCCC GGATCTCCGC CGCGCGGGCG CCGGTCACCC TGCTGGGACA CCGCACCGAT
GTGGCTGACC TGCTGGCCGG CGCCGACGTC GCGGTGGTGA CCAGTGACTG GGAGGCCCGG
CAACTGTTCG CGCAGGAGGC GATGCGGGTC GGCGTGCCGC TGGTGGCGAC CGCGGTCGGC
GGTCTGCCGG AGCTGGTCGG CGACGCGGCC ATACTGGTGC CGCCGGGCGA TGTCGATGCG
GTCGACGCGG CCGTTGGCCG CCTGTTGGAC GACCCGGCCC TGCGGGCCGG GCTGGGTCGG
CAGGCCCGGG AGCGGGCGGC GACGTGGCCG ACCGAGGCGG ACACCTGCGC CCAACTCGCC
GCGCTCTACG CCGAGCTGGC TCCGGGCGCC ACCGGCCCCA CCACCCCGGA GGCCGGCGCA
CCCTCGACGG GGCCGCGGTG A
 
Protein sequence
MTEPTSRARG TVALLLASST GGVGQHVRSL AAGLTSIGVA VLVCGPAATE EQFDFTGVGA 
RFEAVEIRAS PTPADMRAVT ALRQALAAEP VDVLHAHGLR AGLVAVAARP AVPLVVTWHN
AVLAGGLRGS VSRLVERVVA RNARVSLGAS ADLVQRATEL GAGDARLAPV AAPPLSEPHR
RRDAVRAEFG VGGEQPLVLS VGRLHPQKRY DVLVDAAARW RNRAPVPAVV IAGSGPAYLQ
LAARISAARA PVTLLGHRTD VADLLAGADV AVVTSDWEAR QLFAQEAMRV GVPLVATAVG
GLPELVGDAA ILVPPGDVDA VDAAVGRLLD DPALRAGLGR QARERAATWP TEADTCAQLA
ALYAELAPGA TGPTTPEAGA PSTGPR
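
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the standard codon assignments used by translation table 11 and treats an initial ATG, GTG, or TTG as methionine, which is how the GTG start of this gene comes to be read as M. The start-codon set here is a simplification chosen for illustration, and the function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these alternative start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGGAAC CGACCTCGCG GGCCCGCGGC"))  # MTEPTSRARG
```

Spaces and line breaks in the input are removed before translation, so the blocked 10-base layout used in this record can be pasted in directly.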