Gene Sare_0993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0993 
Symbol 
ID5707533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1116197 
End bp1117345 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content74% 
IMG OID641270508 
Productglycosyl transferase group 1 
Protein accessionYP_001535895 
Protein GI159036642 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.153198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CCATCGTGAC CGAATCGTTC CCGCCGGACG TGAACGGTGT CGCGCACTCG 
GTGGTGCGGG CAGCAGAGCA CCTGGTCGCC CGCGGACACG AACCGGTGGT CATCGCGCCC
GCTCCGGGTG GGGCCCGCCG CCACGAGTCG AACCGGCACT CGTACCCGGT GGTCCGCATC
CCCAGCGTTC CGCTGCCGCG CTACCAGGGC TTCCGGTTGG GCGTACCGAC GCAGGCCCAG
CTGACCGGCG CGGTGCTGTC GTGCGCCCCC GACATCGTTC ACCTGGCCAG TCCGTTCGTG
CTCGGGGCAC GGGCCGCGAC CCTGGCGGCC CGGCACGACC TGCCGACGGT TGCCGTCTAC
CAGACCGACG TCGCCTCATA CGCCCGCGCG TATCGGGTCG GCTGGGGCGA AGCGGCGGTC
TGGCGGCGGA TCCGCGAGAT CCACAATTCG GCCCAGCGTA CGCTCGCGCC GTCCACCCGG
GCCGCCGCCG ATCTCGTCGC CAACGGGGTG CAGCGAATCT GGCTCTGGCG ACGCGGCATC
GACGGCGAGC GCTTCCAGCC GGCGAAGCGG TGCGCCGCGC TGCACCGGGC TCTCGCGCCC
GGCGGTGAAC TGCTCGTCGG CTACGTCGGG CGGCTTGCCC CCGAGAAGCG GGTCGACCTG
CTCGAGGCCA CCACCCGCCT GCCCGGCGTC CGGGTCGTGG TCGTCGGCGA CGGGCCGGAC
CGCCGGCGGC TGGAGTGGTC CCTGCCGGGC GCGGCGTTTC TCGGTGTGCA GCACGGCGAG
GACCTCGCCC GCCTCTACGC GAGCCTCGAC GTCTTCGCGC ACACCGGCCC ACACGAAACG
TTCGGCCAGA CGATCCAGGA GGCACTGGCC AGTGGTGTAC CCGTGGTGGC TCCGGCGGCC
GGCGGGCCGG TCGACCTGGT CAAGTCCGGG GTGACCGGGA CACTGGTGCC GCCCGGCGAC
GCCGGGGCGC TCGCCGACGC CGTCCGGGCG CTCGCCACCG ACGAAGCCCG CCGGCAGGCG
TACGCGGCGG CGGGCCGGGC CGCCGTCATC CGCCGCAGTT GGACGGCGGT CGGCGACGAG
CTGATCGGCC ACTACCGGGC GGTCCTCCGG TCCGGTGCCT CGGCGCTGGA CCTACCCGCG
GTGTCGTGA
 
Protein sequence
MRIAIVTESF PPDVNGVAHS VVRAAEHLVA RGHEPVVIAP APGGARRHES NRHSYPVVRI 
PSVPLPRYQG FRLGVPTQAQ LTGAVLSCAP DIVHLASPFV LGARAATLAA RHDLPTVAVY
QTDVASYARA YRVGWGEAAV WRRIREIHNS AQRTLAPSTR AAADLVANGV QRIWLWRRGI
DGERFQPAKR CAALHRALAP GGELLVGYVG RLAPEKRVDL LEATTRLPGV RVVVVGDGPD
RRRLEWSLPG AAFLGVQHGE DLARLYASLD VFAHTGPHET FGQTIQEALA SGVPVVAPAA
GGPVDLVKSG VTGTLVPPGD AGALADAVRA LATDEARRQA YAAAGRAAVI RRSWTAVGDE
LIGHYRAVLR SGASALDLPA VS