Gene Sare_3605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3605 
Symbol 
ID5706630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4161022 
End bp4162170 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID641273029 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001538394 
Protein GI159039141 
COG category[S] Function unknown 
COG ID[COG2327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03609] polysaccharide pyruvyl transferase CsaB 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.808806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000648077 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACACCTG GCACTGGCCT GACCATCGGT GTGCTCGGCT CGTACGGCGG TCGTAACCTC 
GGTGACGAGG CAATCCTCAC CGGCCTCCTG GCTGACCTGC AGGAACAGGA GCCGAACGCC
CGTATCATCG TGTTCTCCCG CAATCCCGAC CACACCCGGT CGGCCCACCC GGAGGTGGAG
GCGGTGCCCT GGGAGGGGGT GAGCCGCACC GACTCGTCAC CGGTGCTCGC CCAACTCGAT
CTGCTCATTC TGGGTGGCGG CGGCATCCTC TACGACCGGG AGGCACGCCG CTACCTGCGG
GTCGTCCGGG TTGCCCAGGA GCGCGGCCTG CCGCTGCTCA CGTACGCGGT GGGGGTCGGC
CCACTCAGCG AGATCGTGGA CACCGGGATG GTGCGCGAGA CCCTGGCCGG GGCGACCCAG
GTCACGGTGC GGGACCAGGA ATCGCGCATG CTCCTGGAGG AGGCCGGGCT ACTCAACCCG
ATCACGGTCA CCGCGGACCC GGCGTTTCTG CTCGAGGCCG AGGACTTCCC CGCGCACCTG
CTCCGGGAGG AGGGGGTACC GGCGGGCCGG CGGCTGGTCG GCATGAGCGT GCGCGAGCCG
GGCCGAGCCG CCGAACGCCT CGACGTTGAC GGGTACCACC GGCTCTTGGC CCAGATCGGC
GACTTCCTCG TACACCGGAT CGACGCGGAT GTCCTTTTCG TTCCGATGGA GCGGGACGAC
ATCCGGCACT CCCACGGCGT GCTGTCACAC ATGATCGCCG CCGAGCGAGG CCGTATTCTG
CACGGTAGCT ACTCACCCCA GCAGGTGCTC GGTTTGATGC GCCACTTCGA CCTGGCCGTC
GGCATGCGGC TGCACTTTCT GATCTTCGCC GCGATGGCGA ACACTCCGTT CCTGCCCCTG
CCGTACGCAG GTAAGGTCTT CGACCTGGCT CAGCGGCTTG GCGTCCCCGC CCTGCGGGGA
GTGGAACGGG AGGTCGAGGG CCCGCTGTTG GCCGAGGTCG ACCGGCTGTG GGACGAGCGG
GACCAGCGCG CCGAGGCCAC CGCCCGACGG GTCGCCGAGG TGTGCGAGGA AGCCCGGGGC
ACCTCCAAGG TGACCCGGTC GGTGCTGGAC AGTCTCCGGA CCCAGGCGAT GGTCTCCGTC
GACGCGTGA
 
Protein sequence
MTPGTGLTIG VLGSYGGRNL GDEAILTGLL ADLQEQEPNA RIIVFSRNPD HTRSAHPEVE 
AVPWEGVSRT DSSPVLAQLD LLILGGGGIL YDREARRYLR VVRVAQERGL PLLTYAVGVG
PLSEIVDTGM VRETLAGATQ VTVRDQESRM LLEEAGLLNP ITVTADPAFL LEAEDFPAHL
LREEGVPAGR RLVGMSVREP GRAAERLDVD GYHRLLAQIG DFLVHRIDAD VLFVPMERDD
IRHSHGVLSH MIAAERGRIL HGSYSPQQVL GLMRHFDLAV GMRLHFLIFA AMANTPFLPL
PYAGKVFDLA QRLGVPALRG VEREVEGPLL AEVDRLWDER DQRAEATARR VAEVCEEARG
TSKVTRSVLD SLRTQAMVSV DA