Gene Sare_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0467 
Symbol 
ID5703646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp535479 
End bp536540 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content69% 
IMG OID641269992 
Productglutathione S-transferase-like protein 
Protein accessionYP_001535387 
Protein GI159036134 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0435] Predicted glutathione S-transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0671242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGG ACGGAGCGGC GAGCGGCGGC GACGAGTCAC CGGGGCGCAC CGGCGGGAGG 
TATGTCGAGC CGGGCGGCGA GTTCACCCGG GATCAGCGCT ATATCGCCAC CCGGATCACC
GTTGACGGCG GGGACGGCTG GCCGGTGGAG CCGGGGCGGT ACCGACTGGC GGTGAGTCGC
GCCTGCCCAT GGGCGAGCCG ACTGGTCATC GTCCGACGGC TACTCGGGCT GGAGGGCGCC
ATCTCGATGG CGATCGCCGG CCCGACCCAC GACGAACGAA GCTGGACCTT CGACCTCGAC
CCCGGTGGGC GGGATCCGGT GTTCGGCATC GAACGGCTGG CGGAGGCGTA CTTCGCGCGC
TTTCCCGGCT ACGACCGCGG CATCACCGTG CCGGCGATCG TCGACGTGCC GACCGGGCAG
GTGGTGACCA ACGACTACGC GCAGATGAGC CTCGACCTGT CGACCCAGTG GACCGAGTAC
CACCGTGACG GGGCGCCGGA CCTCTACCCG CAGCGGCTAC GAGACGAGAT CGACGAGGTC
AACGAGGTCG TCTTCACCGA TGTCAACAAC GGTGTCTACC GGTGCGGCTT CGCTGGCAGC
CAGCAAGCGT ACGACCGGGC CTACCGGCGG CTGTTCGACC GACTGGACTG GTTGAGCGAC
CGACTCGCCG GGCGCCGCTA CCTGGTCGGG GAGACGATCA CCGAGGCGGA TGTGCGGCTG
TTCACCACGT TGGTCCGCTT CGACCCGGTC TACCACGGCC ATTTCAAGTG CAACCGGAGC
AGGTTGACCG AGATGCCGGT GCTCTGGGCG TACGCCCGGG ACCTGTTCCA GACTCCCGGA
TTCGGCGACA CCGTCGACTT CGACCACATC AAGCGCCACT ACTACGAGGT ACAACGGGAC
ATCAACCCGA CCGGGATCGT CCCCCTCGGC CCTGATCTGT CGGCCTGGCT GACGCCGCAC
GATCGGGGGG CCCTGGGCGG CCGTCCCTTC GGCGACGGCA CCGCACCGCC TCCGCCCGCA
CCGGCCGAGC GGGTTGACCC TGCGCACACC CCGTTGCACT GA
 
Protein sequence
MTEDGAASGG DESPGRTGGR YVEPGGEFTR DQRYIATRIT VDGGDGWPVE PGRYRLAVSR 
ACPWASRLVI VRRLLGLEGA ISMAIAGPTH DERSWTFDLD PGGRDPVFGI ERLAEAYFAR
FPGYDRGITV PAIVDVPTGQ VVTNDYAQMS LDLSTQWTEY HRDGAPDLYP QRLRDEIDEV
NEVVFTDVNN GVYRCGFAGS QQAYDRAYRR LFDRLDWLSD RLAGRRYLVG ETITEADVRL
FTTLVRFDPV YHGHFKCNRS RLTEMPVLWA YARDLFQTPG FGDTVDFDHI KRHYYEVQRD
INPTGIVPLG PDLSAWLTPH DRGALGGRPF GDGTAPPPPA PAERVDPAHT PLH