Gene Sare_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1777 
Symbol 
ID5704519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2048867 
End bp2049787 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content68% 
IMG OID641271280 
Productpyridoxal biosynthesis lyase PdxS 
Protein accessionYP_001536655 
Protein GI159037402 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0214] Pyridoxine biosynthesis enzyme 
TIGRFAM ID[TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00625137 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCCGAAT CCCAATCTCC GAACTCTTCC ACCAATGCCC CTGTCACCGG CACCACCCAC 
GTGAAACGGG GTATGGCCGA GATGCTCAAG GGCGGTGTGA TCATGGACGT GGTCACCCCG
GAGCAGGCCA GGATCGCCGA GGACGCCGGT GCGGTCGCGG TGATGGCACT GGAGCGGGTT
CCGGCGGACA TCCGCGCCCA GGGCGGGGTG TCCCGGATGA GTGATCCCGA CATGATCGAC
GGCATCATGC AGGCGGTCTC GATCCCGGTC ATGGCCAAGG CCCGCATCGG TCACTTCGTG
GAGGCGCAGA TCCTCCAGTC GCTCGGCGTC GACTACGTCG ACGAGTCCGA GGTCCTGACC
CCGGCGGACT ACGCGAACCA CGTCGACAAG TGGGCCTTCA CGGTGCCGTT CGTCTGCGGC
GCCACCAATC TGGGCGAGGC GCTGCGGCGG ATCACCGAGG GCGCGGCCAT GATTCGCTCG
AAGGGCGAGG CCGGCACCGG CGACGTTTCC AACGCCACCA CCCACATGCG GGGGATCCGC
ACCGAGATCC GCCGGCTGCA GTCATTGCCG GCGGACGAGT TGTACGTGGC TGCCAAGGAG
CTCCAGGCGC CGTACGAGCT GGTCCGCGAG ATCGCCGAGA CGGGCAAGCT GCCGGTGGTA
CTGTTCACCG CCGGTGGTAT CGCCACCCCG GCCGATGCCG CCATGATGAT GCAGCTGGGC
GCCGAGGGTG TCTTCGTCGG CTCCGGCATC TTCAAGTCCG GCAACCCGGC CGAACGGGCT
GCCGCGATCG TCAAGGCGAC CACGTTCCAC GACGACCCGG AGGTGCTGGC CAAGGTCTCG
CGGGGCCTCG GCGAGGCGAT GGTCGGTATC AACGTCGACC AGATCCCGCA GTCGGACCGC
CTGGCCGAGC GCGGCCGGTG A
 
Protein sequence
MPESQSPNSS TNAPVTGTTH VKRGMAEMLK GGVIMDVVTP EQARIAEDAG AVAVMALERV 
PADIRAQGGV SRMSDPDMID GIMQAVSIPV MAKARIGHFV EAQILQSLGV DYVDESEVLT
PADYANHVDK WAFTVPFVCG ATNLGEALRR ITEGAAMIRS KGEAGTGDVS NATTHMRGIR
TEIRRLQSLP ADELYVAAKE LQAPYELVRE IAETGKLPVV LFTAGGIATP ADAAMMMQLG
AEGVFVGSGI FKSGNPAERA AAIVKATTFH DDPEVLAKVS RGLGEAMVGI NVDQIPQSDR
LAERGR