Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1777 |
Symbol | |
ID | 5704519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2048867 |
End bp | 2049787 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271280 |
Product | pyridoxal biosynthesis lyase PdxS |
Protein accession | YP_001536655 |
Protein GI | 159037402 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0214] Pyridoxine biosynthesis enzyme |
TIGRFAM ID | [TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00625137 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCGAAT CCCAATCTCC GAACTCTTCC ACCAATGCCC CTGTCACCGG CACCACCCAC GTGAAACGGG GTATGGCCGA GATGCTCAAG GGCGGTGTGA TCATGGACGT GGTCACCCCG GAGCAGGCCA GGATCGCCGA GGACGCCGGT GCGGTCGCGG TGATGGCACT GGAGCGGGTT CCGGCGGACA TCCGCGCCCA GGGCGGGGTG TCCCGGATGA GTGATCCCGA CATGATCGAC GGCATCATGC AGGCGGTCTC GATCCCGGTC ATGGCCAAGG CCCGCATCGG TCACTTCGTG GAGGCGCAGA TCCTCCAGTC GCTCGGCGTC GACTACGTCG ACGAGTCCGA GGTCCTGACC CCGGCGGACT ACGCGAACCA CGTCGACAAG TGGGCCTTCA CGGTGCCGTT CGTCTGCGGC GCCACCAATC TGGGCGAGGC GCTGCGGCGG ATCACCGAGG GCGCGGCCAT GATTCGCTCG AAGGGCGAGG CCGGCACCGG CGACGTTTCC AACGCCACCA CCCACATGCG GGGGATCCGC ACCGAGATCC GCCGGCTGCA GTCATTGCCG GCGGACGAGT TGTACGTGGC TGCCAAGGAG CTCCAGGCGC CGTACGAGCT GGTCCGCGAG ATCGCCGAGA CGGGCAAGCT GCCGGTGGTA CTGTTCACCG CCGGTGGTAT CGCCACCCCG GCCGATGCCG CCATGATGAT GCAGCTGGGC GCCGAGGGTG TCTTCGTCGG CTCCGGCATC TTCAAGTCCG GCAACCCGGC CGAACGGGCT GCCGCGATCG TCAAGGCGAC CACGTTCCAC GACGACCCGG AGGTGCTGGC CAAGGTCTCG CGGGGCCTCG GCGAGGCGAT GGTCGGTATC AACGTCGACC AGATCCCGCA GTCGGACCGC CTGGCCGAGC GCGGCCGGTG A
|
Protein sequence | MPESQSPNSS TNAPVTGTTH VKRGMAEMLK GGVIMDVVTP EQARIAEDAG AVAVMALERV PADIRAQGGV SRMSDPDMID GIMQAVSIPV MAKARIGHFV EAQILQSLGV DYVDESEVLT PADYANHVDK WAFTVPFVCG ATNLGEALRR ITEGAAMIRS KGEAGTGDVS NATTHMRGIR TEIRRLQSLP ADELYVAAKE LQAPYELVRE IAETGKLPVV LFTAGGIATP ADAAMMMQLG AEGVFVGSGI FKSGNPAERA AAIVKATTFH DDPEVLAKVS RGLGEAMVGI NVDQIPQSDR LAERGR
|
| |