Gene Information |
Locus tag | Haur_3808 |
Symbol | |
ID | 5735672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4779792 |
End bp | 4780673 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280960 |
Product | pyridoxal biosynthesis lyase PdxS |
Protein accession | YP_001546572 |
Protein GI | 159900325 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0214] Pyridoxine biosynthesis enzyme |
TIGRFAM ID | [TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 |
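
The Gene Length and Protein Length values above are consistent with the Start bp and End bp coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in exactly one stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record for locus tag Haur_3808.
length = gene_length(4779792, 4780673)
print(length, protein_length(length))  # prints "882 293"
```

The strand does not affect this calculation, since the length depends only on the two coordinates.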
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000407513 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACTT CAACATTTAC CACCAAAGTC GGCTTGGCCC AAATGCTCAA GGGCGGCGTG ATTATGGATG TGGTTACGCC AGACCAAGCT AAAATTGCTG AAGAAGCTGG CGCAGTCGCC GTGATGGCGC TCGAACGGGT TCCAGCCGAT ATTCGTAAGG ATGGCGGCGT GGCTCGTATG AGCGACCCCG AAATGATCCA AGGCATCATT GAAGCAGTCA CCATTCCTGT GATGGCTAAA TCACGGATTG GTCACTTTGT TGAAGCGCAA ATCCTCGAAG CGATTGGGGT TGATTATATT GACGAGAGCG AAGTGCTCAC GCCTGCTGAT GAAGAACATC ATACCAACAA GCACAATTTC AAAGTGCCAT TCGTCTGTGG CGCTCGTAAT TTGGGCGAAG CCTTGCGCCG CATCACCGAA GGCGCAGCCA TGATTCGCAC TAAGGGCGAA GCTGGCACGG GCAATGTGGT TGAAGCTGTG CGCCACGCTC GCACGATGTT CGCCGAAATT CGCCGTTTGC AAACCCTCGA TCCCGATGAG TTGTTTGTGG CTGCCAAAAA CTTGCAAGCT CCCTATGAAT TGGTCAAGCA AATTGCTGAA TTAGGCCGCT TACCAGTTGT CAATTTCGCT GCTGGCGGGA TTGCAACTCC AGCCGATGCT GCTTTGATGA TGCAATTGGG TGTTGATGGC GTGTTCGTTG GCTCAGGGAT TTTCAAATCG GGCAATCCTG CCAAACGCGC TAAAGCAATT GTTGAAGCAA CCACCCACTT CCGCGATGCC AAGCTTTTGG CCGAAATTAG CCGCAACTTG GGCGAAGCCA TGGTCGGCAT CAACATCGAT ACCATCCCAG AGAACGAGTT GCTCGCCAAA CGCGGTTGGT AA
|
Protein sequence | METSTFTTKV GLAQMLKGGV IMDVVTPDQA KIAEEAGAVA VMALERVPAD IRKDGGVARM SDPEMIQGII EAVTIPVMAK SRIGHFVEAQ ILEAIGVDYI DESEVLTPAD EEHHTNKHNF KVPFVCGARN LGEALRRITE GAAMIRTKGE AGTGNVVEAV RHARTMFAEI RRLQTLDPDE LFVAAKNLQA PYELVKQIAE LGRLPVVNFA AGGIATPADA ALMMQLGVDG VFVGSGIFKS GNPAKRAKAI VEATTHFRDA KLLAEISRNL GEAMVGINID TIPENELLAK RGW