Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5149 |
Symbol | |
ID | 5673483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6170058 |
End bp | 6171023 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243999 |
Product | pyridoxal biosynthesis lyase PdxS |
Protein accession | YP_001509413 |
Protein GI | 158316905 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0214] Pyridoxine biosynthesis enzyme |
TIGRFAM ID | [TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0075588 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000117077 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATGTCCG ACGGCCTGAC CGCCCCCGCC CCCGCCCCGG GCGCTTCGCC ATCCGGCCCC GTCGCGCCGG CGGACGGCGC GGAGCGGCAC GCCGGCACCG CCCGGGTCAA GCGCGGCATG GCGGAGATGC TCAAGGGCGG CGTCATCATG GACGTCGTCA CCCCCGAACA GGCCCGCATC GCCGAGGAGG CGGGCGCCGT CGCGGTCATG GCGCTGGAGC GGGTGCCCGC GGACATCCGG GCGCAGGGCG GCGTGGCGCG GATGAGCGAC CCCGACATGA TCTCCGGGAT CATCGAGGCG GTCTCGATCC CGGTCATGGC CAAGGCCCGC ATCGGGCACT TCGTCGAGGC GCAGATCATC CAGGCGCTGG GTGTGGACTA CGTCGACGAG TCCGAGGTCC TCACCCCGGC GGACCCGAAC CACCACATCG ACAAGTGGGG CTTCACGGTT CCCTTCGTCT GCGGAGCGAC GAACCTGGGC GAGGCGCTGC GGCGGATCTC CGAGGGTGCC GCGATGATCC GCTCGAAGGG CGAGGCGGGT ACCGGCGAGG TCTCCAACGC CGTGGTGCAC ATGCGTACGA TCCGGTCGGA GATCGCGCGG CTGTCGGGGC TGCCGTCCGA GGAGCTCTAC GCCGCGGCCA AGGAGCTGCG TGCGCCGGTG GAGCTCGTCA CCGAGGTCGC GCGGCTGGGT CGGCTGCCGG TCGTGCTGTT CACCGCGGGC GGCATCGCCA CCCCGGCCGA CGCGGCGCTG ATGATGCAGC TCGGCGCGGA CGGCGTGTTC GTCGGCTCCG GCATCTTCAA GTCCGGCGAC CCGGCGCGGC GCGCCCGGGC GATCGTCGAG GCCACGACCA TGTACAACGA CCCCGGCGTG CTGGCGAAGG TGTCGCGCGG CCTCGGTGAG GCCATGGTCG GCATCAACGT CGGCGAGCTC CCGCCCGAGG CGCGCTTCGC CGCCCGCGGC TGGTGA
|
Protein sequence | MMSDGLTAPA PAPGASPSGP VAPADGAERH AGTARVKRGM AEMLKGGVIM DVVTPEQARI AEEAGAVAVM ALERVPADIR AQGGVARMSD PDMISGIIEA VSIPVMAKAR IGHFVEAQII QALGVDYVDE SEVLTPADPN HHIDKWGFTV PFVCGATNLG EALRRISEGA AMIRSKGEAG TGEVSNAVVH MRTIRSEIAR LSGLPSEELY AAAKELRAPV ELVTEVARLG RLPVVLFTAG GIATPADAAL MMQLGADGVF VGSGIFKSGD PARRARAIVE ATTMYNDPGV LAKVSRGLGE AMVGINVGEL PPEARFAARG W
|
| |