Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5716 |
Symbol | |
ID | 5674042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6937609 |
End bp | 6938511 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244569 |
Product | squalene/phytoene synthase |
Protein accession | YP_001509972 |
Protein GI | 158317464 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | [TIGR03465] squalene synthase HpnD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.038374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG GTGTCCCGGG CAGCTCCACC GTCGCCGAGG CCTACGCCTC CTGCGAGGCG TTGACGCGGG AGGCCGCGCG GAACTTCAGC TACGGCATCC GCCTGCTGCC GGCGGAGAAG CGGGGCGCGC TGTCCGCGGT GTACGCCCTG GCGCGTCGCC TCGACGACAT CGGCGACGGC GACCTGCCAG ATGAGGAGAA GCTCGCCGGC CTGGAGAAGG TCCGCGCAGA CGTCCGCGTG CTGGACATCA ACAGCTCCGA CCCGGTGATG GTGGCGCTCG CGGACACCGC CCGCCGCTTC CCCGTCCCGA TGGAAGCCTT CATCGAGCTC GCCGACGGGG TCGAGAGCGA TGTCCACGGG GTCCCCTACG AGACGTTCGA CGACATGGTC GGGTACTGCC GGCTGGTGGC GGGCACCATC GGCCGGCTCT CGCTGGGCAT CTTCGGCGTG GACGGGTCGG CCGGTGACGC CCGCTCGGCC GAGATCGCCG ACGCGCTCGG CGTGGCGCTC CAGCAGACCA ACATCCTGCG CGACGTCCGC GAGGACCTCC TCAACGGCCG CATCTACCTG CCCTCGACGG AGCTGGAGAA AGCCGGCGTC ACCCTGGTGG TGAGCCCGGC CGGGCGCCTC GGTGGGCCGG AGGACGCACT GGTGGACTAT CTGCGCGACT GCGCCGCGCG CGCCGACGAG TGGTACACGC GGGGCCTGGC GCTGCTCGGC CTGCTCGACC GGCGCAGCGC CGCCTGCTGC GGTGCCATGG CCGGGATCTA CCTGCGCCTG AACCGGCGCA TCCGGCAGGA CCCCACCGCG GTCCTGGAAC GCCGGCTGTC ACTGCCGGGC TGGGAGAAGG CGGTCGTGGC CGCCCGCAGC CTCGCCGGCA GATCGGAGGC GAGAGCGGCA TGA
|
Protein sequence | MSAGVPGSST VAEAYASCEA LTREAARNFS YGIRLLPAEK RGALSAVYAL ARRLDDIGDG DLPDEEKLAG LEKVRADVRV LDINSSDPVM VALADTARRF PVPMEAFIEL ADGVESDVHG VPYETFDDMV GYCRLVAGTI GRLSLGIFGV DGSAGDARSA EIADALGVAL QQTNILRDVR EDLLNGRIYL PSTELEKAGV TLVVSPAGRL GGPEDALVDY LRDCAARADE WYTRGLALLG LLDRRSAACC GAMAGIYLRL NRRIRQDPTA VLERRLSLPG WEKAVVAARS LAGRSEARAA
|
| |