Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5132 |
Symbol | |
ID | 5673466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6147949 |
End bp | 6148929 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243982 |
Product | squalene/phytoene synthase |
Protein accession | YP_001509396 |
Protein GI | 158316888 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.126114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.060719 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGCC TGCGCAGCGA GCTGGATGCG GCCGGCATCA CCGATCCGCG CCTGCGGAGC TCCTACCGCG CCGCCCGCGA GCTGAACGCC GTCCACGGGC GCACCTACTA CCTGGCGACC CTCCTGCTGC CACGGTGGAA GCGCCCGCAC GTCCACGCCC TGTACGGCTT CGCGCGGTAC GCCGACGAGA TCGTGGACGA CCTCGACTCG ACGCTGACCG ACGAGGCGAA GGCACAGTGG CTGCGACAGT GGGGCAACCG CTTCCTCACC GCACTCGCCA CCGGCTCGGA CGAGGCCGGG GCCGGGGCCG ACAGGAGCGG GGCAGGCTCC GTCGGTGCCG GGGATGCCGG GGTGCTGCCA GCCGTCCTGC ACACGATTCG GCGTTTCGAC CTGCCGGTCG CGTACTTCGA GGCCTTCCTG ACCTCGATGG CGATGGACCT GACCGTCACC GGCTACGCCA CCTGGGACGA CCTCATGGTC TATGTCCACG GCTCGGCGGT GGTGATCGGC CTGCAGATGC TGCCCATCCT CGAGCCCGTC GACGCCTCGG CGGAACCGTA CGCCCGCGAC CTCGGCGCCG CCTTCCAACT CGCGAACTTC CTCCGCGACG TCGGGGAGGA CCTGCGCCGC GGCCGCGTCT ACCTTCCGCA GGCGTCCCTG GACCTGTTCG GAGTCACCAG GGAGCGCCTC GCCACCGGGG TGGTCGACGG CCCCGTCCGC CGGCTGCTGA CCCACGAGAT CGCCCGTGCG CGGGAGCTGT TCCGCTCGGC CCGACCGGGC ATCCGGCTGC TGCACCCGAC GTCCCGCGAC TGTGTCTGGA CCGCGTTCCA CCTCTACGGC GACATCCTGG ACGAGATCGA ACGCGCCGAC TACCAGATCC TCGACCGACG GGTATCCGTC GGCCTGGGCC GCCGCCTCAC CGTCGCGACC CCGGCCCTGA TCCGCGCCAC CCGCTCCCGC CGCCATCCCT CGACGAACTA A
|
Protein sequence | MSGLRSELDA AGITDPRLRS SYRAARELNA VHGRTYYLAT LLLPRWKRPH VHALYGFARY ADEIVDDLDS TLTDEAKAQW LRQWGNRFLT ALATGSDEAG AGADRSGAGS VGAGDAGVLP AVLHTIRRFD LPVAYFEAFL TSMAMDLTVT GYATWDDLMV YVHGSAVVIG LQMLPILEPV DASAEPYARD LGAAFQLANF LRDVGEDLRR GRVYLPQASL DLFGVTRERL ATGVVDGPVR RLLTHEIARA RELFRSARPG IRLLHPTSRD CVWTAFHLYG DILDEIERAD YQILDRRVSV GLGRRLTVAT PALIRATRSR RHPSTN
|
| |