Gene Information |
Locus tag | Franean1_5717 |
Symbol | |
ID | 5674043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6938508 |
End bp | 6939647 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244570 |
Product | squalene/phytoene synthase |
Protein accession | YP_001509973 |
Protein GI | 158317465 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | [TIGR03464] squalene synthase HpnC |
|
|
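The lengths reported in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names span_length and protein_length are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
START_BP = 6938508
END_BP = 6939647
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1140 bp corresponds to 380 codons, one of which is the stop codon, giving the 379 aa listed above.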
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAA TCGACGCCCG AACCGGCCCA CCGCGACCGT CTGCCCACCC CGGCGCGACC GACCTCACCA CGCCCACCGA GCAGGTGTTG CTGGCCGCGC CCGCCGAGAA CTTCCCGGTC GCCTCGCGGC TGCTGCCCGC CGCGACCCGG GATCACCTCA ACGCGCTGTA CGTCTTCGGA CGTCTCGTTG ACGACATCGG TGACGAGGCC CCCGGCGACC GCCTGGCGCT GCTCGACGAG CTTGCCGCCG ACCTCGAGCT GATCTGGTCG GGCGGCGAGC CGAAGCTGCC CGTCCACCGT CGGCTCGCGC ACACAGTGCG GGCCTGCGAC CTGCCGGCCG AGCCGTTCCG GCGGCTCGTG CAGGCCAACC AGCAGGACCA GATCGTCACC AGGTACGACA CCTACGAGGA CCTCGTGCAG TACTGCACCC TCTCGGCGGA TCCGATCGGC CGCATGGTGC TCGGGGTGTT CGGTCTGGCG ACTCCGGCGC GGATCGCGCT GTCCGACCGC ATCTGTACCG CGCTGCAGCT CGCGGAGCAC TTCCAGGACG TCGCCGAGGA CCTCGCGGCC GGCCGGATCT ACGTGCCGCT GGAGGACCTG GACGCGTTCG GGGTGACCGA GGCCGACCTG GCCGCGCCGG TCGCCTCGCC CGCGGTGCGG CACCTGATGG CCTTCGAAGT GGCCCGGGCT CGGACCTTGC TGGACCAGGG GGCACCGCTG GTGAACCTGG TCGGTGGCCG GCTGCGGCTG GCGATGGCCG GATTCGTGGG GGGCGGCCGG GCCGCGCTCG ACGCGATCCG CCGCGCCGAC TACGACATGC TCGGTGGCGC GCCCAAGGCC CCCAAGCCCC GGATGGCGGC CTTCGCCGCG GCCGCGTGGG CCCGGTCGCT GGTGCCCGGG CAGGCCGCGG CCGCCCGCGC CGCCGGCACC GCCGCCACCG CCGCCACTGC TTCAAGCACT ACCGCTCCCA GCACTACCGC TCCCGGCGCC GCCGCTGCTG GCGGCACCGG TTCCAGCGGC CCTGTTCCCA GCGGCACCGT TCCCACCACC ACCCCCGGCG GGGGCGCCGC GGTGGCGAGT GCCGTCCCGG CTCCGGCGCC CGCCGGATCC GACGTCGACG TCCCGGAGGG TGCCAGATGA
|
Protein sequence | MTAIDARTGP PRPSAHPGAT DLTTPTEQVL LAAPAENFPV ASRLLPAATR DHLNALYVFG RLVDDIGDEA PGDRLALLDE LAADLELIWS GGEPKLPVHR RLAHTVRACD LPAEPFRRLV QANQQDQIVT RYDTYEDLVQ YCTLSADPIG RMVLGVFGLA TPARIALSDR ICTALQLAEH FQDVAEDLAA GRIYVPLEDL DAFGVTEADL AAPVASPAVR HLMAFEVARA RTLLDQGAPL VNLVGGRLRL AMAGFVGGGR AALDAIRRAD YDMLGGAPKA PKPRMAAFAA AAWARSLVPG QAAAARAAGT AATAATASST TAPSTTAPGA AAAGGTGSSG PVPSGTVPTT TPGGGAAVAS AVPAPAPAGS DVDVPEGAR
|
| |