Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1694 |
Symbol | |
ID | 4484694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1903900 |
End bp | 1904811 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639730484 |
Product | squalene/phytoene synthase |
Protein accession | YP_873452 |
Protein GI | 117928901 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | [TIGR03465] squalene synthase HpnD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0597183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000516194 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGCAC CCTCCGCTGC ACGGCTCGCG ACGGCGTACG CCGAATGCGA ACACATCATG GTGAGCGCCG CCCGGAATTT CTCCTACGGA ATCCGCCTGC TGCCGCCGGG CAAACGTCAA GCGCTTGCCG CGGTGTATGC GTTGGCCCGG CGCATCGATG ACATCGGCGA CGGCACCCTC CCCGACGACG AGAAACTCCG ATTGCTCGAC GACGTCCGCA TGTCGATCAA ACGAATGGGC GAGCCGACCG ACGACCCGGT GCTGCTGGCG GTGGGCGATG CCGCCCGCCG GCTGCCGATC CCCATCGACG CATTCTTTGA GCTCATCGAA GGGTGCGAGC TGGACGTCCG CAATACCCGC TATCCGACCT TCCGCGAACT CACCCACTAC TGCCGGTGCG TCGCGGGTTC GATCGGGCGG TTGTCGCTCG GCGTTTTCGA CCCGGCTGAT TTCGCGACCG CCGAACCCCT GGCCGACGCA CTCGGCATCG CCTTGCAATT GACCAACATC CTGCGGGACA TCCGGGAAGA TCTGCTTCGC GGCCGCGTCT ACCTGCCGCT CGACGAGCTG ACCGCCGCCG GCATCGAGCC GCATCTGGAC GATCGGGGCA TGGTCGCCGA CGAGCGCGGC CGGTTCGCGG AATTTATCCG CGACCAGGCG GCCCGCGCCG AAACGTGGTA CGCGCGGGGC TTTGCCCTGC TGCCCATGCT CGACCGCCGG TCTGCTGCGT GCACCGCGGC CATGGCCGGC ATCTACCACC GCCTGCTGCA ACGCATCGCC CGCAATCCGC GAGCCGTCCT GCAACGGCGG GTCGCCTTGC CGGCCTGGGA GAAGGCTGCG CTCGCCGCAC GGGCATTCGC CACCGGCGGC CTCGGGGAAG CCGCGGCGCG ACGTTCCGCC GGAGCGCGAT GA
|
Protein sequence | MNAPSAARLA TAYAECEHIM VSAARNFSYG IRLLPPGKRQ ALAAVYALAR RIDDIGDGTL PDDEKLRLLD DVRMSIKRMG EPTDDPVLLA VGDAARRLPI PIDAFFELIE GCELDVRNTR YPTFRELTHY CRCVAGSIGR LSLGVFDPAD FATAEPLADA LGIALQLTNI LRDIREDLLR GRVYLPLDEL TAAGIEPHLD DRGMVADERG RFAEFIRDQA ARAETWYARG FALLPMLDRR SAACTAAMAG IYHRLLQRIA RNPRAVLQRR VALPAWEKAA LAARAFATGG LGEAAARRSA GAR
|
| |