Gene Information |
Locus tag | PHATRDRAFT_13643 |
Symbol | HCL1 |
ID | 7202194 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 823057 |
End bp | 824108 |
Gene Length | 1052 bp |
Protein Length | 321 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | hydroxymethylglutaryl-coenzyme A lyase |
Protein accession | XP_002181269 |
Protein GI | 219121846 |
COG category | |
COG ID | |
TIGRFAM ID | |
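The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, that Protein Length excludes the stop codon, and that the gene span runs from the start codon to the stop codon with no untranslated regions. The function names are hypothetical.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information rows above.
start_bp, end_bp = 823057, 824108
protein_aa = 321

span = gene_span_length(start_bp, end_bp)   # genomic span of the gene
cds = coding_length(protein_aa)             # coding bases incl. stop codon
non_coding = span - cds                     # bases left over within the span

print(span, cds, non_coding)  # prints: 1052 966 86
```

Under these assumptions the span matches the listed Gene Length of 1052 bp, and the 86 bp not accounted for by coding sequence agrees with the record marking the gene as spliced.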
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.404031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACCG TACCGACACT TTTGGTTCCA CCGCGAGTCA AAATTGTCGA AGTTGGGCCC CGAGACGGCT TACAGAACGA ACCCGTCAGC GTCTCCACGG AGGACAAAAT TAAACTCGTA CAAAAGCTGG CTCAGGCGGG CTGTCGCTAC ATCGAAGCTG GAAGTTTCGT CTCACCAAAA TGGGTTCCCA GCATGGCGAA TTCATTTCAA ACAATGACTA AGTTAAGAGA ATGGAAAGAG AAGCAAGGGC CAGAATACGA GCCACTGGTC TTGTCTTGTC TCGTGCCTAA TCTCGCAGGC CTTCACCAAG CGATTCAGGT AAAGGCTGGA GAGATTGCCG TATTTGGATC TGCGAGTGAG ACTTTTTCCA ACAAGGTGCG TCTTCCAAGA ATTGATATCA AAGTGCGTGC TGCGTGTCCT TGGTTGGGTC TCATCTGATA CGATTTTACC TTGTCGCCTA GAACATAAAC TGCAGTATTG ACGAGTCACT GGAGCGCTTC GCGCTGGTCG TTGCCGAGGC CAACACCGCT AGAATTCCCG TGCGAGCCTA CTTGTCCTGT GTCATTGGTT GTCCTTACCA AGGAAGGATC CCACCATTAG CCGTAGCCCA GATGGCGGAA AAATTATTAG CTTTGGGCTG TCACGAATTG TCGTTGGGTG ATACGATTGG GGTCGGGACA CCCACAACCA CAATCGCGCT GTTGAAGGAA CTGCAACACG TCCTGGGGAA CGACGTAGAC AAACTAGCCG TTCATTTTCA CGACACACAT GGTCAAGCTT TGGCTAACAT TTTGGTATCC TTGGAAAGTG GGATCGCGAC GGTCGATGCT TCCGTTGCGG GGCTTGGAGG TTGTCCGTAT GCCCCTGGAG CGTCGGGCAA CGTGGCGACC GAAGATGTTG TCTACATGCT GAATGGTTTG GGTGTTGAGA CGGGGATTGA TCTTGACAAG CTTGTAGAGG CTGGTGACTT CATTTGCGAG GTTTTGGACC GCCCTTCCAG ATCTAGGGCA GGGACAGCCA TTTCTGCCAT ACAGAAGCGG AAAGCACCGT AG
|
Protein sequence | MHTVPTLLVP PRVKIVEVGP RDGLQNEPVS VSTEDKIKLV QKLAQAGCRY IEAGSFVSPK WVPSMANSFQ TMTKLREWKE KQGPEYEPLV LSCLVPNLAG LHQAIQVKAG EIAVFGSASE TFSNKNINCS IDESLERFAL VVAEANTARI PVRAYLSCVI GCPYQGRIPP LAVAQMAEKL LALGCHELSL GDTIGVGTPT TTIALLKELQ HVLGNDVDKL AVHFHDTHGQ ALANILVSLE SGIATVDASV AGLGGCPYAP GASGNVATED VVYMLNGLGV ETGIDLDKLV EAGDFICEVL DRPSRSRAGT AISAIQKRKA P
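The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalized before computing length or GC fraction follows. The helper names are hypothetical, and only the first three blocks of the gene sequence are used here for brevity; the full string would be pasted in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three display blocks of the gene sequence in this record.
raw = "ATGCATACCG TACCGACACT TTTGGTTCCA"
seq = clean_sequence(raw)

print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete gene sequence, the same two calls give the values to compare against the Gene Length and GC content rows.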