Gene Information |
Locus tag | PHATRDRAFT_17326 |
Symbol | LHL1 |
ID | 7196297 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 610874 |
End bp | 611952 |
Gene Length | 1079 bp |
Protein Length | 224 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | early light induced protein |
Protein accession | XP_002177121 |
Protein GI | 219110739 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00726613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCTGCTCGA GCCGCCAACG AGACACCACC ACCTGCTCCG GATACCTCTT GCTTACACAC GCAGTTGTTT CGATACGGTA TAACTACAAT ACCTTTCACC ATGGCGCCAC TCCGAACTAC CTTTGCTCTT TTGTTGAGCC TCGTCTCGGC TTCTGCTTTT GCTCCGGTCC AGAATGTAGC CCGCAAGCAG ACCAGGTACG GATAAAACAG AAAGCGTTGC CACATTCAAT AGCACTTTCT AGCAAAGGAA CAACTCTTGC AAGTCTCGAA CGAGTCATCA GGGAATATTT TGCAAGGGTC TCAGATCCAA ATATTGTGTG GTTGATTTAT CCTATTGATC GCTTAGGAAC ATTGGGATCC TTCGCTGTTC AGTTGTGCAG AGACCCCGCA CTGACACTTT TTTGTTTTGA CTTGTTTTCC TTCATTCTCA GCGTGAGCGC CTTCAAGATC GACCCTCAGC TTTACGACGA TGCCGTCTCC GACTGGGAGA AGCAATTCCC TGCCTTCTCC AAATGGGGCT GGGGACCTTC GGTGCAGGCC GAAAAGTGGA ATGGCCGCCA CGCCATGTTC GGCTGGTTTT TCATCTGCGC CACCGCATAC TGCAAGGGAC ACGGTCTCAT CCCGGATCCC GAAATGCTCC TCGATCTGAA ACAGTGGGGT ACTCTCGCCA CCATCTCTGG AAAGGACACT ATCAGCAACG AGCGTGCCAT CATTTTGGTC GCCAACGCCC ACTTCTTCGC TCTCTCTCTC GCTGCCACCA TCTGCCCGCT TCCGTTCGGT GACTCTCTCT TTGTCGACCC TAACCACCCC AACTATGAAG CCATGGCCGA GCGCAACAAG AATGGATTCG GGTACCTTCC GGCCCTCAAG TTTGGACTTA CCGAAGAAGC CGAAATCATC AACGGACGTC TCGCCATGCT TGGTTTGGTC ATGCTCATCG GAGCCACCGC CACGTCCGGA CAAAACATGC TCGATATTGT CAACGAATGG GTTGGTGGAG CTTACTTTTG AGAGGGCTTA GAGTGATTAA CGTCTCTATA TCTAACGCTG CTATTAGTGT TTCATCTTTT GGAAACTCT
|
Protein sequence | MAPLRTTFAL LLSLVSASAF APVQNVARKQ TSVSAFKIDP QLYDDAVSDW EKQFPAFSKW GWGPSVQAEK WNGRHAMFGW FFICATAYCK GHGLIPDPEM LLDLKQWGTL ATISGKDTIS NERAIILVAN AHFFALSLAA TICPLPFGDS LFVDPNHPNY EAMAERNKNG FGYLPALKFG LTEEAEIING RLAMLGLVML IGATATSGQN MLDIVNEWVG GAYF
|
| |
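The Gene Length and GC content fields above can be re-derived from the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with 611952 - 610874 + 1 = 1079. The function names `gene_length`, `clean_sequence` and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def clean_sequence(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter groups and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    print(gene_length(610874, 611952))  # 1079, matching the Gene Length field
```

To compare against the GC content field, pass the full Gene sequence value to `gc_fraction` and multiply the result by 100. `len(clean_sequence(...))` applied to the Protein sequence value gives the residue count to compare against Protein Length.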