Gene Information |
Locus tag | PHATRDRAFT_44601 |
Symbol | Lhcr1 |
ID | 7198100 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1013568 |
End bp | 1014568 |
Gene Length | 1001 bp |
Protein Length | 200 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | protein fucoxanthin chlorophyll a/c protein |
Protein accession | XP_002178624 |
Protein GI | 219115657 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCACCGAA TAGATACACC AACCAAGGAG GCTAGTGCAA GCTGTCCAGC ATCTTTTTCA GAGTGCTATC GTTTTTGACC ATGAAGTACG CCGTCTTTGC CTCTCTTTTG GCCAGCGCTG CTGCCTTTGC CCCGGCTGCG AAGGTGTGTT GGACCTACGG GTTTCCCATG TAGTTCGTTC AGTAACAGAG GCAGGAATCG TCTTTTTGTA GACCGCAATT TTGTAGACCG CAATCATCGC GCAAGAACCC CGAACGCCCT ACTTCTTGTT TTAAAAACAA TACTCCTCCT TTCATGTAGA AATTCAACTC TCCGTGGCTA ACACATTTTT GGTTTGTTTC GTATTTCTCT TAGCCTGCGG CTTCGACTTC CGCGTTGAAT GCGGAAATGT CCAAATCCAT GCCTTTCCTA ACGGCTCCCA AGAACACTGG GGGCTACGTC GGGGATGTTG GCTTTGATCC GCTTGGATTC TCCGACAACT TCGACATGAA GTGGCTGCGC GAGAGCGAAA TCAAACACGG ACGGGCCGCT ATGTTGGCCA CTGTCGGATT TGTCATGCAA CAGTTCTGGA CTCTTCCCGG TATGGTTCAT GTGGATGACT CGAACCTGGC GCCAGGAGCA GCTGGTATCT CGCCCATGCT GCAAATTGTC TTCGGAATGG GCGCCCTGGA ATGGTGGACT AATAAGGGAA AAGTCACGAT GGAAGATATG TTCAGTGACA GCAGCCGCGA ACCTGGTAAT CTTGGATTCG ACCCCATGGG CATGTCCAAG AACAAGTCCA AGGAAGAAGC CGAAGCCATG CAACTCAAGG AAATCAAAAA TGGACGATTG GCCATGTTGG CTATTGGAGG TATGATCCAT CACAACTGGG TGACCGGTGA AGCCCTGTTC TGATGCGTCT GCGAGCAGCG GGACGCTGGC AAAGAACCCA TGAACGGATG CAGCGACTTT CACTACTATA GGTTTATCTA ACTGCTAGAG AGACCGGCTT TCGATTTCGT C
Protein sequence | MKYAVFASLL ASAAAFAPAA KPAASTSALN AEMSKSMPFL TAPKNTGGYV GDVGFDPLGF SDNFDMKWLR ESEIKHGRAA MLATVGFVMQ QFWTLPGMVH VDDSNLAPGA AGISPMLQIV FGMGALEWWT NKGKVTMEDM FSDSSREPGN LGFDPMGMSK NKSKEEAEAM QLKEIKNGRL AMLAIGGMIH HNWVTGEALF