Gene PHATRDRAFT_43522 details

Gene Information

Locus tag: PHATRDRAFT_43522
Symbol: Lhcr7
ID: 7197205
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 708701
End bp: 709863
Gene Length: 1163 bp
Protein Length: 270 aa
Translation table:
GC content: 49%
IMG OID:
Product: protein fucoxanthin chlorophyll a/c protein
Protein accession: XP_002177668
Protein GI: 219111833
COG category:
COG ID:
TIGRFAM ID:
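
The gene length listed above matches the start and end coordinates if both are treated as inclusive (end minus start plus one). A minimal Python sketch of that arithmetic follows; the function name `gene_length` is illustrative and not part of any database API, and inclusive coordinates are an assumption that happens to reproduce the listed value.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(708701, 709863)
print(length)  # 1163

# A 270-residue protein needs 270 * 3 coding nucleotides (stop codon excluded).
coding_bp = 270 * 3
print(coding_bp)  # 810
```

The 810 coding nucleotides are fewer than the 1163 bp span, which is consistent with the record marking the gene as spliced.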


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTCAG TTCTGTGCTG TATGACCTTG ACGGCATCCG CAACGGCATT TGCTCCCGCG 
CAGACGGCGT CCTCTTGGTC GGCTGCGTTG GCGGCAGCGA GCTGGCCTGA AGCGGATATA
GGAAGCGGAG TCTCGTCGAC GGTCGAATCG ACGTCACCTC TTCCGAAGGA AAGCAACCAA
TTGCAAACAT CTTCCCCACG GAAAGGATCC GTCATGTCAC AAAGTTTACC CTTTCTGGTA
TGCCCTTCGG CTCTAGCAGA CTGTAATGAG CTTGCCGGGA ACGTAGGGTT TGATCCGTTG
GGTTTGGCCA AGAACAAGGA ACAATTGTGG GAATTCCGAG AAGCGGAGAT TAAGCATGCT
CGTCTTGCTA TGTTGGTATG TCAACAGAGC GCAATTGTTG TATGCCTCAT TATATATGTA
TATTTTGTGT GGGTCCATGT CGCCTCATAT TCCGGCTGGC TTTTTTTTGT CAGGCCGCGG
CTGGGTATCC CTTATCCGAA TTGTATGATC GGCAGATTGC GGAATACTTT AACATGCAGG
CATTGGTCGA TGGTACCGAC AGGGCACCTT CATTTCTGAA TGGGGGTCTT GAACGCGTCT
CACCTGTGTG GTGGGGCTTC TGCCTCGGTA TGACGGCGGC GATTGATCTT TACGGAGTTG
CCAAGGCGCG ATCTGGAGAC CCTACATACT TTCCTGGTAA CCTGGGATTT GACCCCGTAG
GTTTGTACCC GAGTGACCGA AAAGGAAGGA TGCGCATGGA ATTGGCAGAA ATCAAGCATG
GCAGGTTGGC AATGCTTGCC ATTGGAGGCT TTATCGTACA AGAATACGTC ACTAAGGTGG
GTGTCGTGAA GGAGACGCCT TTCTTCTTTG TGCCGTTTTC GGAAACCATG GAGCAGATGG
GTTTTCTGTA AAAGTTCGAT TTTTGAAAAC CGATAAGAGC GGCTTGTGTC TGAGTGAAGG
AGAAGCAACA ATGTTTTTAC ATAGTTTCAT GGTGAACGCA GGTTTAAAAT AGAACTGACA
GTGAGCGGCC TGTGCTACAG AAGAAGGCGA GATCATTCTT GACCATAAGA CCAACATATT
TCCACCTGTC AAGTATTGAG GTAAATAGGA GACAGATTCA AACGACTGTT CTTTAGAGAG
ATAACTGTAA GAAGTTCACA TGT
 
Protein sequence
MRSVLCCMTL TASATAFAPA QTASSWSAAL AAASWPEADI GSGVSSTVES TSPLPKESNQ 
LQTSSPRKGS VMSQSLPFLV CPSALADCNE LAGNVGFDPL GLAKNKEQLW EFREAEIKHA
RLAMLAAAGY PLSELYDRQI AEYFNMQALV DGTDRAPSFL NGGLERVSPV WWGFCLGMTA
AIDLYGVAKA RSGDPTYFPG NLGFDPVGLY PSDRKGRMRM ELAEIKHGRL AMLAIGGFIV
QEYVTKVGVV KETPFFFVPF SETMEQMGFL
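
Both sequences above are printed in groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first and last lines of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in space-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a DNA string."""
    counted = [base for base in dna if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T characters found")
    return sum(base in "GC" for base in counted) / len(counted)


# First and last lines of the gene sequence above, used as a short demo input.
demo = """
ATGCGTTCAG TTCTGTGCTG TATGACCTTG ACGGCATCCG CAACGGCATT TGCTCCCGCG
ATAACTGTAA GAAGTTCACA TGT
"""

seq = clean_sequence(demo)
print(len(seq))  # 83 (one full line of 60 plus the final 23 bases)
print(round(gc_fraction(seq), 2))
```

Pasting the complete gene sequence into `demo` should give a length that can be compared with the listed gene length and a GC fraction that can be compared with the listed GC content.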