Gene Information |
Locus tag | PHATRDRAFT_47054 |
Symbol | |
ID | 7202146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 285651 |
End bp | 287084 |
Gene Length | 1434 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181174 |
Protein GI | 219121648 |
COG category | |
COG ID | |
TIGRFAM ID | |
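The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced coding region consists of the protein's codons plus a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the Gene Information table above.
START_BP = 285651
END_BP = 287084
PROTEIN_LENGTH_AA = 450


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed for the protein's codons plus one stop codon.

    Assumes an unspliced gene (the record lists 'Is gene spliced | No').
    """
    return protein_length_aa * 3 + 3


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

print(gene_bp)           # 1434, matching the Gene Length field
print(cds_bp)            # 1353
print(gene_bp - cds_bp)  # 81 bases not accounted for by the coding region
```

Under these assumptions, the reported gene span is 81 bp longer than the coding region alone, which would mean the span also covers some non-coding bases on either side of it.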
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.436568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCCTAAAG CAATCCCCAT AGATTGTGGT GTTTTTTCTT TCGTTTGATA AATAGCAAGG ATGAGTTCGA ATGACGGAGG CGGGGAGCAA CCTCTGAACA AATCTCCCGG AAAGGGCGGG AAGAGCCTGC AACTCATGCA ACAGCGCTTG TTGGCCCAGC AACAGCAACA ACCGGCACAA AGCCAACCTA CCGCCGCTTC GTCACCACCG TCAGAAATTC CGGCATCTGC TAATTCACTG GAAACTCCGG CAGATCCACC GTTGCAAGCC GGCCCCCCGG GAGGACGTCC GAAAGGAAAA GATTTCGCGG CAATGAGTGC CCGAGCTCCG AAGCCACCCG TTCTCGCTTC GGCACCAACG CCCGTCGCGC CCGCGGCTCA TGCGTCGTCC CATGCTGCCG TCGTTTCCAA CAAGCCCAAA GGCAAGGACT TTACTTCCAT GGCGTCGCGT ATGGGAGCTC CGCAGTCTGC GCCTGTGGCT GCGGTGCCGG CACCATCCCA GTCGTCGGCG ATTGCGCAGT TGCAAGCCGC GCAGGCGGCA CGGGCGGCGC AGATGCAAGC TGCGGCCCGT GCCGCTGCCG GACTCCCGCC TCTGAAGACC CCCACGGCTA CCACCGTGAC TACTACGCCA CCCCATATTC CGCCTCCCAA AATATCGGCA ACGCAACGGA CTTCCAGTAC CGGAGCCACT TCCGTACCGT ACGGATCGGC CACGAATGTC GGTAGCGGCG GCGCTTCTGG GACCGCCCAA TACCAGCCGC CCCCCACATC ATGGAATGCG GCGCATGCCA ATTCTCAACC ACCACAGCAA CAACAACAGA GCATGCGTCC CTCACAGCAA CCCTTGGCGC AAGTAAAACA CCGTAGCGCT CCGCTGACCG CCCGCCGCGG CTCGTCCGGA CTTTCACAGC CTCCGCCGAA AGTCAGCAAA GCTCGCGACC CCAAGCCGTC ACCAGTACCG ACAGCATCCG AAGCTCGGGT TTCGTTGCAG GGTCCACATC ACGCGGCGGC CGCTTATATG GCACCGTTAG TGGGGACACG GATCCAATCC CTGCTGCAGT CTTTGGATCC GCTCTACGTC ATGGACCCCG CCGCCGAGGA ACAAGTCCTC CAACTCGCGG ACGACTTTTT GGACAAGGTT GTGAAACAAA GTTTGCGGAT TGCCGCACAC CGGGGGAGCA AAACGTTGGA CGTCCAGGAT ATTCAACTGG TGCTGGGAAA ACAGTGGAAT ATTGTCATTC CGGGGTTGGG GCCACCCATG CCCAAGAAGC CCAAAACTAG TACCACCACG AGACCGAGTA GCACCGTGGC CGGCACCAAA CGGAAAAGTA GCGGTACAGG AGCGGGCAAC GTGACGAAAT TGTCCAAGTC CAGCAGTAGC GCAATCGCTA CGACCGCACG TAGTGCTTCT TGAACTAATG TATTTTCTAC TACT
|
Protein sequence | MSSNDGGGEQ PLNKSPGKGG KSLQLMQQRL LAQQQQQPAQ SQPTAASSPP SEIPASANSL ETPADPPLQA GPPGGRPKGK DFAAMSARAP KPPVLASAPT PVAPAAHASS HAAVVSNKPK GKDFTSMASR MGAPQSAPVA AVPAPSQSSA IAQLQAAQAA RAAQMQAAAR AAAGLPPLKT PTATTVTTTP PHIPPPKISA TQRTSSTGAT SVPYGSATNV GSGGASGTAQ YQPPPTSWNA AHANSQPPQQ QQQSMRPSQQ PLAQVKHRSA PLTARRGSSG LSQPPPKVSK ARDPKPSPVP TASEARVSLQ GPHHAAAAYM APLVGTRIQS LLQSLDPLYV MDPAAEEQVL QLADDFLDKV VKQSLRIAAH RGSKTLDVQD IQLVLGKQWN IVIPGLGPPM PKKPKTSTTT RPSSTVAGTK RKSSGTGAGN VTKLSKSSSS AIATTARSAS