Gene Information |
Locus tag | PHATRDRAFT_44546 |
Symbol | |
ID | 7198073 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 861558 |
End bp | 862675 |
Gene Length | 1118 bp |
Protein Length | 316 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178319 |
Protein GI | 219115047 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000667438 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATAATCTGA ATCTTTCTTT CTTTGGAAAA CAACATATAG ACGACGACTC ACGACATGAA ATTGTTCTCC AAGTTTGCTC TCCCTTCGAC TGCTGGAAAT GGCAAATCCG GTCTCGGCTT TGGGTCCATG GGCATCACGT CTTTCTACGG TGATCCCATG CCCGAAGACA AGGCAATGGA ACTCCTCCAG ACTATTTACG ACAAGGGATG TCGTCACTTT GACACAGCGG AAGTCTACAC AGCCGAAGAA CTCACAATGA ACTGGTCCTT GGACGCTTCT TCAAAAAAAT TCCCCGTGAT TCGTTTACGG TCGCGACCAA ATTCTGGCCC AAGGATGGCG CTTACGATTA TGAGACCGTC AAGGCCTCTT TGACTGGCTC TCTGGATCGT CTTCAGCTTG AGTACGTGGA TCTCTATTAT GCGCATCGAG TCATGACCTT AGAAGGTGGT ATGGATTTTG CGCGCACCGC CAAGCGTCTT AAAGAAGAAG GACTTATCAA AGAGGTTGGT CTGAGCGAAG TGGGCGGCAA GTGGCTCAAA CAAATCAACA ACATCTATCC CATTGACGCT GTGCAACAGG AATGGAGCTT GCTTACCCGT AACTTGGAGG ATGAACTGGT TCCGGTTTGT AAGGAACTTG ATATTACCAT TGTTGCGTAT AGTCCTCTCG CCCGAAATCT ACTGGCGACC AAGCTCGAAG AGGCGCCCAA AGACTGGCGG GCTAAGCTTC CTCGTTACTC CAAGGAAAAT TTTGGAGCAA ACCGCAAGAT TGTGGAAAAG TTGGAAGAGC TTGCCGCAAA GTACAACGGA ACAACTGCTC AGCTTTCCCT AGCGTGGCTC TTCCACAAGG CCAACGAACT TGGTGTGGCT GTAGTTCCTA TTCCGGGATC TACAAAGCTG AGCCACGCAA TTAGCAATCT GGATTCAACC AAGATTGAAA TTTCTGACGA AGATACAGCC ACGTTGGAAG GATTGGCTGC TCAAGTGGCG GGGGCACGGG GTGGCGAAGA CTACACGGGA ATAGCAATCG AAGCGCAAGA CTAACCAGTC AAAGCTGCTT TCAATGAATA AATGTATGAA TAATGGACTC TAATCTTTAG GAAGCAATGA CCGGTTTC
Protein sequence | MKLFSKFALP STAGNGKSGL GFGSMGITSF YGDPMPEDKA MELLQTIYDK GCRHFDTAEV YTAEELTMNW SLDASSKKFP DGAYDYETVK ASLTGSLDRL QLEYVDLYYA HRVMTLEGGM DFARTAKRLK EEGLIKEVGL SEVGGKWLKQ INNIYPIDAV QQEWSLLTRN LEDELVPVCK ELDITIVAYS PLARNLLATK LEEAPKDWRA KLPRYSKENF GANRKIVEKL EELAAKYNGT TAQLSLAWLF HKANELGVAV VPIPGSTKLS HAISNLDSTK IEISDEDTAT LEGLAAQVAG ARGGEDYTGI AIEAQD