Gene Information |
Locus tag | PHATRDRAFT_1971 |
Symbol | |
ID | 7198358 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 282391 |
End bp | 284055 |
Gene Length | 1665 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184517 |
Protein GI | 219128642 |
COG category | |
COG ID | |
TIGRFAM ID | |
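The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length). The helper name `span_length` is hypothetical and not part of any database API. The gap between the genomic span and three times the protein length is consistent with the record marking the gene as spliced.

```python
# Values copied from the record above.
START_BP = 282391
END_BP = 284055
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (hypothetical helper)."""
    return end - start + 1


gene_length_bp = span_length(START_BP, END_BP)

# Bases in the genomic span that are not accounted for by the listed protein length.
# For a spliced gene, these would plausibly be intronic.
non_coding_bp = gene_length_bp - 3 * PROTEIN_LENGTH_AA

print(f"Gene length: {gene_length_bp} bp")
print(f"Span not accounted for by protein: {non_coding_bp} bp")
```

If the coordinates were 0-based or half-open instead, the computed length would differ by one from the listed Gene Length, so this check also serves as a quick test of the coordinate convention.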
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACTCGAGT CGGAACAGCA ACAAGCTAGC GTTCAGGACG AGTGGATGGA TCTAGCGGAG CGCATTGACA GCATCCAGGG GAACGATGTA TGGCGGTCCG ACCCGAGCTG CCCCTTGTAC GAACAGGAAC GTATTTCGGC GCGGATTGAC GAATTGGTGC ACCTTATGCG ACGCCGGGAT ATCTTTGAAC TCATGTTTGT GCTGCGTGCT TCGATTGGAC GCAACAAGTT TGGGCTGTTG CACGAAGGGC TGTTCAGCAA GGCGTTAGCG GGTACCAAAG TACTCGTCGA GACGTATCAC AACGTGGTCT GTGCTGCCTT GGACTTTTGT TGCGACGCTC CCGTGTCTCC TGACGAGGAT CCTATCCCTA CGGATGCCCG TCTCGCATTT TTCAACGAAA CTCGACACGC TTACGGACGT ACAGCCTTGC TCCTATCCGG TGGTGCGGCC TTGGGCTTTT ACCATACTGG TGTCGTCAAG ACGCTTATGG AAAATCGCCT CATGCCGCGT GTAATTGGTG GCAGCTCGGC TGGTTCGCTC GTGTGTGCCA TGATAGCTAC ACGGACAGAC GAAGAATGCG TGCACGACAT GTTCAACGCC CAAGGTACCG ACGCTCCGGG ACATTCCGGC CAACTCCCAC TCAACTTTTT CCGCCCCCTC CAAACGGGGA ATATCAACGC AGCCAAGGTA GACGCAACGC CGAACAAACA ACTCGGGGGA ATTCGTGAAG TGTACTACAA TACTGCTGGA TTCTTTCACG ATGCCAAGCG GACCTTGCAA GGGTTGGTTC CAATTCCTCT CCGACACTTT TCCGCCGTGT TGTACGATAT CGTCACCGGC AACCGTCGGC CTCAAGACAT GCTCATGAAT GATACAGAAC ACTTCCGAGC TTGCGTACGG GCCTGTGTCG GTAACTTCAC ATTTCAAGAA GCTTTCGACC GGACTGGGCG TATTCTGAAC ATCGTGGTGA CGCCCAAGAA CAATTCCGAT CCGCCCCGCT TACTCAACTA CTTGACGGCA CCGCACGTTA TGGTATGGTC AGCTGCCGTC GCGAGCTCCT CCCTACCCGG AGTTTTTGAA GCTAATCGAC TAGTTGTCAA GGAAGCAGAC GGTTGGGAAC GGTACGAGTC GGGCGGCGCG CCACAGCACT TTTCGGATGG ATCGATGGAA CAGGATTTGC CCATGCAGCA GCTATCGGAG ATGTTCAACG TCAACCACTT TTTGATCTCG CAGGCCAACC CACACGCCGT CATGTTTGCC AATTATCAAC AAAAGAATTC GGTGTGGAGT AATCCTGTTA CGGGCTTTGT GGATTCTATT CTGACCTTTT TACGCGATCA AGTGCGCACT TGGTTGTTGC ATCTCGTGGC GTGCGTTGGC GCTCGTAGTA TTACACCTAT GTTCCAGACT CAACGTGGAA TTGGTACAAC TTTCCTGACG CAAGAATACG AAGGGCGGTC TTGCGACATT TCACTCATCC CATGGTTGGG TCATCGGGGA CTCTTCAGTG CCTTATTGCA CATTATCTAC AACCCAAGGG AAGCCGAGTT TCGCGAATGG ATCCAAGCAG CTGAACGAGA AACCTGGCGA CACATTCCGG CCATCAAATC GCACATCGCC GAAGAAGTTA CTCTGGATCG TTGTGTACAA AGGCTACGAA AAAGA
|
Protein sequence | QLESEQQQAS VQDEWMDLAE RIDSIQGNDV WRSDPSCPLY EQERISARID ELVHLMRRRD IFELMFVLRA SIGRNKFGLL HEGLFSKALA GTKVLVETYH NVVCAALDFC CDAPVSPDED PIPTDARLAF FNETRHAYGR TALLLSGGAA LGFYHTGVVK TLMENRLMPR VIGGSSAGSL VCAMIATRTD EECRTLQGLV PIPLRHFSAV LYDIVTGNRR PQDMLMNDTE HFRACVRACV GNFTFQEAFD RTGRILNIVV TPKNNSDPPR LLNYLTAPHV MVWSAAVASS SLPGVFEANR LVVKEADGWE RYESGGAPQH FSDGSMEQDL PMQQLSEMFN VNHFLISQAN PHAVMFANYQ QKNSVWSNPV TGFVDSILTF LRDQVRTWLL HLVACVGARS ITPMFQTQRG IGTTFLTQEY EGRSCDISLI PWLGHRGLFS ALLHIIYNPR EAEFREWIQA AERETWRHIP AIKSHIAEEV TLDRCVQRLR KR