Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36628 |
Symbol | |
ID | 7201883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 921722 |
End bp | 923131 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180924 |
Protein GI | 219120369 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0161158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACCC ATACGACGCG ATTATTGCAC CAATTTCTCC TCATACTAAC GCTTGGGACC GTGTCCTTAA TTTTTGAAAT TTACTGGACT ACACATGCGA TCATTGAAGG CAATGCGAGC ACACCGGAAA AAAACCTTTG GAAATACTTT TCTTCATTCG TGTCGACAAG AAATGCCCAA ATTTATATCA GCAGTATTGG CTTCGATAGC TTCGCTGGCC GATTCACGTC GCCGAAACCA TGGAATCACT CTCAGTTCTT TCCTGGAAGA AGGAAAATCG ATGCACGCGA CTCCGCCATT TACAGTTTTG CCAACATATC GAGTGCTTCG GCTGCTATTC CACTCGTGCT GACGGTGAGC TCTGAGAATG CTAGCAGCAT TACGACGACC AAAGCAAATA CCTCCGGCCG AAAGGAACAC CCCGACAATA CATTTTCAGC ATGCATCCTC GTCATGGACG ACAACCATCG ACTCGTGGAG TGGATTGCTT ACCACTACTA CGCCTTGAAT CTGCGGCACT TGGTAGTTAC GGTGGATCCT CATTCCAGAA CGCGTCCTAC GGCGGTACTA GATCGCTGGA GAGATCGCAT GTATATCGAA GAATGGAATG ATCGATCGTT TCTACCCTCT AATATTGGAC GAAGCGCGAA CGATACGGTC GAACAACGAC ATTTAAAGCA TAGGTTTCGT CAAGCACAAT TTTACAAGGG CTGTATCCGA AGACTAAGGG AATTCAACCG ATCTTGGACG ACTTTCATCG ACTCCGATGA GTACCTTACC ATCAATAGCC GCATGGTGGA CAACACGGCG CTCCGGATGC AACAACCAGG ACACGTCGCC GACTACTTTC ACGAGTTAAC ATGGCAAGCG CACAGCGGAC CAAACTACAC CTTCGCTGTG AATTTCGGTC AATCCTGCGT TTTGCTATCA CGCGCCATGT ACGGATCAGT AGAAAGTACA GACGAGGAGA TCCGTAGGGA CGTGCCAGAT TTTCTGGATC CAGCCCGCTT TGATACGCTG CGATGGCGGC ATCGCTCAAC CGAGGATGAC CACGTGTTAG CCAAAAGTCT GATTGACGTC TCTCAAGTAA AGCAACACCA CTTGGATGGC AAAGCCAATG CGCACAAAGC TCTTGTTGAA ATGTGTAAGT CAAACTCGTG GATAGCGTAT ACACTGCCCA TTGGCATTCA TCACTACCTT GGTAGTTGGG AGCAGTATAG CTACCGAGAC GACGCTCGCG ATGGTGGCGA TGCACACAGC TACGAGACGT GGCAAAGGAA AGGCTCTGCG CTGGTCAGTA CAGATGACGA GATCCGCCCT TGGATTCGCG GGTTTGTAAA GATGGTAGGC AACGGTACGG CACTGTCTTT GTTGGAAGGA GCTGGGCTTC CCACAAACCG GACTGCGTAA
|
Protein sequence | MATHTTRLLH QFLLILTLGT VSLIFEIYWT THAIIEGNAS TPEKNLWKYF SSFVSTRNAQ IYISSIGFDS FAGRFTSPKP WNHSQFFPGR RKIDARDSAI YSFANISSAS AAIPLVLTVS SENASSITTT KANTSGRKEH PDNTFSACIL VMDDNHRLVE WIAYHYYALN LRHLVVTVDP HSRTRPTAVL DRWRDRMYIE EWNDRSFLPS NIGRSANDTV EQRHLKHRFR QAQFYKGCIR RLREFNRSWT TFIDSDEYLT INSRMVDNTA LRMQQPGHVA DYFHELTWQA HSGPNYTFAV NFGQSCVLLS RAMYGSVEST DEEIRRDVPD FLDPARFDTL RWRHRSTEDD HVLAKSLIDV SQVKQHHLDG KANAHKALVE MCKSNSWIAY TLPIGIHHYL GSWEQYSYRD DARDGGDAHS YETWQRKGSA LVSTDDEIRP WIRGFVKMVG NGTALSLLEG AGLPTNRTA
|
| |