Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38794 |
Symbol | |
ID | 7203569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 223716 |
End bp | 225341 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182796 |
Protein GI | 219125038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAG CGGACGATAC AATGATCCCA AATCGATCGT CGACCCACTT TTACGATGAA GACGAAGACG ACGATGATGA GGAGGATGAC TTGGAGGAGC TTCAAGTTCT GCAGCCCTCC GCGAGACAGG AACTCAGACG AGCCGATCGC TTATCGGTGC GTTTGCTGGC AATTCCGGAC GAAGATGAGG ACGAGCACGA TCGCGTTTTG CGAAAGTCAC TGCGCCTACT GGACAACGAT CTAAACGGCT CATCTGGCTT GTTCGACGAC GAAAGCAATG GCGCTGTGAT ACGTCAAAAT TCGGGCGACA TCAATTTAGG AGGAGGACTG GTCCGTCGGA GCTCACGGGC TTCTCTACGC TTGTCCGCAC GCCCCGGTGA AGACGGCAAA ACTGCAGGGC AACGCGTGTG CACGATGGTT GGTGTTGCCG TAGCAGCGGT TGTTTTGCTT TTAGGAATCG CCGGATTCAT TGGTGTTACG GTCGTTGGCC CACCCAATCA ACCAGTCGGA CCGTACCAGT TGGTAGAACG ACAGGAAGGA AACGATTTCT TCCAGTTCTA TGACTTTTAC GAAGGCCGAG ACTCGGCCGG ATCTAACGGG TTTTTGAATT ACGTATCGTA CGATAAGGCA ACCTTGCGGG AAATCGTCAA TGTCACCTAC GAAGATGACG TTCTGGATAT ATACGCACAG CAACGCAGCA CACCGGAAGT CGGTTCGAAT GAAGCGCAGA CCAAACAAGA ACCATTTATT TACATGGGAT CGGCTCCAAC GCCAGCTGGT CCGCGAGATT CTATTCGCTT GGAAGGTAAT CGCCGCTTCA ATAGGGGCTT GTTCATCATT GATATTCGCC ACATGCCCGT GGGATGCGGA GTCTGGCCCG CCTTTTGGCT CACGGACGAG GCCAATTGGC CAGTCAACGG AGAAATCGAT ATTGTAGAAG GCGTAAATTA CCAGTCCGTG GCGAAGACAG CCTTACATAC TACAAAAACA TGCATTATGG ACGACATTCC ACTTGGTACG ATGACAGGAG GATGGGATTC AGCCCAAGGT ATCCCAAATG CCAAAACCGG TATCCCAGAT ATGACAATGC GAGAAGCACG CAATTGCTTC GTGTACGATC CCCATCAGTG GCTGAATCAA GGGTGTGTTG CAGTGGATAC GGAAGGAGGT TCGTTAGGAG TTCCGCTTAA TGCTAAAGGA GGCGGTGTCT TTGCGTTGGA ATGGGACCCC ATCAACCGAC ACATTCGTAC CTGGGTATTC TCTCCGCATT TAAATGTACC TGATAATCTC GTCGATTCTA TTCGAACGGC AAGTTTACCC GACTCAGAAC GCATCGTGCC CGATCCAGAT GTTTGGCCGC TTCCGTACGG CTTTTTTGCA ATTGGTGAAG GTACCAACTG CCCGGCATAC CATTTTCGGC ATATGCGACT TGTATTTAAT ACGGCGTTTT GCGGCAGTGT GGCAGGAAAC CGGTTCCACA TTGATTGCAA AAAGCAAGTC GCGGCCAACT TTAGTACCTG CACTGATTGG ATCAAAAGCG AGCCAGAAGA ATTGCAGGAA GCTTATTGGA AAATTCGCGG GGTGTATGTT TACGAACGTG CGTGGGAGCG AACATGGAGT GTTTAG
|
Protein sequence | MKQADDTMIP NRSSTHFYDE DEDDDDEEDD LEELQVLQPS ARQELRRADR LSVRLLAIPD EDEDEHDRVL RKSLRLLDND LNGSSGLFDD ESNGAVIRQN SGDINLGGGL VRRSSRASLR LSARPGEDGK TAGQRVCTMV GVAVAAVVLL LGIAGFIGVT VVGPPNQPVG PYQLVERQEG NDFFQFYDFY EGRDSAGSNG FLNYVSYDKA TLREIVNVTY EDDVLDIYAQ QRSTPEVGSN EAQTKQEPFI YMGSAPTPAG PRDSIRLEGN RRFNRGLFII DIRHMPVGCG VWPAFWLTDE ANWPVNGEID IVEGVNYQSV AKTALHTTKT CIMDDIPLGT MTGGWDSAQG IPNAKTGIPD MTMREARNCF VYDPHQWLNQ GCVAVDTEGG SLGVPLNAKG GGVFALEWDP INRHIRTWVF SPHLNVPDNL VDSIRTASLP DSERIVPDPD VWPLPYGFFA IGEGTNCPAY HFRHMRLVFN TAFCGSVAGN RFHIDCKKQV AANFSTCTDW IKSEPEELQE AYWKIRGVYV YERAWERTWS V
|
| |