Gene Information |
Locus tag | PHATRDRAFT_50240 |
Symbol | |
ID | 7199016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 75277 |
End bp | 76998 |
Gene Length | 1722 bp |
Protein Length | 295 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185119 |
Protein GI | 219129908 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGACAACC ATAGTGAATC ACACCCATCG ACATCGTCCA AGCCTAAATA CAAGCACGCG AGTACAGAGA ATCGAACATA AGCATCAAGT CTCTTTCCGG TACGTTGTCA ACAGGCTAAA GTTACATTCA TCGTCTTCCC TCGCATCCCT CGTTCAGATT CATCCACTTC ATGGCAATAT TTTTTCGACA CGGCCATGCA AAACACAAAA CGATTGTTGT CGAGTACGGA TGCTTCTTCG TGACTTCGAT GGAACTCTCC TTGGTCGTGA AGTTCTCGAC ATGAGAACGT GCGTTAAATA TTGCATACAG TAGGGGTCGA GTTGTCTATT TTGGTTGGTA CCTTTTGTTT CGAAATTTTT AAAATCGCCC TTTCCATACA CCGGCTCGCT TGTTTATCGC CGGCCATATT TTTCCAAAAT ATTAGCCAGG AATTCCTGTA AAAGCCGTTC AAAAATCAGC ATCGAGACAT CCACAGAATC GCATCAGTCT GATAAAGTGG CTCTTCCTTC ATTAACTGTA GGACTGTCGG CCCTAGAGGT TGGTCAATAA GAGCTTTGCC CATGTGTTTA TTGGAAGTAC ACCAAGCATG AAAGGAGGCT AAGTGATAGT CTTCCTACAT ATTCATGCTA ATAGACATCC TATGCTCACA CCAATTTGTC TCCATCATCT TTACCCTGCA GAAGCATAAA TTAACCCCAC TAAAATGAAG GACGAAATGG GGAAAAGCAG TGAGAAGATG CCAGTCATTG TCAAGCGGCC CACTAAGAAG AAGCCCAAAG ACAAGCCGAA GAGGCCTTTG AGTGCTTATA ATTTCTTCTT CAAGGAAGAG CGCGAAAAGA TTCTTCGTGT AGTCCTCGCC GAAGATCCGT CCGAGGTAGA GAACGATCCC GAATCCGAGG ATCATATCGA CGACGAGATG CTTGGAAGAC TCCGGAAGGA AGGAGGAAAG GTCAGCTTCG AGGAAATGGG TAAACTCATC GGACAGAGGT GGAAGAAAAT CGACCCCGAT CGTCTCACGA GGTATTCAGA GCTTGCTGCT GAGGACACCG AGCGCTACAA GAAGGAAATG CAGACCTACA ACGGCCGTCA AGAGGCTAAA ATGCGTAGCG AAGCGCTGAA GCCACCAGCA TCGTTTCCAG GAATGGCAAT GGGTATGGAC AAGGGTGGAT CAGCAGCCAA TAATCTTGGA GCTTATTCAG ATGCTATGAG TGGAATGAGT TCTGCGTTCG CAAATGCCGG AGGTATGCAA GGATACCCTT ATGGTGCTAT GGATTTTGGA GCCGGTTATG GTGGAATGGG CATGGCCGGT ATGTACAACC CGTACGGTGG CTATCCAGGA ATGCAAGGGG GCGGTATGGG CGGAGGGAAT CCAGATCCCA TGGCGCATCT TCAAGGTGGT GGTAATGCAA GCATGTATGG TATGATGGGC GGCGGTGGAT TTCAAGGTAG CATGATGGGA TACGGAGGTG GTCAAGTAGG AGCCGGAGCC CCTGGTTCTG ATCCACAAGG AGGATATCCT CCTCAAATGG ACCCATCGCA AGCGAATATG TACGGCTATG GAGCAGGTCA GGGTTGGGGA GGGCAGCAAT AAAAATATAT TGTCTTGGAA TTTGGCAAGT AGCAGCTCCG GTTGACCGGG CGGTATCGTG TATATCTTGT GGTGCCACCC TTCTTTTTCA ATTTCTGTCT CGTCATAATC ACTGAAATCT TCCTTGGCAT CT
|
Protein sequence | MKDEMGKSSE KMPVIVKRPT KKKPKDKPKR PLSAYNFFFK EEREKILRVV LAEDPSEVEN DPESEDHIDD EMLGRLRKEG GKVSFEEMGK LIGQRWKKID PDRLTRYSEL AAEDTERYKK EMQTYNGRQE AKMRSEALKP PASFPGMAMG MDKGGSAANN LGAYSDAMSG MSSAFANAGG MQGYPYGAMD FGAGYGGMGM AGMYNPYGGY PGMQGGGMGG GNPDPMAHLQ GGGNASMYGM MGGGGFQGSM MGYGGGQVGA GAPGSDPQGG YPPQMDPSQA NMYGYGAGQG WGGQQ