Gene Information |
Locus tag | PHATRDRAFT_38097 |
Symbol | |
ID | 7202952 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 66200 |
End bp | 67650 |
Gene Length | 1451 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182314 |
Protein GI | 219124026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
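Note on the length fields: the reported gene length matches the start and end coordinates when both are treated as 1-based and inclusive. The sketch below checks that arithmetic and estimates how much of the gene span is non-coding. It assumes the span runs from the start codon through the stop codon with no UTR, which the record does not state outright. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 66200
END_BP = 67650
PROTEIN_LENGTH_AA = 429


def span_length(start: int, end: int) -> int:
    """Length of a closed (1-based, inclusive) coordinate interval."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

# Assumption: the gene span covers start codon through stop codon (no UTR),
# so whatever is left over is intronic sequence.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

Under that assumption the difference between the two lengths is the total intron length, which is consistent with the gene being flagged as spliced.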
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCC CCGGAAGGGA CGAGCGCGGA GGCGACGCCG CGCCGGAGAG GGACTACGGC GAGTTGGGCT ACGCGTCCGC GGTTCGCTTA CCCGTCCGCC ATACGGATGC GTTTGGACGA AATCCGTACA GTAATCCCCG GGAACATTCA GTCGTAGCCA CTCGTCCGGA TCCTTCCTCG ACGACCAAAA AGAAAAAGGC ATGGAAGAAG CCGCCGGTGA GTATCAGGCA AAACTTTGTT AGCTGGTAGT AGTGAGTCTG TTGATCCTCA GATGGTCCTC CGTTACTATT AATCTAGTGA TCGCAGATTG GCCTTTGGTC GGTGCCAAAC CCCATAATCC CCTCTCACGC TGCCTTTGCT CCTTTCTTGG GTGTCAGGGA ATGCCCAAAC GGCCGTTGTC GGCCTACAAT CTCTTTTTCC GGCGCGAGCG ACAGGAAATA CTGGGGGAAG ACCTTTCCAA GGAGTTCGAG ATTACCGACC AAAGTAAGCG AAAGCATCGT AAAACGCACG GAAAAATTGG CTTTACCGAT ATGGCGCGAC AAATCAGTCA AAAGTGGAAA GATTTGGAGG AAGAATTGCG GAGACCCTTT ATCGAACAAG CCAAGAAGGA GAAGGAAAAA TACATGGTGG CAAAGGATGC TTGGGTCCAG GAGCAGAAGG TCGCGGTCAA GGCCCGTACC GAAGCTTTGG CAAAGGAAGA AGCGGCGGCG GCCGCCGCCT CCCGTTGGAC TACCGTAAAT CCCCCGCCCA TGTTGGACAG GAACGCTTGG GATGTATCCT CGCACACTAC GATCCCCCCG ATAGATCCCC ATCACGGTCG TTTTACAGAG ACCGGAGCGC GCCAACTCCA TTCCATGAGA TCCGCGGCAA TTCCCATGAA TGCTTCCTTT GAGGCCATGG GAGGGTCTCG TGGATTTCCG GAAGAAATGC AACGACGCCC CCCGCCTTCA ATGAATCCAG CGGAAGATGC CTACGGCATG CTCAGCGATC AGGAACGAAT ACGTGAAATG AGGATGCATC TGGAACGAGC CGCAGCTCTG CAGGAAGAAA TACGTAGAAA CACAGGGGCC AGCAATGCAA TGATGGATGA ACTGCGGTTA CCAATCCCAC CTCCGCGAGA TGGGCCGGGT CCTCCTGGTG GTTTGCCAAC GTATGCGCAG GAGTCGCGCC GTTTCAGTTC TCGGGGAGGA TCTTTTGAAG GATTGGGAGG GAACGATTGG TACGAATACG AGCAGCAACA ACGACAGCAA CAGCAACGTC GTGCGCAAGA GCGAGCGGAA ATGATGGCTC TACAGCGCCA ACAGCAGCAA CCACGACACC GACCGGACTT CATGGCTCTG GAACAACGTC GGAGAGCCCT GGAACAATCG ATGGAAATTG AGCGGAGATT CCAAATGGAA GAAGAAATGC GCCGTCGCGG ACAACGGAGG GGGCCGGGAG GGAACATGTA A
|
Protein sequence | MNIPGRDERG GDAAPERDYG ELGYASAVRL PVRHTDAFGR NPYSNPREHS VVATRPDPSS TTKKKKAWKK PPGMPKRPLS AYNLFFRRER QEILGEDLSK EFEITDQSKR KHRKTHGKIG FTDMARQISQ KWKDLEEELR RPFIEQAKKE KEKYMVAKDA WVQEQKVAVK ARTEALAKEE AAAAAASRWT TVNPPPMLDR NAWDVSSHTT IPPIDPHHGR FTETGARQLH SMRSAAIPMN ASFEAMGGSR GFPEEMQRRP PPSMNPAEDA YGMLSDQERI REMRMHLERA AALQEEIRRN TGASNAMMDE LRLPIPPPRD GPGPPGGLPT YAQESRRFSS RGGSFEGLGG NDWYEYEQQQ RQQQQRRAQE RAEMMALQRQ QQQPRHRPDF MALEQRRRAL EQSMEIERRF QMEEEMRRRG QRRGPGGNM