Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49114 |
Symbol | |
ID | 7195336 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 624480 |
End bp | 626496 |
Gene Length | 2017 bp |
Protein Length | 527 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183643 |
Protein GI | 219126813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.024249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTTTCGTCC CTGCCCCAAA CACTCGAAAA AACTCCGTCC ATCTCTCAGT ATGGAACAAG CGTATATCAG TGAAGGCACT TCGATCGATA AGGTACGCAC ACTCGTCGTC TGGCGGTGGA TGGTCCGAGC AGAGAGCCTA GTGGACAAGT GCCCACGACA TTGTCGCGCG TAGGGTTGTC CACGGAATCC GTCGGATTTC TCTTGGTGCA CCTGATGTCA TGTAGAACAT TCCGGATTCC ACTATTTCAC TACTTATTTT CTTATAAATC TCTCACGATA CGGAATCTCG TTTTCGTTGG CAATCGGATA CGACTCACAC CCTTTGGATT TTTTTTTCTT TTGGTTCCTC TCCAGGGCGA AAACGCTCGT CTGAGTTCCT TCGTCGGTGC CATTGCGATT GCCGATCTCG TCAAGACGAC GTTGGGACCC AAAGGATTGG ACAAGATCCT CCAAAAGGTG GATCCGCACG ATCAGAGTAT TTCCGTCACC AACGACGGTG CCACCATTCT CCGCTCCGTA CACGTGGACA ACGCCGCCGC CAAGGTCCTC GTCGACATTG CCCGGGTCCA AGACGACGAA GTCGGCGACG GGACGACTTC CGTCGCGGTG CTCTGCGGAG AACTCTTGCG GGAAGCTGAA CAGCTTGTCA CGCAGCGAAT CCACCCCCAG ACGATCTGTG CCGGATGGCG TCTGGCCCGA CAAGTGGCAC GCCAAGCCCT CCTCGACGCC TCCCAACAAG CCACCGACGA GGACGTCTTC CGCGAACAAC TCTTGCAAAT CGCCACCACT ACTCTCAGTA GCAAATTGCT CACGCACGAA AAGGCGTACT TTGCCAATCT CGCCGTCGAC GCCGTCCTCC GACTCAAGGG CAGTCGCAAT CTGGAACACA TTCAGGTTTT GAAAAAAGCC GGTGGCGTCC TCCGGGACAG CTATCTCGAA GAAGGATTCC TGCTCAACAA GACCGTCGGG ACGGGACAAC CCAAGCGCGT GGAACACGCC AAAATTCTCG TCGCCAACAC ATCCATGGAT ACGGATAAAA TCAAAATTTA CGGATCCCGG GTCAAAGTGG ACAGCATCGA TAAGATTGCC TCCATCGAAC AAGCCGAAAA GGCCAAGATG AAGGACAAGG TGGACCGTAT TCTGACACAC AATTGCAACG TCTTTATTAA TCGACAATTG ATCTACAACT ATCCCGAATC TCTCTTTGCC GAACGCGGGG TCATGGCGAT TGAACACGCC GATTTTGAAG GCGTCGAACG ATTGGCCGCT GTTCTCGGCG GTGACGTGGT TAGCACGTTC GACAATCCCG ACAAGGTCAC GTTAGGAGAG TGCGCGTTGA TTGAAGAAGT ACGTACTCGG CAACGACAAT TTCGTGTCGG TGGTACGGTT GCGTGGGCAA TTCCATTCTC ACTATATGCT CACAACCTTG CTTGGTGTTC GTTCGTTGGT TCGTTGTAAT TGTTCTAGAT CTTTGTGGGT GAAGACAAGG TTTTGCGTTT CAGCGGTTGC AAATCCGGTG AAGCCTGTTC TATTGTCTTG CGCGGTGCGT CTACCCACGT CCTTGACGAA GCCGAACGTT CCTTACACGA TGCCTTGGCT ATCCTAACAT CCACCGTCAA GGAACCCCGA ACGGTGTACG GTGGGGGGTG TACCGAAGTG GCCATGGCTG CCGCAATCGA CAAGGCCGCG GAAGAAACGC CCGGCAAAAA AGCGCTCGCC ATGGCCGCCT TTGCCCGAGC CTTGCGTCAA TTGCCGGCCA TTGTCGCCGA CAATGGCGGC TACGATTCGG CCGAACTCGT CACGCAGCTG CGGGCGGCCC ACGCCGCCGG CAAGGCCTCC CACGGACTCG ATATGTACCA AGGAACCATT GGTGATATGG AAGCACTCGG TGTACGCGAA TCGTTCCAGT CGAAATTGCA AGTCTTGCTG AGTGCGTCCG AAGCGGCCGA AATGATTCTC CGGGTCGACG ATATCATCAA GGCGGCCCCG CGGCGGAGAG ACGAGTACGG GTATTAG
|
Protein sequence | MEQAYISEGT SIDKGENARL SSFVGAIAIA DLVKTTLGPK GLDKILQKVD PHDQSISVTN DGATILRSVH VDNAAAKVLV DIARVQDDEV GDGTTSVAVL CGELLREAEQ LVTQRIHPQT ICAGWRLARQ VARQALLDAS QQATDEDVFR EQLLQIATTT LSSKLLTHEK AYFANLAVDA VLRLKGSRNL EHIQVLKKAG GVLRDSYLEE GFLLNKTVGT GQPKRVEHAK ILVANTSMDT DKIKIYGSRV KVDSIDKIAS IEQAEKAKMK DKVDRILTHN CNVFINRQLI YNYPESLFAE RGVMAIEHAD FEGVERLAAV LGGDVVSTFD NPDKVTLGEC ALIEEIFVGE DKVLRFSGCK SGEACSIVLR GASTHVLDEA ERSLHDALAI LTSTVKEPRT VYGGGCTEVA MAAAIDKAAE ETPGKKALAM AAFARALRQL PAIVADNGGY DSAELVTQLR AAHAAGKASH GLDMYQGTIG DMEALGVRES FQSKLQVLLS ASEAAEMILR VDDIIKAAPR RRDEYGY
|
| |