Gene Information |
Locus tag | PHATRDRAFT_27923 |
Symbol | |
ID | 7201668 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 301039 |
End bp | 303513 |
Gene Length | 2475 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180797 |
Protein GI | 219120102 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.143267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCCTAGCG GTGAGTGCGA ACTTGACCAT GAGAATGATT GCATGCTATT TTTACATCCC AGATAGAATA AGAACGATCC CAGTTCCGGA TTGTAAAAGT AGATGTTCGG GAAAGCGCCT GCGAGGCCGA ACTTCCGGAA ATAAATGATG TCGGACCGGA TCTTTGGGCA GGGTTTCGAA CGACATCGAT GCTGCGGACT GGTGTATCTT GTCCATAGCG ACTCACACAT TGGATTTCCG AGCTCCATAT TTTCCCACAC CTTGCCGGTA TCGCTGGTCA CAGGGATCGT GGCCTCTGTT GTTTACACCT TTTTGATGGC CTTCTTGCTC ATTCTCACCA CGCTATCCCT TTTCTTTCTC TATCTAGTAG CGTCGAAGTC TTTGCCTTTT TGCAATCATG AATACTCCTG AATTTGAAAA GGAATCCGAC TATGGATTCA TATACAAGGT CTCCGGACCT TTGGTAATTG CCGATGGAAT GAGCGGAGCT GCCATGTACG AACTCGTTCG TGTGGGTCAC TCCAAGCTTG TCGGAGAGAT CATTAAGCTT GAAGGAGATA CCGCCTCCAT TCAGGTCTAC GAAGACACTT CTGGTTTGAC GGTGGGAGAT CCCGTCCTGC GTAAGAAAGC ACCTCTTTCT GTGGAACTCG GACCCGGTAT TATGGAGACC ATTTTCGACG GTATCCAGCG TCCTTTGGAA ACGATTTTCG TCGAAAGCAA GGATATCTTT GTCAGCAAGG TACGCCGAAC GCCGGATCCA GTCGTTTTTG CTAATTTTAC AAGTAAAATT CTCACATCTT GTATTCATAT GTACTGGGGA TCAATAGGGT GTGGATGTTC CTTGTCTGTC TCGCAAAATT CAGTGGGCCT TCACCCCAGG TGACTTCAAA GAGGGCCAGC CCATTTCTGG AGGTGACGTT ATTGGGATTG TGTACGAGAA CGAACTGATC GATTCGCACA AGATTATCTG TCCTCCCAAC GTATATGGTA CAGTGAGCAA AATCAATTCC ACTGGAACGG ACGGAAAGGA AACTTTTAAC GTTGATGATG TTGTCATGGA GGTATACAAC GAAGCCCAAA ACAAGACCCA CAAGTTGACC TTGAGCCATT TCTGGCCCGT GCGTCGTCCT CGTCCCATTG TAGAGAAGCT CCCTGGTAAC GTCCCTCTGA TTACTGGTCT TCGCGTTATT GATGGTCTTT TCCCCTCTGT GCTTGGAGGA ACTTGCGCCG TCCCTGGTGC CTTTGGATGC GGAAAGACAG TCATTAGTCA GTCTCTCTCG AAATTCTCCA ACTCGGATGC TATTGTTTAC GTTGGTTGTG GTGAACGTGG AAACGAGATG GCGGAAGTGT TGTGCGATTT CCCTGAACTT ACCTTGACTC GGAAGGACGA AGACGGTTCG GAAAGAGAAG TTGGAATCAT GAAGCGTACC ACTTTGGTTG CCAACACCTC CAACATGCCC GTCGCTGCCC GCGAGGCTTC TATTTATACC GGTATTACTC TTGCCGAATA TTTCCGTGAT CAAGGTATGA ATGTTTCCAT GATGGCTGAC TCTACATCTC GCTGGGCCGA GGCACTTCGT GAGATTTCCG GTCGTCTGGG AGAGATGCCT GCCGATTCCG GATACCCCGC TTATTTGGGA GCTCGTCTTG CAGCTTTCTA CGAACGTGCT GGACGTGTAT CCTGTCTGGG ATCTCCCACC CGCGAGGGTA CCGTTACCGT TGTCGGAGCT GTGTCTCCGC CCGGTGGAGA TTTCTCCGAT CCTGTCACGG CGGCGACACT GTCGATCGTG CAAGTCTTTT 
GGGGTCTCGA TAAGAAGCTA GCCCAGCGCA AGCATTTCCC CAGTCTCAAC TGGCTTATTT CGTACACAAA GTACATGCAA GTCCTCGAGC CCTACTTTAA CAATATGGAC GATCAATATT CGTACCTGCG AAACCAGGCG CGAAATATTC TCCAGCAGGA AGACAACTTG TCTGAAATTG TGCAGCTTGT CGGAAAGGAA TCCTTGTCAG AAGATCAAAA GGTTGTCATG GAGGTCGCTA AGCTCATTCG TGAGGACTTT CTGGCCCAAA ATGCTTTCAC AGATTACGAT TTCACTTGTC CTCTTGCCAA GACTGTTGGC ATGTTGAAGG TGATCATCAC CTTTTACGAT CTCTGTCAGA AGGCAATTGC TGATTCTCCA TCGGATGCCA AGCTCACGTA CGCGCACATC AAAACTGCTT TGGCCCCTGT CATCCAAAAG GTTGTAGACA CCAAGTACGT CGACCCCAAG GCTCATGCGG AAGACATTAA CAAAAGCTAC AGCGTCGTTT TGGACGAGAT GAAGCGGGAA TTCCAAACTT TGACGGACGG AATTTAATCG ACACGTCCTA CAATCCATGG AAAGAAAGCG CGCGCGCACA CACGCAACAG TGTTAGTCCT GCAGTATAAC TTGAAGAAAA AAGCTTAATG AGTAACTACA GTAGTGTTAC TGGCT
|
Protein sequence | MNTPEFEKES DYGFIYKVSG PLVIADGMSG AAMYELVRVG HSKLVGEIIK LEGDTASIQV YEDTSGLTVG DPVLRKKAPL SVELGPGIME TIFDGIQRPL ETIFVESKDI FVSKGVDVPC LSRKIQWAFT PGDFKEGQPI SGGDVIGIVY ENELIDSHKI ICPPNVYGTV SKINSTGTDG KETFNVDDVV MEVYNEAQNK THKLTLSHFW PVRRPRPIVE KLPGNVPLIT GLRVIDGLFP SVLGGTCAVP GAFGCGKTVI SQSLSKFSNS DAIVYVGCGE RGNEMAEVLC DFPELTLTRK DEDGSEREVG IMKRTTLVAN TSNMPVAARE ASIYTGITLA EYFRDQGMNV SMMADSTSRW AEALREISGR LGEMPADSGY PAYLGARLAA FYERAGRVSC LGSPTREGTV TVVGAVSPPG GDFSDPVTAA TLSIVQVFWG LDKKLAQRKH FPSLNWLISY TKYMQVLEPY FNNMDDQYSY LRNQARNILQ QEDNLSEIVQ LVGKESLSED QKVVMEVAKL IREDFLAQNA FTDYDFTCPL AKTVGMLKVI ITFYDLCQKA IADSPSDAKL TYAHIKTALA PVIQKVVDTK YVDPKAHAED INKSYSVVLD EMKREFQTLT DGI