Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39238 |
Symbol | |
ID | 7194690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 689551 |
End bp | 691326 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183268 |
Protein GI | 219126026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.975604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTCTGC TGTCTGCACT CGAACACAAC ACTGTTGTGA AAACTCTAGA TCTTTGGCTG CCTGGCAAAG ACACGGCTCC TTCTTCCAAT AAAGACAGCG ATATCAACTA TTCCATTCTG AGTCGGATAT TGGAGCAGAA CAAAACCCTT CAACGCCTCG TCGTCACGTC TTGGAGCAGC GATAGCGACT ATGCTGCATG GCAAGCTCTC GTAGAAGGTC TCCGCCGGAA TGTGTCAATA CAATCAGTAG AGCTTGGAGA TGCCGTCCAC GAAACAAGCA GTAGTACGGA CTACCAGGAC GTCGTGATGA TCCATTGCGA TGAGCAGGAG GATTCAAAGT TTGGAAATGA TGAGTTGGGG GATTTAGAAA TGATCATGGT CAGTGGAAAA CTAAAAGCAC TGACTCTGAA AGGACTCACA CTGAGTCCAA CTGCTGCAGA GCGCCTTGCC AATGGCATTT CGCAATCCTC TTCGCTTTTC ATGTTGGAAT GCATGCAAGT TTCAATGGAT CGGGGATCAT GGGGTACAGT CTTGGAATCT TGCAGAAATG GTACACTTTC TGAAATTCAC ATCGCTCAGT GTGACTTGGG GTTTTCAGCA ACGACAATGA TATCCTGTGC GCAGCCTTCT GACGAACACC CGCTTTCCGC CTTGTTACGG AAAAACTCAA GTCTTCGTGT CTTGCGGGTA ATTTCCAGCC AATTTGGAGT GGCTGAGCTA CGCGCTGTCA CCGATGCTGG CCAGCTCTTC TTGAAGCAAT TGGAGCTCCG TGACCACGAG TTGACGGGAT GCGGTGGGCT ACTAGCTCAC ATTGCTCAAC AAGCTAGTAG CCTGGAACAT TTGTCTCTGT CCGACACTGG CCTTGGAAAT GGCGATTTGA TAGAACTGTG TAGAGGGCTG TACCTACACG CTTCTCTGAG CTCGCTCAAT ATCCAAAACA ACAATTTCTG CTCAGTTCTA GCTGCCCAAA TGCTTGCGCA TACGATATCA TCTCTCACCA GAATCGAGTC TTTGGATATT TCTGAATGCC CTTGGGGTGA CGAAGGAGTT TCTGTCCTGA GGACAACGAT CGCCTCACAT ACTTCCTTGC GCAAACTCCA CTTGGCGGAC ATCAGAATGA CCGACTGCGG CTTTGTTGAC TTATGTAGCA GCTTGTTGAA CAATGCTTCA TTGGGCGTCC TGGACGTGAG TAGGAACAGA TTGGGAACAT CAGGTATGCA TGCCGTTGCA GACTTGCTAT CTCGATCTAA AACTACCATC TACGATTTGA ATTTGTCCAA TTGCCACCTC ACAGACCACG GAGTGGAAAC TTTGGGGCGC TCTTTGTCCA GCGCTAAATC GCTGGTCCGC TTGTCGCTGG CATCTAATTC GGCTGGTAAT GACGCTTGCC GTGCAATAGC AAGTTCGTTG CCCCATTCTT CTTTGGCCTG CTTGGAGCTC CAATTCAATC GCTTTGACGA AGAAGGCCTG GGCCATCTAG TGGAGGCACT ACAGCATAGT GTCGATTTGC ACGACTTATT TGTATGGAAT GCTTGTGTAT TTACTGGAGG AGTAGTGTCA AAACAACAGC AATCCAAACT ACAATTCCTG TCGGAAGCCA TGTTGCATTG GCTGAGACTC AACAGAGCTG GGCGAGGCGT GATCCGAAAC TACAGCAATC TTCTATGGGA GATCTTACCG ATCATTTTGC ACCGAGCTGG CCAGTTGTAC GGACCGGACG CATTGTTCCA CATGCTGCAA GCCCGTCCAG AGCTCGTGCT TCAAAGTAAG CGCTGA
|
Protein sequence | MCLLSALEHN TVVKTLDLWL PGKDTAPSSN KDSDINYSIL SRILEQNKTL QRLVVTSWSS DSDYAAWQAL VEGLRRNVSI QSVELGDAVH ETSSSTDYQD VVMIHCDEQE DSKFGNDELG DLEMIMVSGK LKALTLKGLT LSPTAAERLA NGISQSSSLF MLECMQVSMD RGSWGTVLES CRNGTLSEIH IAQCDLGFSA TTMISCAQPS DEHPLSALLR KNSSLRVLRV ISSQFGVAEL RAVTDAGQLF LKQLELRDHE LTGCGGLLAH IAQQASSLEH LSLSDTGLGN GDLIELCRGL YLHASLSSLN IQNNNFCSVL AAQMLAHTIS SLTRIESLDI SECPWGDEGV SVLRTTIASH TSLRKLHLAD IRMTDCGFVD LCSSLLNNAS LGVLDVSRNR LGTSGMHAVA DLLSRSKTTI YDLNLSNCHL TDHGVETLGR SLSSAKSLVR LSLASNSAGN DACRAIASSL PHSSLACLEL QFNRFDEEGL GHLVEALQHS VDLHDLFVWN ACVFTGGVVS KQQQSKLQFL SEAMLHWLRL NRAGRGVIRN YSNLLWEILP IILHRAGQLY GPDALFHMLQ ARPELVLQSK R
|
| |