Gene Information |
Locus tag | PHATRDRAFT_54983 |
Symbol | |
ID | 7195287 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 363350 |
End bp | 366343 |
Gene Length | 2994 bp |
Protein Length | 891 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183599 |
Protein GI | 219126721 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAAAAACG TTCTACTTTG TATTTTGGTC GGGTTTCGGA TCCTTCCAGC AACCATTCTC ATTCAAAGTC ACCACTTGTG CGAACGATGG TACCGAAACC TGAAGATCCC ACAGTCAAGG CAGAGAATAA TGCGGCGATG GATCAACTTA GTCTCCTCGA CAAAGATGAT ATATCGTCGG CTTCTCGCTC GTGCCGAGAA CTCTACGGTA GGTCGTCATC GCAGTGCCGG ACTGGACTAG GCTGGACTCC CTGCAAGATT CAATCGGAAC CGACGAAACA CGTGACTCAC AGTTATTGCA TCATCCTTGA CGCTCCGTAG GACCTTACCC CAAAGCTATT CCTGTGCCGT TCTTGAATTC TCGTAACGAA GCTCGCGAAG GTGACACTCC CGCCGCCAGC GTCATCGCGC AAGCCAAAAC CATCTTTGAC GTACCGGCGG ACTATCGTGA CGTGGGAACA CCGGATGAAT GGGTTCCCCG CGATGGACGC CTCGTGCGTC TGACGGGTAA GCATCCCTTC AACGTCGAAC CACCGCTGGC GATTCTGAAG CAGCATCGAT TTATTACGCC GTCCTCGTTG CATTACGTAC GCAACCACGG AGCGTGCCCG AAGCTGTCTT GGAAAGAACA CACTGTTTGT GTGGGAGGAA AACTGGTACC GAATGCCTTG GAGCTCTCGA TGGACGAAAT CGTAGCGATG GAACCGCGAG AGCTGCCCGT CACGTTGGTC TGTGCCGGAA ATCGTCGGAA GGAACAAAAC ATGATCCGTC AAACAATCGG CTTCAACTGG GGCCCGAGCG GCGTCTCAAC CAGCGTTTGG AAGGGAGTGC TCCTACGCGA TTTGTTGCTC CGCGCAGGGG TTTCGGAAAA GAACATGGCA GGGAAGCACG TCGAATTTAT TGGTGTCGAA GACTTGCCGA ACAAGGTGGG ACCCGGGCCG TTCCAGGAGG AACCATGGGG CAAACTTGTC AAGTACGGAA CCAGTGTCCC GCTCGCTCGG GCTATGAATC CAGCGTACGA CATCCTCATT GCCTATGAGC AGAACGGCGA AGTCTTGCAG CCCGATCACG GCTACCCCGT CCGTCTCATC ATTCCTGGTT ATATTGGAGG ACGGATGATT AAATGGCTTA AATACATCAA CGTGATTCCG CACGAAACCA AGAATCACTA TCATTACCAC GACAATCGCA TTTTACCGCC CCACATCACT GCAGAGGAAT CCTTACAGGG AGGTTGGTGG TACAAACCGG AGTACATTTT CAATGAACTC AACATCAATT CGGCCATCGC TGCTCCTGAT CACAATGAAA CGCTTTCGAT CGCCAAGAAT ATTGCCAAGA CGTATGACGT TACGGGTTAC GCATATACTG GTGGTGGTCG TCTCATCACC AGGGTCGAAA TTTCAGTTGA TGGCGGTATC CATTGGGAAC TTGCCAAACT TGAACGCAAG GAGCAGCCAA CGGACTACGG AATGTACTGG TGCTGGACTT GGTGGAACTA CGAAGTAAAG GTGGCCGACT TGGTGGGAGC CAAGGAAATT ATATGCCGCG CCTGGGATGA GTCCAACAAC CCTCAGCCAG TTGTTCCAAC ATGGAATCTG ATGGGTATGG GAAATAATCA AGCCTTTCGT GTCAAGGTAC ACATGGACAA GACAGCTAGC GGCGAGCATG TGTTTCGGTT TGAGCATCCA ACTCAGCCTG GTCAACAAAC TGGTGGGTGG ATGACAAAGG TCGCCACCAA GCCTGAGTCG GCTGGGTTCG GACGGTTGCT GGAAGTGCAG GGTGAGTCCA AAGAAGACGC 
GGCCCCGGCT CCACCTCCGA AGGAAAATAC CAAAATTTTC ACGATGGAAG AGATTGAAAA GCACAACACT GAAGAAGACT GTTGGATTGT GGTGAAGGAT CGTGTCTACG ACTGTACCGA GTATCTAGAG CTGCACCCTG GCGGCATTGA CTCGATTGTT ATCAACGGCG GCGCAGATTC CACGGAAGAC TTTGTGGCAA TCCACTCTAC CAAGGCTACA AAGATGCTCG AGAAGTACTA CATTGGCCAG CTCGACAAAA GTAGTGTGGC CGAGGAGAAA AAACAAGAAG ACGAACCTCT CGTCGATGCC GATGGCAATG CTCTTGCCTT GAACCCAAAG AAGAAGACGC CATTTCGTCT ACAAAACAAA ATCACACTTA GTCGAGACAG CTACCTATTG GATTTTGCTT TGCCAAGCCC AAAGCATGTT TTGGGGCTAC CCACGGGAAA GCACATGTTT ATTTCGGCCC TCATTAATGG AGAGATGGTA CTCCGCCGCT ACACTCCTAT CTCATCCAAT TACGACATTG GATGTGTAAA GTTTGTTGTC AAGGCATACC GTCCGTGTGA ACGCTTTCCA GACGGTGGCA AGATGAGCCA ATACCTAGAC CAGATCAATG TTGGCGACTA TGTTGATATG CGCGGACCAG TTGGGGAATT TGAGTACTCG GCCAACGGCA GTTTTACAAT CGACGCCGAA CCTTGTTTTG CCACCAGGTT CAACATGCTT GCTGGGGGGA CCGGCATAAC GCCCGTAATG CAGATTGCTG CGGAAATTTT GCGAAACCCA CAAGACCCTA CACAAATGTC CCTTATTTTT GCATGCCGCG AGGAAGGCGA TCTCTTGATG CGAAGCACTT TGGACGAATG GGCTGCTAAC TTTCCTCACA AGTTCAAGAT TCACTACATC CTATCTGACA GCTGGTCTTC CGACTGGAAG TATTCCACAG GATTCGTAGA CAAAGCGCTA TTTTCCGAGT ACTTGTACGA AGCAGGCGAT GATGTTTACA GCCTCATGTG CGGCCCACCA ATTATGTTAG AGAAAGGCTG CCGTCCAAAC TTGGAGAGCC TTGGTCACAA AAAGGACAAA ATTTTTTCCT TTTAAAAGTT CTTGACTGAT TGTCATATCA ATTTTGCACT TTACAATACA TTTTCAATAG CAATTTACTT TAAGACTAGC GCAATTTTTT TCTT
|
Protein sequence | MVPKPEDPTV KAENNAAMDQ LSLLDKDDIS SASRSCRELY GPYPKAIPVP FLNSRNEARE GDTPAASVIA QAKTIFDVPA DYRDVGTPDE WVPRDGRLVR LTGKHPFNVE PPLAILKQHR FITPSSLHYV RNHGACPKLS WKEHTVCVGG KLVPNALELS MDEIVAMEPR ELPVTLVCAG NRRKEQNMIR QTIGFNWGPS GVSTSVWKGV LLRDLLLRAG VSEKNMAGKH VEFIGVEDLP NKVGPGPFQE EPWGKLVKYG TSVPLARAMN PAYDILIAYE QNGEVLQPDH GYPVRLIIPG YIGGRMIKWL KYINVIPHET KNHYHYHDNR ILPGGWWYKP EYIFNELNIN SAIAAPDHNE TLSIAKNIAK TYDVTGYAYT GGGRLITRVE ISVDGGIHWE LAKLERKEQP TDYGMYWCWT WWNYEVKVAD LVGAKEIICR AWDESNNPQP VVPTWNLMGM GNNQAFRVKV HMDKTASGEH VFRFEHPTQP GQQTGGWMTK VATKPESAGF GRLLEVQGES KEDAAPAPPP KENTKIFTME EIEKHNTEED CWIVVKDRVY DCTEYLELHP GGIDSIVING GADSTEDFVA IHSTKATKML EKYYIGQLDK SSVAEEKKQE DEPLVDADGN ALALNPKKKT PFRLQNKITL SRDSYLLDFA LPSPKHVLGL PTGKHMFISA LINGEMVLRR YTPISSNYDI GCVKFVVKAY RPCERFPDGG KMSQYLDQIN VGDYVDMRGP VGEFEYSANG SFTIDAEPCF ATRFNMLAGG TGITPVMQIA AEILRNPQDP TQMSLIFACR EEGDLLMRST LDEWAANFPH KFKIHYILSD SWSSDWKYST GFVDKALFSE YLYEAGDDVY SLMCGPPIML EKGCRPNLES LGHKKDKIFS F
|
| |