Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_56599 |
Symbol | |
ID | 7203299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 354856 |
End bp | 357734 |
Gene Length | 2879 bp |
Protein Length | 899 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182519 |
Protein GI | 219124456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAACCAAAT CGCCTTTGAT CCCCATTTTT CTCAAGGAGC AAAACAAGCT ACCTACCTCT CCTTGTGAGA TCTTCCGGTG TGGTTCACTC TTGGAAGCTC ACAAATCGGA CGTGACCAGA ATGCGTGCGT TCTCCACACC GACTCGAACG CATCCTACAT CCGCCGTATC TTTCGTTTCC GTCGTTGTCG TCGTCGTGTC GATGATCCTG CGCGTTTCCA CCGCGTTTGT ACCGTCGTTG CGACGCACGG GCTATCCTGC CAGTGGACGT GTGCGTCCGT TTCTGGCTAC CACGTCGCAT CGGTCGACGA GGGAGGTTTC CGAAACGGCC GCACCCGTCC ACTACCCGTT CGCGAAAGTC GAATCCAAAT GGCAGGCCTA CTGGGACGAG AACGAAACCT TCCAAACACC CACGCGGGAT CTTTCCAAAC CCAAAAAGTA CGTACTCGAC ATGTTTCCCT ACCCATCCGG CGCCGGCCTG CACGTGGGTC ATCCCGAAGG GTACACCGCT TCGGACGTTA TGAGTCGCTA CTGGCGTATG AAGGGGTACG ATGTCCTCCA CCCAATCGGT TGGGACAGCT TTGGTCTGCC CGCGGAGCAG TTCGCGATTC AAACGGGGAC CAAGCCAGCG TCCACGACCA AAAAGAATAT TGCCAATTTC AAACGGCAAC TCAAGAGTTT GGGCTTTTCC TACGATTGGG ATCGCGAAAT TGCCACCACG GATCGGGGAT ACGTACAATG GACGCAGTGG ATCTTTTTGC AGCTGTTCCG GAAAGGCTTG GCGGAACAGT CGGAAGTTTC TGTGAATTGG TGTCCCGCCT TGGGTACGGT CCTCGCCAAT GAGGAAGTCA TCAACGGGCT TTCCGAACGC GGCGACCATC CCGTCGAACG CGTACCCTTG CGACAGTGGG TGCTTCGAAT TACCGACTAC GCCGATCGTC TAGAAGCTGG TTTGGAAGGA CTCGAGTGGC CAGCAGGAAC TATGACTGCG CAGAAACAGT GGATCGGCAA AAGTATCGGC TGCAATATCG ATTTTGGTGT CGACCAGTGG CCCGACGAAA CAATCTCCGT TTTTACCACA CGGGCCGATA CTCTCATGGG GGTCACGTAC GTTACACTCG CTCCCGAACA CCCATTGGTG GCAAGCCTCG CTACCGACGA GCAAAAAGAT GTGGTCAACG CATACGTCAA GACAACGTCG TCGCGTTCCG ATTTGGATCG TACATCCGCC AAAGAAAAGA CGGGTGTCTT TACCGGTGCC TACGCCATTC ACCCAATCTC TGGGGACAAG GTTCCCATTT GGATAGGGGA CTACGTATTG GGCTCCTACG GTACAGGCGC TGTCATGGCA GTCCCGGCAC ACGATGCTCG AGACTTTGAA TTTGCTCAAA AGTTTGGACT CGATATCAAA TGGGTTGTGG AACCAGCCTC GGGAGAACTC GTCTCGAAAG AACAAGCCTT CACGGAAGCG GGCAGGAACG TGAACAGCGG TGCGTTCGAC GGCTTGACCA CGGATGAAGC CAAAAAGGCC GTAACCAGTA AACTCGAGGA GCTGAATAAG GGAGGACCAC GAATAACCTA CAAACTGCGT GATTGGGTCT TTTCGCGTCA ACGGTACTGG GGTGAACCGA TTCCCATTTA CTTTCCGGTC GATTTTCCAG ACGGAATCGA TCCAACAACG CAGAATCCTG CAGATGACAA TTGCGAATAC ACAATCCGCT TTGATCAACC GATTCCAGTC GCTGAAGCTG ACTTGCCGTT GGAACTGCCG GAAATGGACG ACTTTCAACC CGGTGATGAC CCGGCTGGAT GTCTCGCTAG AGCCAAAGAC TGGCGATTCT TTGCAAAAGA CGGCAAGTGG TTTTCTCGTG AAACGAATAC AATGCCACAA TGGGCCGGAT CTTGTTGGTA CTATTTTCGT TTTATGGATC CAAAAAACTC CAAGGAAGCT TTTAGTAAAG TGGTAGACGA GGAATGGATG CCTGTGGACT TGTATGTGGG TGGGGCTGAA CACGCGGTTT TGCATTTACT GTACGCTCGT TTCTGGCATC AAGTTCTGTT TGACTTGGGC TACACAAAGC ATCCGGAGCC CTTTCAAAAG TTGGTGCACC AGGGAATGAT CCTTGGAAGT GACGGAGAGA AAATGTCCAA GAGTAGGGGG AATGTGGTGA ATCCCGATGA TATTGTCGGT GAACAAGGTG CCGATGCCTT GCGCTTGTAC GAAATGTTTA TGGGTCCGTT GGAAGCTGTC AAGCCATGGC AAACCAGTCA GGTAGCAGGT GTTGTCCGAT TTCAACGTAA AGTGTACGAC ACCGTCTCGA CTGCAGTCGC CAACGGCAAG ACCAAAATGG ACCAAGAAAC AGCACGACTA CTACACAAGA CGATGAAGAA GGTGACGCAA GACATTGAAT CCATGGCCTT CAATACTGCT ATTTCGGCGC TTATGGTACT GACAAACCAC CTTCAAAGCT TGAAAGCAAA TGTACCAGTA GAAGCGGCCG AAAAGCTAGC CTTGATGGTT TCGCCATTTG CTCCACACTT GGGCGAAGAA TGCTGGAGTA TGTTGGGTCA CGAGGGGTCG TTAGCCTACC AAGACTGGGT TGAGTTTGAC GAAACTTTGT GCGTAGACGA CACAGTCACC ATGGGTGTAC AGGTAAACGG CAAGGCCCGC GGTGAGATCA CCGTAGCTAA GGATGCTTCC CAGGACGTTG CCATGGCGGC GGCAAAAGAT GTGGAACGTG TACAAGCCCA ATTGGATGGT AAAGATATCA AGAAGATAAT TTACGTCCCC GGTCGTATTC TGAACATTGT TGCCAAATAA ACCGGAAAGT GGTTGCGCAC GAACGCTCGT TAATTGGACT CGCTTTACGG CGCGGCGTA
|
Protein sequence | MRAFSTPTRT HPTSAVSFVS VVVVVVSMIL RVSTAFVPSL RRTGYPASGR VRPFLATTSH RSTREVSETA APVHYPFAKV ESKWQAYWDE NETFQTPTRD LSKPKKYVLD MFPYPSGAGL HVGHPEGYTA SDVMSRYWRM KGYDVLHPIG WDSFGLPAEQ FAIQTGTKPA STTKKNIANF KRQLKSLGFS YDWDREIATT DRGYVQWTQW IFLQLFRKGL AEQSEVSVNW CPALGTVLAN EEVINGLSER GDHPVERVPL RQWVLRITDY ADRLEAGLEG LEWPAGTMTA QKQWIGKSIG CNIDFGVDQW PDETISVFTT RADTLMGVTY VTLAPEHPLV ASLATDEQKD VVNAYVKTTS SRSDLDRTSA KEKTGVFTGA YAIHPISGDK VPIWIGDYVL GSYGTGAVMA VPAHDARDFE FAQKFGLDIK WVVEPASGEL VSKEQAFTEA GRNVNSGAFD GLTTDEAKKA VTSKLEELNK GGPRITYKLR DWVFSRQRYW GEPIPIYFPV DFPDGIDPTT QNPADDNCEY TIRFDQPIPV AEADLPLELP EMDDFQPGDD PAGCLARAKD WRFFAKDGKW FSRETNTMPQ WAGSCWYYFR FMDPKNSKEA FSKVVDEEWM PVDLYVGGAE HAVLHLLYAR FWHQVLFDLG YTKHPEPFQK LVHQGMILGS DGEKMSKSRG NVVNPDDIVG EQGADALRLY EMFMGPLEAV KPWQTSQVAG VVRFQRKVYD TVSTAVANGK TKMDQETARL LHKTMKKVTQ DIESMAFNTA ISALMVLTNH LQSLKANVPV EAAEKLALMV SPFAPHLGEE CWSMLGHEGS LAYQDWVEFD ETLCVDDTVT MGVQVNGKAR GEITVAKDAS QDVAMAAAKD VERVQAQLDG KDIKKIIYVP GRILNIVAK
|
| |