Gene Information |
Locus tag | PHATRDRAFT_20026 |
Symbol | |
ID | 7200629 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 563365 |
End bp | 565988 |
Gene Length | 2624 bp |
Protein Length | 646 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179870 |
Protein GI | 219118180 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GCTACACTAG AAGATCTTTT GACAACTATC TATCCAGACT TGCACTTGAT TTGTATCGCG TTTGTTTTTT TACTCGCTGC CGCTTCTGCG CAAATTTGTA TTCCACGCTA TTTGGGCCGT ATATTGGATG CCTTGGCTGC AGCTTTTCCA AACGCAGATG ACAATGACTC GCGGCACGAA TCCATGTGGG AAGTCCCGAA TTTTATCAAG TACGTCAAAC TGCTTGTGCT TGCGTCGGTT TTGGCAGGAG TGTTTTCTGG TCTACGGGGC TCTGTATTTA CTGTGGTAGG CGGACGAGTG AATGTGAGAC TGCGAATCCA GTTGATGGAC TCGTTGCTTT CTCAAGATAT AGGCTTTTTT GATACTACCA AGACTGGTGA CATAACTAGC CGGTTGAGCA GTGACACAAC TCTAGTTGGC GATCAAGTGA CTCTGAATGT AAACGTATTT TTGAGATCGC TTGTTCAAGG TATGTGCCAC CAATACTATC GTTTCTGTTT TTAAAAGACT GTCGACCTGA AATCCATACA TTTCTCGTTT CCAGCTTTCG GGGTATTGCT TTGCATGTTT CTTATTTCCT GGCAGCTGTC TATACTTGCA TTTATTTCGG TGCCGTTGAT TACTATTCTT TCCAAATGGT ATGGCAACTA CGTTCGATCA TTGACAAAGC TGATGCAGAA GAAACTTGCG GACGGAAATA GCGTCAGCGA GGCTGCCTTT GGATCGATGC CCACCGTACG GTCATTCGAT GCTGCTGAAG CTGAACTAAA GGAATTTGAG CAGCACATGG GCAAGTATCT TGCTTTGAAC AAGCGATCAG CAGTGGCTTA CTGTGGGTAT GCGGCATTTA CTACGGCTGT CCCGCAGTTG GTATTTGCGG TTGTTGGTAT GTCAGATAGA ATGCCCGTAT ATGTTTACAT CTCATTAAAA TCTGTTCTCA CTCATAGAAT GTCTACTTCA CACAAAGTTT TTTATGGAGG AATGCTTGTA CGAAATAACG ATATATCGAT TGGTGATCTT GTTAAGTTCT TGCTCTATTT ACAAGCTCTA TCGGATGCTT TTTCAAGTAT CGGATACATT TTCAGTTCGC TTACCCAGGC AGTTGGTGCC GCCGATAAAG TCTTCGAGCT TCTGCATAGA ACGCCCCGAT ACCGCGAGTC CTCCGCACAA CGCGAGGCTG TGCGGGACCG TAATATGTAC CGCGGCATGC TCGGGATTGA AGCGGTCAAA ACTCGCATGC AGCGTACAAA GGGTATTCGA CTCAGTAAGC CTCGTGGAGA AATCCAGTTT GAAAATGTGG ACCTGTACTA CCCAGCACGA CCCCAGCGTC AAGTTCTGAA TAAGCTCTCG CTTAAAGTTG AGCCAGGTTC CATTATCGCG TTGGTTGGTC AGAGTGGTGG CGGTAAAAGC AGCGTCATGT CACTGATTCA ACATCTGTAT GAGCAAAGCG GAGGAAGAGT CCTTTTGGAC GGTCAAGACG TTCACGAAAT AAGTCCGGAA TGCTTGAGTA GAACAATTTC GATTGTTTCT CAAGAACCTA CTTTGTTTGC CCGAAGTATT AAAAGAAATA TCATGTACGG CCTCGAAGGA ACGGACATGG AACCAACGCA CGAAGAGATC GTCGAAGCAG CGAGACTGGC CAATGCAGAG TCTTTTATTG AAACACTGCC CCAAAAGTGA GTAGAACATA AACAATCGCT ATCGGATGAC AGCCACAAAT TTGCTATCTG TGCTGACATT ATCTTCAATT AAGGTACGAC ACGGAGGTCG GAGAACGTGG TGTGCAGTTG AGTGGGGGCC 
AACGTCAACG GCTTGCAATC GCACGCGCCC TCGTAAGAAA ACCAAAAGTG TTGCTATTGG ATGAAGCAAC CTCTGCCCTT GATGCAGAAA GCGAACATTT GGTACAGAGC GCAATTGACG ATATGCTCGC AAGAGGGAAA AACGAAAAGG GGGAGGGCGC AATGACTGTC CTTATTGTCG CTCACAGATT ATCAACCGTA CGAAATGCGG ATACTATTTT TGTTATTCAG AACGGCCAGG TCGTGGAAGA AGGCAATCAT GGTCAGCTTT TGGAAAACCC AACTGGTGCG TACAGCTCAT TGATTCGGCG ACAAATGCAG GCACAGGAAA AGCTAGATAT CGCAAAGACC CCGGCCGATC CCAATAACAA GAGTGCCTCG AGACCTGTCA ACCCTTCCGT GAAACACTTG AACAGGAAGC TCGCTCCGAA GCATGATGGC GTATCAAGCC AGCGATTGCC GACTAGATCT CCAACGGCGC TGAAGACAAA TTTGGTGACG GACGAAACTC TAAGTCGACA GCCTGCAAAC ACTCCATCAC GTAAGGGTTC GTCAGTCTCA TCGGCCAGAA TCACGCGTCC AAAATCGAAA AATCCAAAAG GTGCAAACAC GGAATCTTCA CGGAAAAATG TCAATGGTTC TTCGTGAGAG GAGAGCAAGA GCTAAGCCCC AAGCAACATG GTCGACTCTG TAGGCGAATT GAGGCACTCT GGCTGGCAGT TTGGAGGTGG AAAGGTTGGG TCCATTTTGT TTCCCCAATC ATGAAATACG TAGCTGTATA TAAGAAATAC CTGCATGAGC GTGTAGTACA AAGGGAGCAA TGTC
|
Protein sequence | MWEVPNFIKY VKLLVLASVL AGVFSGLRGS VFTVVGGRVN VRLRIQLMDS LLSQDIGFFD TTKTGDITSR LSSDTTLVGD QVTLNVNVFL RSLVQAFGVL LCMFLISWQL SILAFISVPL ITILSKWYGN YVRSLTKLMQ KKLADGNSVS EAAFGSMPTV RSFDAAEAEL KEFEQHMGKY LALNKRSAVA YCGYAAFTTA VPQLVFAVVV FYGGMLVRNN DISIGDLVKF LLYLQALSDA FSSIGYIFSS LTQAVGAADK VFELLHRTPR YRESSAQREA PRGEIQFENV DLYYPARPQR QVLNKLSLKV EPGSIIALVG QSGGGKSSVM SLIQHLYEQS GGRVLLDGQD VHEISPECLS RTISIVSQEP TLFARSIKRN IMYGLEGTDM EPTHEEIVEA ARLANAESFI ETLPQKYDTE VGERGVQLSG GQRQRLAIAR ALVRKPKVLL LDEATSALDA ESEHLVQSAI DDMLARGKNE KGEGAMTVLI VAHRLSTVRN ADTIFVIQNG QVVEEGNHGQ LLENPTGAYS SLIRRQMQAQ EKLDIAKTPA DPNNKSASRP VNPSVKHLNR KLAPKHDGVS SQRLPTRSPT ALKTNLVTDE TLSRQPANTP SRKGSSVSSA RITRPKSKNP KGANTESSRK NVNGSS
|
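The Gene Length reported in this record is consistent with the Start bp and End bp fields if the coordinates are treated as 1-based and inclusive. The sketch below checks that arithmetic and compares it with the minimum coding length implied by the Protein Length. The inclusive-coordinate convention and the helper names `gene_length` and `min_cds_length` are assumptions made for illustration; they are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
length = gene_length(563365, 565988)
cds = min_cds_length(646)

print(length)        # 2624, matching the Gene Length field
print(cds)           # 1941
print(length - cds)  # 683 bases not accounted for by codons
```

The 683 bases beyond the coding requirement are consistent with the record flagging the gene as spliced, although the record itself does not say how they divide between introns and untranslated regions.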