Gene Information |
Locus tag | PHATRDRAFT_47159 |
Symbol | |
ID | 7202056 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 644991 |
End bp | 650728 |
Gene Length | 5738 bp |
Protein Length | 1633 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181416 |
Protein GI | 219122152 |
COG category | |
COG ID | |
TIGRFAM ID | |
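The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions (a convention that matches the listed Gene Length but is not stated in the record) and that the coding sequence ends with a single stop codon. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode a protein, assuming one stop codon by default."""
    return 3 * (protein_aa + stop_codons)


# Values taken from the Gene Information fields of this record.
gene_len = span_length(644991, 650728)
cds_len = coding_length(1633)

# Bases in the gene span that are not accounted for by the coding sequence.
non_coding = gene_len - cds_len

print(f"gene span: {gene_len} bp")
print(f"coding sequence (with stop codon): {cds_len} bp")
print(f"remaining bases: {non_coding} bp")
```

The span length agrees with the Gene Length field. The bases left over after subtracting the coding sequence would correspond to introns and untranslated sequence, which is consistent with the gene being marked as spliced.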
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0929677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CACGAAGTAA CAACCTGCCA GGGTAAATTC GACGTAAGCC CCTTGTACGA TGCTTCGTTT CCTGTGTTTC GCCATGATGG CCTCGACGGC GCACGTTGCG GCTCAAAACA CGGAATGCGC CATGGCGACT ACCATTACGT CACTTCCGGC TACTCTCGAT GGCGACCTCG CCATTGCGCC ACCGATCACG CATTTGAGAA CCGACAGCTG TGCTATAGCT TCCACCGGCC CTGTTCCTGG TAATTGGTAT ACATTCACGG CCGAAAGCGA TGGCTGTCTC ACTGCCACCG TCGAGTCCAC AAGCCAACCT TTTTTCGATA CCATTCTCAC AGCCTACGTA GATGGTTGCG ACGCATTATC TTGTGCAGCA ATGAACGATG ACATTAGCAT CTTTCGGGTA GACGTAAGCC AATTGAATAT CAATGTCGAG GCCCAGACGA CCTATCACTT CTTTGTTCGT GGACTTTTTC CGCAAGAGTT TGGGTCTTTT ACCTTCAACA TCACGGTAAG CAGACTTTTC ACATCATACT TTGTATCGCT TTTGAGCCAA TTTCATTCAC GGTTCTTTTC ATTCGCAGCA AGCGGACGGT ACTTGCGAGA GACCTACCGG TAATACTTGG TGTTCTGCTT GCCCCAACGG CGGAGTCGCG GATCCGTCTG CCGTCTTTGA TATCGAAGAA AACGTGCGCT GTAACGACAT GGGCGAGTAT TTTGAACTCG TCGAGGGATC AGAAATTTGC ATTAGTCAGC AAATCGTCGG TTCTGTGGTG TGTGGATGCC AGCCAGTGAA CGAAGTGACT TGTCCGCTGT GCCCTGGAGG CGAAGATGTG CCAGACCCGG ATCTTCCTGC CGGGACGCTG TCCGGTGGCA CCACGTGTGG TGACCTCAAT GTTGTGTCTG GCGTGGATAG TTGTGGGGCG TACAAATCCG GCCTGGCAAA TCAATGCCAA TGCCCCGGCT CCCCGGAACC TTGCATTATG TGTCCCGGTA GCACTACCTA CGACCCCGCT GTCGTGATCC TAAACGAAGA AGATTCTCTC ACGTGCGGGC AACTTGCGAC GACGTTTCAG GAACTTCAGG TGCTGTTTCC GGTTACTGTC TTTTGTGAGC CCTTGTTGAT GGACTACTTG ACGGGACTGA ACGTGGTGGA TTTTTGCTGC AACGCAGCGG AGCTCGTGTC TGCGGTAGAC GCAACATTGC CTCCGTCACC ATTGGTGTTG ACGGACGCTC CCTCGCCCCG TCCCGAGACA GATGCGCCCG TTACCCCGAC TCCGGTGTCC ACTCCGGCTC CAGTCGGCGA GACTTCCCCC ATTGACACTC CCGCGACGGC TCCCATTTCC TTCGGATCCC CGGCTTCCAA CGGAGTGGTC CGTTCGGTTT CCTTTGTGAC AGCGCTGGGA CTTTTGGTCC ATAGCTATCT AGAAATGTAG ATGGCGCTCG GTAACAGTAG ACTGTGAGTA ACTAGCTAGA GCCTTTGCAA TTCGTCACAG TCCGATTTTC CTTGGATGTT TGCTACAATA GTCCCGCGAC TGGTTTCCAG TGAATTAGAC ACTGTGACCA AAGCGCAAGG CGTAACAGTC ACTGTCAACG ATCCGTTGTG CAGACGTCGC CGGTGTTTGT TTGTGATGGA CGCTACCACG CCACGGTCCC GCCCCTGGGG CTGACGAGGG CTTGCCCGTC CGCTTCGACC ATCCATCGCT CGCTAGAAAT GGATGGATCT AGTTGGGTGC GTAGTGTTCG TAAGAGTACC ATGGACGGGA GGATTGTGTC TGCTTCGGGC CGTGGACGTG CCAAGTCCCT 
TCCCTGTGAG TGCCGATCGA TGCGTACGAT TTCTGTTGCT CAACGCTGGA ACGTAGAATC TGTCGCGAGT CACCCTATAT AGTACAGGGT TCTACCGTGC ACTACCGTCC ACCGTGCCTG GAAACGCACG ACACGAATCC GCAGCACACG CGACAATCGA CGGATTCCGG TATTTATGGT ACATTCACAG TCAGTGTGAC TCTTCGTATT CTCATTGTTA CCAGTTTGGC AGTTGAGTTT TGGGGGGTTT TGGTTTGGGA AACTCCGTCC CGTCGGACCA CTTTCGGGAA CTCCGGATCC GTGCAGACAT CCCCGCATTG CGGACAGTAC AGACTACTCA TACACACACA CACTCTCTCT CACACACACA CCGAGACAAT TGTCGAGTGA AAGAGCTAGT GTAGGCTTTT TCGGTCTCCA AGAGCGGAGC GTGAGTCGAC GTGTACGGTA GTGTGCGATA CAAAGAAAAA AGCCTTTGGT AGAGCACCCT TGTCGTATCG CGTCGTACCG TCCCACAGTC CGTGAAAAGT CTTCCCGGCA GTAAACTTGC GTCTACTTTG ACGGAGCCGC AGTCCTCTTC CGACCGACCG ACCGAGGTAC CGACTTTTTG GCCAACACTT TCATATCCAA AAAATGACAA CGGAATTCGA TTCGAACACG AACGACAATG ACGACGATGA CATGGCGGAT CTCTTTTCCT TCGATTCGTC CGACGTGCCC TTGGCGTCCC CCCACGACAT TGTTGAGACC GGTATTGGCG CTGCCGACAC GGCGACGGAC ACCGAACGTC CGCGCAAAGC GTCCAGTGAT TCCTTCCTTG CACTCCTGGA GAGTGCTACG GAAGGGACCA CCAACGCAAC CACCGCCGGT GCCCTGGGAC GCCTAAATGG GGATTTGGGG GACCACGACG TGGAAACCCA AAATATTCTC GACTGGCTCG ACGAAGACGA CGTCAATGCG GTCCCCGACG ATGACAACAA CACACTCAAT ATTAACCAAA ACACCAGTAG TGCCAGCGGA AGCGATCCCA CAGCCGCGAG TATCCTGGCG GGTCTCCCCA CGACCGAAAC ACCGACGTCG ATCGACGCTT CGTTGTCTAA AACCCCGTCC AACGAATCAT CCTTGACCGT CACTCCGGTG AAGACCGCGG AACCAACGGT CGTCGCCTTG CCACCGGTCT TTGCGACGCT TCGGGAAGCC CTGGAAAGTC CCCAAGCAAC CGTTGGGCAA TTACGTCACT TGTACGCCAC TGCCCATCCG GTCGTCGACG CCGATCTGCG CGCCGACTTG TACTGTCGTA TGGTCTGTGG CAAATCGCTC GCCGAGACAC AGTCCAGCAG TTTGGCCGAT TCCTTCCAAC ACTGGCCATT GCCGGAACCG GGCGCGTCCA ACGCACCCGA CACGATGCTC CAAGCCTTGC CGGAACTGCC CACTTTGGCA TTGCGCGTCG CCACCGAAAC GCACCGCGAG GTGTTGGACT GTCAAGACGA TTTGACCAAA CTCCTGGCCT ATCATTGGCA ACAAAACAGC ACCGCCGCGC CGTCGGAAGC CGATTTACTC GTGCCCGCCG TGGCGGCGGT AATCCTATCC ACCAACATGC CCGTGGCCGC CGCGAGTGTT GTTCTGGCCC AGCTCCTGCC AGCCTTTACA CCCGTACTCG CCCTGGAACC GCCGGAGCGG TGGGAGGCTG CGCTTTCTCT ACACTCCGAA CTTTACCTAC TGGTCTGTTA CCACCTGCCA CTGCTGGTCT ATCATTTGGA TCAGTACGCA CCCGGATGGC ACTGGCCCAA GCTCCCGGCC TCGGTTCGCC 
AAAAACAGGA CGCCGGGGAG ACCAGCACCC ATCTGGCTCG AAATTTGACC CGGCACGGAC GGATTCCGCC TTCCTGGACA CTCAGTTTGG CGGCGGGAGA ATGCGAACAC GTGGCTTCAA TCTTGCCGAC GGGATGGATC TTGCAATTGT GGGACGGAAT CTTGATGGAC GCTTCACAGC AACATCAGGC AACTCCTTTT TTCTGGACGG TGGCCGTCTT TGAACAAGCG GCCGATCAAC TCTTGCTCCT GACGGGCGAG GAGCTGTTGG CCGCCTTGGA CAAAATCTTT CAGTTGGAAG ACAAGCGCAG TACGGACGAT TGGATGGATG AGTGGCGCAT GCGAGTTCAC AGCTTGCAGC GCTCGACGCC AGAATCTGTC CTGCAAACGT TACGTCGAGC CGAAGACGAG ACCGTCCAAG CGTCAATTCG TCGACGACAA GAACGCGCCG AGGCCGCGAT CAAGGCGCGG ATGGAAGCGG AAGCAATCGC GCATCTGGAA ACGCAGGAAC GCAAGGCGGA GGAAGCCCGG GCACGCTTGA CACGGGCCCG CTTGGTGGCT TACTACCGCA AACACGCTCC GGACAAGGAA GACAACATTG ATCAGATTCT AAAAGTTTAC GCGGGTCGAC TGGATGTTTT GGACGGGAAG TTATTGAAAA AGTACGGTGA ATCGTTCAAC CCTGCCTTGA AACCGAAACA ACCCAAACCG GTGAACAAGA TTGCTGCCAA CCTTCTCATG CAAACCATGA ACCAAGGACT CGGCCGACGG CCACAAGTTG CTTTGGAAGA TGTTTCGGGA GTCGCCGCCG GTCGCCATGC CGACAAAGTT TCGGTGTTGG TGACGGCTGA CGAAGTCTTA TCCGATCTCT GTTGGAGTAA AGAAGCCGCA GTTCACCGCG GAGAACTCCG CCGCAATCGA ACGGAAGGTC GCAAGCATTT GAAGTTTTAT CTTGTAGACA GTCGCGCCGA AGAAGCTGCT TTAGAGCAAG GCCGGTTTCC GACCGCTGTG AGTCTCAGCC CCGAAGCAAT GCTAGATCCG GAACGAATTC AACTGAACGA AGAAACGTTC GAATCGCTAC GTGGCGCGGT ACACATTGTC ATTATGGGAG AAGGCTTTTC GGCCATACCC AAACTGTACG GACAAAAGCT ATCTCCCAAA CTCGAGGAGC TGATGCAGCA AGACGAGTCG CGTACCGACA TTTGCGCTCT ATTCTTTGTC AAGAAGGGCT TTCCCTTTGT TTCCGTTTTG GACGGAGGAT TCGCCGCAGC GCACTCGTGG CTCGTTCGCG AAGGTCCGTC GTGTCACCTG AAAGCTGCTG CGGTTCTCGT GGACTACAAT TCGGAAATGT CATTGTTTGG TCAAATGGAG ACGCTCCACA ATGCTTCAGC AGCCGAAAAG GCACAACGGA AAATGCAAAA CTTGCTGGAA AAGTCACTGG TGTCCATGAC GCGTCGGGCC CAGCAATTCG AAAAACTCGC CAACGAACGG GACTCTCGAG AAGGTCGGAA GAACGTAGGG CTGCAATTTT TCAAAAGCAA GCAAGTAGCC GAAAATGCGA ACGAAGCCAC GAATGAGTTG CAACCAGCGG ATTCACAGTC AATCGCGGAT GAAAAGAAAC TGGCCTTTAA GAACCCATTT AAAGGCGTAG GTAGAGCCTT GGATTGGACT AGATCGCCGG ACGATTCAGC ACCTGCACCC GAGGCTCCGG CTCCCATCGA TAACGGAGCT TTTGAGGCAG CCGAGCAAAC GACGTCCGCT TTCAAAAATC CATTCGCCGG CGTAGGAATG GGTCACGCGC CTAGTACATC 
TGACAACACG GCTGAAACGA CAGGTGCCTC TGGAACAAAG ACCGCCAATA CCACAGCCAC CGACGACTCG AGCAGCAAGG CTAGTGTTCC GTTGATAAAA CGTAATCCGT TTGCTCGCTT TGGTAACAAA GAGAGCAATC CAGCAGGTGC TTCTACAGAC AGGAAGGGCG GCTTTGATTT TACCAATTTT CGCAAGAATG CGACAGCTCG ATTGCTTTCG CGTGACGAAT TCGATGCCGC TTCGGTGGAA GAGTCTATTT CGTTTGATTA GAATTAGTAT TAGCAACCTG ATTAGTTAAC ATCAGCTCTC AAGAGGCA
|
Protein sequence | MLRFLCFAMM ASTAHVAAQN TECAMATTIT SLPATLDGDL AIAPPITHLR TDSCAIASTG PVPGNWYTFT AESDGCLTAT VESTSQPFFD TILTAYVDGC DALSCAAMND DISIFRVDVS QLNINVEAQT TYHFFVRGLF PQEFGSFTFN ITQADGTCER PTGNTWCSAC PNGGVADPSA VFDIEENVRC NDMGEYFELV EGSEICISQQ IVGSVVCGCQ PVNEVTCPLC PGGEDVPDPD LPAGTLSGGT TCGDLNVVSG VDSCGAYKSG LANQCQCPGS PEPCIMCPGS TTYDPAVVIL NEEDSLTCGQ LATTFQELQV LFPVTVFCEP LLMDYLTGLN VVDFCCNAAE LVSAVDATLP PSPLVLTDAP SPRPETDAPV TPTPVSTPAP VGETSPIDTP ATAPISFGSP ASNGVTSPVF VCDGRYHATV PPLGLTRACP SASTIHRSLE MDGSSWVRSV RKSTMDGRIV SASGRGRAKS LPFSVTLRIL IVTSLAVEFW GVLVWETPSR RTTFGNSGSV QTSPHCGHPL PTDRPRYRLF GQHFHIQKMT TEFDSNTNDN DDDDMADLFS FDSSDVPLAS PHDIVETGIG AADTATDTER PRKASSDSFL ALLESATEGT TNATTAGALG RLNGDLGDHD VETQNILDWL DEDDVNAVPD DDNNTLNINQ NTSSASGSDP TAASILAGLP TTETPTSIDA SLSKTPSNES SLTVTPVKTA EPTVVALPPV FATLREALES PQATVGQLRH LYATAHPVVD ADLRADLYCR MVCGKSLAET QSSSLADSFQ HWPLPEPGAS NAPDTMLQAL PELPTLALRV ATETHREVLD CQDDLTKLLA YHWQQNSTAA PSEADLLVPA VAAVILSTNM PVAAASVVLA QLLPAFTPVL ALEPPERWEA ALSLHSELYL LVCYHLPLLV YHLDQYAPGW HWPKLPASVR QKQDAGETST HLARNLTRHG RIPPSWTLSL AAGECEHVAS ILPTGWILQL WDGILMDASQ QHQATPFFWT VAVFEQAADQ LLLLTGEELL AALDKIFQLE DKRSTDDWMD EWRMRVHSLQ RSTPESVLQT LRRAEDETVQ ASIRRRQERA EAAIKARMEA EAIAHLETQE RKAEEARARL TRARLVAYYR KHAPDKEDNI DQILKVYAGR LDVLDGKLLK KYGESFNPAL KPKQPKPVNK IAANLLMQTM NQGLGRRPQV ALEDVSGVAA GRHADKVSVL VTADEVLSDL CWSKEAAVHR GELRRNRTEG RKHLKFYLVD SRAEEAALEQ GRFPTAVSLS PEAMLDPERI QLNEETFESL RGAVHIVIMG EGFSAIPKLY GQKLSPKLEE LMQQDESRTD ICALFFVKKG FPFVSVLDGG FAAAHSWLVR EGPSCHLKAA AVLVDYNSEM SLFGQMETLH NASAAEKAQR KMQNLLEKSL VSMTRRAQQF EKLANERDSR EGRKNVGLQF FKSKQVAENA NEATNELQPA DSQSIADEKK LAFKNPFKGV GRALDWTRSP DDSAPAPEAP APIDNGAFEA AEQTTSAFKN PFAGVGMGHA PSTSDNTAET TGASGTKTAN TTATDDSSSK ASVPLIKRNP FARFGNKESN PAGASTDRKG GFDFTNFRKN ATARLLSRDE FDAASVEESI SFD
|
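The sequences in this record are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such a field could be cleaned and checked against the Gene Length and GC content fields. The helper names are hypothetical, and the input shown is only an illustrative fragment (the first three blocks of the gene sequence), not the full entry.

```python
def clean_sequence(field: str) -> str:
    """Remove the block spacing and line breaks from a sequence field."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustrative fragment: the first three blocks of the Gene sequence field.
fragment = "CACGAAGTAA CAACCTGCCA GGGTAAATTC"
seq = clean_sequence(fragment)

print(f"length: {len(seq)} bases")
print(f"GC fraction: {gc_fraction(seq):.2f}")
```

If the record is internally consistent, applying the same two helpers to the full Gene sequence field should give 5738 bases and a GC fraction near 0.55.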