Gene Information |
Locus tag | PHATRDRAFT_39708 |
Symbol | |
ID | 7195317 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 512261 |
End bp | 518679 |
Gene Length | 6419 bp |
Protein Length | 1883 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183618 |
Protein GI | 219126761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
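The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = gene_span_length(512261, 518679)  # Start bp and End bp from the table
cds = coding_length(1883)                # Protein Length from the table, plus one stop codon
non_coding = span - cds                  # bases in the span not accounted for by codons
print(span, cds, non_coding)
```

Under these assumptions the span works out to 6419 bp, matching the reported gene length, while the coding portion is 5652 bp. The remaining 767 bp would be consistent with the gene being marked as spliced, although the table itself does not list intron positions.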
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.405185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATG AGGATACCGA GCAGACCAGC CGTGCCGAGT CGGAAAGAAA TTCCGCGATC GAGGAGCTAC ATAGCAACGA TTTTGAAGCT CAGCAAGTCA CACCTGACCC TATGGCCAAT TCAACGACGC GACTACTGGC ACCTACGGAC GTTGAATCTC TGGACGAAAG ACTAGAATCG AGCACTTCCC CGTCCGGATG GAGTCAAATC ATCGTCCTGC TCCGAAAGAA CCTTTTGATC AAGATCCGGA CTCCGCTGCA AACATTTCTG GAGATTTTCA GTCCTGTGCT CATGATGTTG GTACTTGTGG CAGCATACCA ATTGTCCAAA GTAACTTATC GCGAAGCCAA AACGTATGAT ACAATTGAAA TCGACCTTCC CGGCCCATGG CTCGACATCG TTCGCCAGGC GACGCAGCTA TCGAACTTCT CTGAAAACGG CCGTCGGCTA CTACGAAGTG AATGGGATGA ACCCGAATTC CCTGGGTCTC TGTTCTTGAT CGAGAAATGG GAGCTGTTTC GGTCGATGTA TCTACGATCG ACGACACGTC GTTCTGAGCG GCGTTTGCAA TTTACCGAGA CTACAGATGA TGACATTGAG GTTGACGATA ACGAGCGAAA AGACGTCAAC AATATATATG ATGTAGTTGA CGAGGCCTAC AGGGAAATCC GTCGCCTTCT GAAAAATCCC ATGTTTATTC CGTCCTTTTC CCAGTATGTC AATATTTCTC AGCAGATGTC GAGTCTGATC AATGTAAACG ACTTGCCCCG AGTCTTTTCT GAAAGTAGCT TTGGACGGCA GTGGGGAAAT TTGTTAACGT TGGGTACCAT TCATCTTTCT CCGCCCTCAG GGGTCGCCCT AGATTTTTGG GCTCATCTGA ACGATACTTA CCCTGACGCC ATAGCCTCCT TGAAAATTAG AATGCATCGA AGCGAGAGTG AAGCTATCAA GTTCATAGAT GAAAATTTGA ACGAGCGAAC GTGGGCACTA ATCGACTTTT CCGCTTGGTC TACCGATGGC TCCATGGATA ACGACGCGAC CTTTAAGATT CGCATGAACT ATACCACGCT ACCAAATACA GCTCAAATCA GCGACTTTGT CAGCATTGGT CTCAACACAG CTTACCAGCG CTACTACTTA TCAGGATTTT TGACTATACA ACGGACACTG AATGAGTTTG TATTTTCAAG GGCCGGTGGA AGCTGTCCAG AGCTGTTCTC GAACTCGAGC GAAATTTGGA GCATGCCCAT GCCGACAGCG GCATATTCAC AAAATAGCTT TTTTCTTGCG GTTGGCTTTT TATTAGGGTT GGCCATCGTT ATGGCGTATC TTTACCCCAC TTCACGACTG ATCAAGCTCA TGGTAGAAGA AAAGGAGACA AAAATGAAAG AGACGCTTTT GATCTTGGGA ACGCTTCCGT GGGCTCATTG GTGGTCTTGG CTTTTAACAT CATCGATTGT ATTCTTTGTC ATCGCTTCAT TGGTGACTTG GGTGATAAGT GCCAACATTC TCAAGTTTTC AGCACCGATT TATATTTTCG CATGGATCGG ACTATTTTCA TCTTCCTCTT TAGGGTTTTG CTTTACGGTC TCAGCTCTAT TTAATAAAGC GAAGCTGGCG AGCATTTTAG GCCCGATGAT CTTGTTTGCA ACAATTCTCC CTCGTTTCAT ATTTTTCGGT TACAATCGCT ACGAAGCTAC AGCAAAGAAG AAATGGGCGT CTTTGCTGCC GGCATCTGCT TTCGCTTTCG GGGCTGATAT CGTCGCTGGT AGGTTTTGAT ATCCCTGGAT TGTCTGAATC 
GGAATGTAAC TGATTGTGGC CCTCTTTCTG ATTCTTGTCA TCTTAGATTA TGAGTATGCT GAACAAGGAA TTCAAGCCTG GAATGCTGGC GAAGGCGAAT ATTCTTTCCA CACCTCTTTG GCGTTCCTCT TGTTTGATAC GATTCTTTTC TTGCTTTTGG GCTGGTATCT TGAACAGATT ATGCCCCGTG ATTTTGGTAC ACGAAGGCCA TTTTGGTTTT TGGTTTCAAC GAAATTCTGG TGTAAATGCA AAGATGCATC GGCTCATAGT CTGTCCAATT CGGGGTCTGC AAACAAAGTG GAAAGGTAAG TACTGATAAC ACGCAGATTG ATCCTTGACC ACAATTCTAA CGTTCGGTTT CCTTTATCGC TCGTAAAAGT GCTTCTGTCG ATGAGTCGTT TCTTGTTCCG TCTGTGCAAG CTACCAAGCT CCTGAAACAT TACGGGGCAA AGAAAGGGAC CGGGCTTCAG CCGGCTGTCA ATCAGCTAGA TCTAACGCTC TACGAATCAC AGATCACGAC TCTACTAGGT CATAACGGAG CGGGCAAGAG CACATGTATT GGTCTTCTCA CTGGTATGTT CCCACCGACG TCTGGTGATT GCAAGATATA CGGGGAGTCT ATTGTTCACA ACGTTAACCG GGCCAGAAGA TCAATTGGGA TCTGCCCTCA GCAAAACATT CTTTTCGAGC GACTGACTGT CTTCGAACAC ATCGTATTCT TCCAACGTCT GAAGGGAGCG AGGCAGAATC GAAGGAAAGC AAAGGCACTG GCGACAGATC TAGGCTTGGA ATCTTTTCTT CATACCACCG CTGCTGCTCT AAGTGGTGGC AATAAAAGAA AGCTCTGCGT TGCCATTGCG CTATGCGGGA ATCCAAAATT TCTTGTCTTA GACGAGCCGA CCTCCGGTAT GGACCCCGAT GCGCGGCGCA AAACGTGGGG CGTGCTGCGT AAACAACGAG CAGGGAGAAC TATTCTGTTG ACAACCCATT TCATGGACGA AGCTGAACTT CTTTCAGATA GAATTGTTGT TATGCGCGCA GGAGATTTAC AGTGTACGGG ATCACCGGTG GAGTTAAAAG TAAGGTTTGG CCTCGGTTAC AATTTTACAG TATGTATCGA AAGGCAAGAA GAACAACATC CGACCTCTGT TGATATTGGA GAAATGACCA CAGAGTCCTA TATGGCCTTT AAAAGTGCAA ACATCTTATG CTTTATACAG AAATTCATAC CAGATGCGAA GTTGCTGCGT GTTGGAGGCA GAGAACTCAC TTTTAACTTA CCACAGAGCT TTGAGGACCA GTTTGGAGAC ATGTTCGATT CCTTAGAGGT CAATCAAGGG TCGCTTGAAG TCGGTTCTTT CGGTATCCAA AACGCATCAC TGGAAGAAAT CTTTATCAGC CTGGTCGAGC AGGAATCTGG CGATCAAACA GCCAAAGCAC ATGATGAGGG ACCTAAAGAG CCTCGATTTG ACACGAGAAC ATCAAAAGTC TCAGAGGGGC ATGATATTAC CATGGGCGAT CACTCACAGT GTTTGTCCGG TTTGACCTTC ACCGATGCAG CTGGTCAAAC GCTTGAATTA TCCGCCTTAT CTTCCTCACG CCAGATAGTC GTTCTTTACA AAAAGAGAGT CACGGTTCAA AAACGAGACT TTCGAGGGCT TTTTTTTACA GTTGGAGCCC CGGTTCTTGT TTCCGCTCTC GTTCTACTTA TATTGAAGGT CAACCTTCCG ATAGTTGGGC CGGAGATTTC CATGTCCTTA GCGCTCTACA CCCAGTCCAG GACCGGCAGC CGAAGTGGGA CAGAAGTTCT 
TGTTGGGGGT GCTGCAGGTT CAGCGCTGCC TACTAGCCGT CCTCTTATGG CAGACATTAC TATTTTCGAG AAAACCTACC CCTCCGTCCA TTTTGAAGTA ATCGAAGAAC TAGCTTCATC AAGCGATATT TCAAGATATC TACTCGACAC GATCAATTCA CAGAATCACT CTGTTCAGTA CGGTTCCTTC TCGATAAACG ATTCGATCGC TTCTGTTTCA ATCGTGGACT GGAGCGCTCT TAAGAATGAA AATGGAACGG ATAGCGTTCT TCTGGATCTA AGCGGTCCAA TGGGGATCGG TGACCTAGTC GACCTTGTCC CTTTCATGAC GGGCGACCAA GAATATCGTG AAACCGTTGA AACTGACATG AGTATTCTTC ACAACTCTTC CTCACCCCAT GCTGTGTAAG TCGGCGTCAT TTCAGTAGGG ATGGTGCCCT GCTTGATGGC CGCTTTATCT CATTTTCTTT TTTCTTAGTG CCGTGTTTAA TCAAGCTTAC GCTGATCACA TCCTGAAGGT ATGTTCGGGG GAACCAAAGA GGATACTTCA ATCACTTAAC GCCCCTTTGC CATTGACAAC GGCACAAACA GTGGAAATAA AGGCGATTCT TAGCATTCGT AAGTAACAGA TTTGAAGGGA TCAAATTTGC GTCCCTAAAT GAATCTAATC AAAATTTCTT CCTTTCCAGT GGCTTCATTG TTCCTCCTGA TCCCATACTG CTACATCCCT GGTGCCTTCA TTGTTTTCAT GGTTAAGGAG AAATCTTGCA AATCAAAACA TCTACAGCTT GTAAGCGGTG TGGATGTGAG ATCATATTGG ATATCAAATT ATGCCTTTGA TGCAAGTGTT TTTTTATTGT TGACCCTGCT GGTGATGGCT GTTTTCATGT TTTACGGAAG CGATTCAGCG GAAGTTTTTG TGGGAGACTT GGAGTCTTTC TTCTGTACGA TGGCGCTAAC GTTCGGCTAC GGTTTGAGTA TTCTTCCTTT TGCTTACCTG TGCTCCCGCC GTTTTCACAA TCATAGCTCC GCTCAAATTG CAGTCATAGT AAGTTTGGAA GTCGCGGAAG TCGGTCCATT CGTACACGGG ATATTCAATT CTCAATTTTG CTTCGCTTCG TTTAGGGAAT TGGATTCGTC ACGGGCTTTG TTTTCGTCAT GGCATATTTC ATTATGATCT CGATTGAGTC GACCGAGCAA CTTGCAAAAA CCCTTCGCCC AATTTTCAGA ATATTCCCGG GGTACAATGT TGGAGATGGA TTCATTCAAA TGGCCAACGC CTTCTGGGAG AGGCGCATAC AAGGGACTGA TTCTGGGAGG CCGTTCAGCT GGGAGGTCGC TGGTAAACCC GTATTACTTT TGTATGGACT AGCGCCAATC TATTTTTTGA TATTGCTGAT CCTTGAATAC TCAGGGGACG GAAGCGCTGG AGGGAAGATA GGCTGGGTAA TTCGGTCAGC CAAATCTTCA TGGGAACGGC TGATTCTTCG CTGTAACGGT GTCCATGCTG ACTGCTCGCT TGACGATGGA TTGAAAGAGG GCACGCGAGA CGAAGACGTT GAGGGAGAGC GCATATTTGT CTATGAAAAT ATCGACCAAT TAAAGCATTC CGCTCCTATT GTCTACCAGG ATCTGTGGAA AATCTACCCT CCCTCTGTTG GTTTGTTTGG GACGATGGCG GCATTTGTAC GTGGCCACAT ATATATGATA TGCTGCTTGC TCAGAAAAAG CTCGGCAGAG AGAAGGACAG TGATATTGAA GGAGAGGAGA CGCGTATTGC TTCCTAAACG GGCCGTTCGT GGTGTAACAA 
CGTGTATACA GGAAGGCGAG ACGTTCGCAC TTTTAGGTGC AAACGGGGCT GGAAAGAGTA CAAGTCTGAA TGCCATTACT GGAGACATAT CCGCAACGAA AGGCAAGGTA TTTGTCGCCG GATACTGTGT TACTGGAAGC AACGATGACT ATGACGTCAC CAATGCACGG AAGCACCTTG GCTATTGTCC ACAAATTGAC CCTCTACTGG AGCTAATGAC ACCGAGAGAG ACCTTGGCTA TGTTCGGTCA AATTCGGGGC ATTCCGCTTG AGATACTGAA CGGGCATGTA GAGAAGCTAC TCGAATTTCT GTTACTTAGC ACACATGCAG AGAAGACATG CGAAAACTTG TCGGGAGGAA ACAAGCGCAA ATTAAGTCTT GGAATTGCGC TAATTGGTGA CCCGACAGTG TTGCTGATAG ATGAAAGCTC GTCAGGTCTG GACCCAGTTG CTAAACGCCG GATGTGGAGC CTCATATCCC GAGCGGCAAA GAATCGGTCT ATTATTCTGA CTACTCACCA GATGGAGGAA GCGGAGGCGT TATGCACGCG TGCAGGCATC ATGGGTAACG GGGAACTGCT TTGTCTTGGC TCGGTTCAAC ATTTGAAGGT AAGATTCAAC AGTAGATTTG CACACAAGAT TGTAACTTCG TCCTTTCGCT GACGTGTTGT ATCCTCATGT TGCTGCGTAG TCCAAGTACC TCGACGGGTA CACAATCGAC ATATTTTGCA GTTCAACGAG CTCGGAAACG GATAGAGATG CTTTGGTGTC GGAGCTACTG GATAACTCTC TTCCTGGGTC TCTGCTAGCA GAACGTCATG GTCGTTTTCT CCGTTTCGAT GTACCAAAAT TGCCGTCTCT TGGCCTTGGC CATACTTTCC GCCGGTTGCA AGCGTTGAAA GGATCTTGCA GCTTTCCTTT GGAGAACTAC AGTATTTCTC AGTGCTCCCT TGAGCAAGTT TTCATCAAAC TTACTAAGCA AAATAACTTT GTTGATTAG
|
Protein sequence | MMDEDTEQTS RAESERNSAI EELHSNDFEA QQVTPDPMAN STTRLLAPTD VESLDERLES STSPSGWSQI IVLLRKNLLI KIRTPLQTFL EIFSPVLMML VLVAAYQLSK VTYREAKTYD TIEIDLPGPW LDIVRQATQL SNFSENGRRL LRSEWDEPEF PGSLFLIEKW ELFRSMYLRS TTRRSERRLQ FTETTDDDIE VDDNERKDVN NIYDVVDEAY REIRRLLKNP MFIPSFSQYV NISQQMSSLI NVNDLPRVFS ESSFGRQWGN LLTLGTIHLS PPSGVALDFW AHLNDTYPDA IASLKIRMHR SESEAIKFID ENLNERTWAL IDFSAWSTDG SMDNDATFKI RMNYTTLPNT AQISDFVSIG LNTAYQRYYL SGFLTIQRTL NEFVFSRAGG SCPELFSNSS EIWSMPMPTA AYSQNSFFLA VGFLLGLAIV MAYLYPTSRL IKLMVEEKET KMKETLLILG TLPWAHWWSW LLTSSIVFFV IASLVTWVIS ANILKFSAPI YIFAWIGLFS SSSLGFCFTV SALFNKAKLA SILGPMILFA TILPRFIFFG YNRYEATAKK KWASLLPASA FAFGADIVAD YEYAEQGIQA WNAGEGEYSF HTSLAFLLFD TILFLLLGWY LEQIMPRDFG TRRPFWFLVS TKFWCKCKDA SAHSLSNSGS ANKVESASVD ESFLVPSVQA TKLLKHYGAK KGTGLQPAVN QLDLTLYESQ ITTLLGHNGA GKSTCIGLLT GMFPPTSGDC KIYGESIVHN VNRARRSIGI CPQQNILFER LTVFEHIVFF QRLKGARQNR RKAKALATDL GLESFLHTTA AALSGGNKRK LCVAIALCGN PKFLVLDEPT SGMDPDARRK TWGVLRKQRA GRTILLTTHF MDEAELLSDR IVVMRAGDLQ CTGSPVELKV RFGLGYNFTS FEDQFGDMFD SLEVNQGSLE VGSFGIQNAS LEEIFISLVE QESGDQTAKA HDEGPKEPRF DTRTSKVSEG HDITMGDHSQ CLSGLTFTDA AGQTLELSAL SSSRQIVVLY KKRVTVQKRD FRGLFFTVGA PVLVSALVLL ILKVNLPIVG PEISMSLALY TQSRTGSRSG TEVLVGGAAG SALPTSRPLM ADITIFEKTY PSVHFEVIEE LASSSDISRY LLDTINSQNH SVQYGSFSIN DSIASVSIVD WSALKNENGT DSVLLDLSGP MGIGDLVDLV PFMTGDQEYR ETVETDMSIL HNSSSPHAVA VFNQAYADHI LKVCSGEPKR ILQSLNAPLP LTTAQTVEIK AILSILASLF LLIPYCYIPG AFIVFMVKEK SCKSKHLQLV SGVDVRSYWI SNYAFDASVF LLLTLLVMAV FMFYGSDSAE VFVGDLESFF CTMALTFGYG LSILPFAYLC SRRFHNHSSA QIAVIGIGFV TGFVFVMAYF IMISIESTEQ LAKTLRPIFR IFPGYNVGDG FIQMANAFWE RRIQGTDSGR PFSWEVAGKP VLLLYGLAPI YFLILLILEY SGDGSAGGKI GWVIRSAKSS WERLILRCNG VHADCSLDDG LKEGTRDEDV EGERIFVYEN IDQLKHSAPI VYQDLWKIYP PSVGLFGTMA AFEGETFALL GANGAGKSTS LNAITGDISA TKGKVFVAGY CVTGSNDDYD VTNARKHLGY CPQIDPLLEL MTPRETLAMF GQIRGIPLEI LNGHVEKLLE FLLLSTHAEK TCENLSGGNK RKLSLGIALI GDPTVLLIDE SSSGLDPVAK RRMWSLISRA AKNRSIILTT HQMEEAEALC TRAGIMGNGE LLCLGSVQHL KSKYLDGYTI DIFCSSTSSE 
TDRDALVSEL LDNSLPGSLL AERHGRFLRF DVPKLPSLGL GHTFRRLQAL KGSCSFPLEN YSISQCSLEQ VFIKLTKQNN FVD
|
| |
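The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the displayed gene sequence could be normalized and its GC content recomputed. The helper names are hypothetical, and the GC definition used here (G plus C over all unambiguous bases) is an assumption about how the reported percentage was derived.

```python
import re


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return re.sub(r"\s+", "", raw).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T); 0.0 if there are none."""
    counts = {base: seq.count(base) for base in "ACGT"}
    total = sum(counts.values())
    return (counts["G"] + counts["C"]) / total if total else 0.0


# Illustrative input: the first three blocks of the gene sequence shown above.
example = clean_sequence("ATGATGGATG AGGATACCGA GCAGACCAGC")
print(len(example), round(gc_fraction(example), 2))
```

Applied to the full gene sequence, the cleaned length could be compared with the reported 6419 bp and the GC fraction with the reported 46%. That comparison has not been run here.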