Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49134 |
Symbol | |
ID | 7195606 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 14232 |
End bp | 20604 |
Gene Length | 6373 bp |
Protein Length | 2097 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183914 |
Protein GI | 219127379 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAAA CATCGTTGAA ACAAACGGGA GCCGTTCCGT TGCCCAACGA TGGTCCCATC CACTTTGTTT GGCAGCAAAG ACACAAGACT ACCGGGGTAG ATGATTCCGT CTCTAGCGAC GTGCACGAAG CTGTTGCATT TCTGCGCTCG GTGGCGACGC CAACGAGTAC CGCGGAGTCT CGTTCGCCGA CTTTCCGTGC CATCGACAGC ACCAGTTTGA GTTGTCTCGT ACACGACTTG TGGTGTTTGA ACGATATTCG CGAGCAGTGT CAACGACTGC GTGTCTTTCG GAGTCATGTG CTCCAAGCCC GTCAGAAACA AAGCAGTGAC GCGGGTGTTG TTATCGCCAC GCTGGATACC CGGACTGCCA AAGCATCGCA TCGCGTTTTG TTGGAATGGT CCTTGGCAGT CGCTACACCC GTACCTTTGC GTCGCGCCGT ACACGCGCAT TTGCGGGAGC GTACCGGTTT GTCGGACGAC GCAACGTTGG ACATTGCCGG TGCCGTGTTG GATTCCATCC TGTCGAGCTC GTCACAATCA ACGTCTGGGC CGTCGTTGTG GTGGTGGAAA GATCCCATTG CAACCTTGCA AGAGTGGATC GCCTTTATGT CCTCCGACCC TTCTTTAGCG GTGGACTGGA ACGAAACACG TCCACGTATA CTGGTGTTTC TCCTCACCTA TGCCGATACT TGCAGTATGC CCACTCTTGG GCGTTTCGCC TATACGTCAC CCTCCACTGA AATGGTGGAT CAACAGTCGC AGGTCGCCAT TGCGGAAGCC GTCCGCGTCG CGGATTTATT TAAGGCTTTA CTCCAACCTG GCGACCGAAA TTGCTACTGG GAGACACGAG ACGAACCCTT TCTGGAACAA ATGCGGGACT TTTTGTGGGC TCTGCTTTCC TGTCGAGCCG TTTCCGAGCC TTCGTTGCAA ATTATCGGCA TTGCGTACGG ACGGGTTTGT GTATGGCCCT ACCATACGAA TACGCATACG AACACTTCCG AAAGTGTGGA CCCGGACGCG TTGGTAGCCA AGCTGCAAAA AGATTGGTCA CGGCTACCAG AGTTGTCGGA TTTGGCGCGC GCCGTTGTCG TTCAAGGGTT GGCAGCGACC CTTCCCGACC AGACACTATT TGCGCACAGG CATGAGGACA TGGAGACGCC CATTGATATG TCCACACTTA TGGATACTTT CGGTGATTTG GCCACCGGAG CGATCGATCC GGATGTACGG TTGGCTGCTT TGAAAGGTTT TCGCACGTTG ACGAGTCGCA GTCTAACCGC TCTCGCTGTA GACAATTCAG ACTCTGATAG GTTGGCAACA CGAAATTTGA GCTTCGCGTT GGCCCAACAA TCTCTCAAAG TTGTCATGCA GGCGTGGGAA AGCCCTCCCA CCCGTCGACT CGCCAACGCT ATTCCGCCCC TCTTTGACAG TGTCGTCAAC TTGCTGCGTC GACTGGATCA AGATTGGTCG GTTGACGCAT TGGTTGGGCG TGTACTGTCC CAACCTGCTC ATTGTAAAGG GCGCTATATA GCGCTGGAAG CACTCTTGCC AATCTCCGGC GCTCAAGCCT TGGTGTTCAC TGCGAGCCGC CCCGCCAACA AACACAGTAT GCTAGACGAT TTGCTCTCCG GCATTGGGGA CCACGGGCAC AATACGAATG CTATTGCTGA TTTGTGGGCC CGTATCCTTC AGCAATATTT GCAAGAACTT TTGCTTCAGA ACGGAATACC GACTTTATCT TCTTCATCCC AGAGCCAACG TACCAAAGGG GCACCGGCAG TGGCAACCGT TATGGAGAAC ATGGAAACGT TGCCGTCGGT TCTGACGGCT TGGTGTGATG CCTGGGTACC GAGCTTGACA GAGGCCCTCC TCGTACCTGA AGTCAGTCGA CGCAAGCAAA TAGCTTCGTT CTGTTTACCT CGGGTTTTAA AGATGGCTGG CAACGCTCGT CCTCGAGCCT CATTGGCCAT TGTCCAAATA CTACAACACC TGAACGCAAC CGGTCAGAAT CATTCAACTC GACTGAACAA ACCATCAAAA GAACGTGAAA CGAAGAAAGA CCGCGTTTCG TGGGCCATGT TGGAAGTGGT TCGACTAGCA GCGGTTGACC AGTTGTTGAG AGAAAAGCCT GTGGGTAGCA CCTTGGCACT AGATACGTGC GTAGCGTCAC TTATATCCAA AAAGTGTCTC CAATCGGCGT TGGTGCATTT CTCGCCCTTT GTTCGTCTCG TTGCCTTTCA AGCCATGCCG CATGTAGTGA AAGCAACCAG CTCGTTCAAG GACGACGCAT CACGACTTGG GCATGAAGTA TTTTTGTGGA AAGCCACGTT ACCCTTTGTG GTCAAAACGA GCGACAAAGA GTACGCATTG ACTTTAGTAC AGTGCTTACT TTCTTTTATG GATCGCTTGT CTTCTTGGGA AGCAAAATGT GTGTTATCCG ATGATCCGAA CGCCCCGCAA TCAACAGAAA ATTCGCAGCT ACAAGTTTCA TCCTTTGCCA TTGACTTTTT GATCAACGAT GTTCTTCTAC AAAAACTTAC GTATCCGGGA TCAGTTGTCG ACAAAGAGTG CTTGGCTCTC TCATTGTTTG AAGCGTTAAT TGTCTTTTCA TGTCGAGATC TCAAGTTTGC TCAAGAATGT CGGCTACTAC CTAAAGCCGG TGCTGCCTTC CACCGAAAGA GAAATTTTGC GGAGGAGGCT GTATGTGGTC GGATACGGCG AGCTCTCTTT GGTTTTGAGG GTCTTGCCGC TTTGTTCTCG ATGCTGAGCT CCAATTGGGA TGGCATACGA GACGATGCTT ACACCATTCT TAACAGTGCG CTCGCCTTGA CTACTAGATT TCACCTCTTC GTCCCAATGG AGTTCACCTC CAACAAAGCT CGCTCAAAGT TTGAAGCTCG CGCCTTATTT TTGGCCTCCT CTCCGAGACA GCGCGAGGCG GACGCCGGTG CTAGAATCTT AGCATTTCTC TTTGTTTCAT CGCTCACGAA CGACCAGAGG TATGACTATC TCGAGCGTTT GGGCTCAATT CTTCAAGAAA GGTTGACTCT AATGAAAGTA AAACTACAGG ACATTCTGTC AAATGATACC GACTTTGTCG ACGGCGCTGA GTTGCCCTTG GCCCACGGCA TTATCCGAAG CATTCGGCTG ATCTACGAAC ATCATCAAAC ACTTACCGAC ACTCTGCAGA ATGAGCATGA TCGATTGGGC ACTTTGTTCC AAAACATGAT ACCAATGTTT TGCAAAGCTC TGCAATTGTC ACTAGGCGTT GTAGCTGATC TCCGGGATGG CGAAGCTGTG GATGGGATGG ATCGTGAACT TGAATTTGCA TCAAGCAAAG TAAACCCTGG AGCTATTGGA GCCAACGGGA TTTTCTCGTC AGTTCAACGC CTGAGCAAAA CAGAGCACTC GAGACGATTG GCATCGCAGC GAATTGTGAT TGGTTCTTGG TTATTGACCA AGGAAACCTG CGCAGCCGTA TCGATATTGT TGTCGTTAGG ATGTGTCAAT CCAAGCAAGG AGAGCATGCA GCAGGTAGGA ATGCTCCTGA TATCAACCCT TACCTCATTG AAACACACAG GAGCTGCGTT TGCCGCCCAC AAAGCTCTTC AACAAATTTC TGTGGCTTGT TTTTCAATCC CAAACCTCGA GACTCTTCCT AAAGAGTGGG CGTTACGACT GTTAGATGAG CTAGCAAACT CCGATAAAAT ACGCGACTCG ACTTTGCGGC GAAGCACCGG ATACGGGCTT GGATTTTTAT CCATAATGCG ATCAGAAGTT GCGTGTCACT CTGCAAATCA CTCTCTGTGC TCTTTCATTA TGAAGAAAAT ATTGTGCCTT TCACTTCCTT CAAAAAGCAT GCTGTCGTCA TTCCTTTCGT CGATTGCTTG GTTTGATAGA GAAAGGCCAA TACCGCTTCC TTTCAGTCTC GGGGATGAGG TGGACGGTAT TGGATTATGT AATCAGTATA CGCTTCGATC GCGAGTCCAT GCGCTAAATG TATTGCGATC GATACTGTCG GACGCGCCTT TAGCGAAACA AGTCTTTCCA GCCGTCGGCG ATTCTATTGT ACGTGGCTTG TAGACCCAGC TCTCTTGTTT GCTGCATTTG CTCAGTTTTC CTGCGTTACA CTTTTACGGT TTTGCAGGTG ACAGCAATGA TGGGATATTC TGATACGGAT TGGTCTGTTC GAAACTCTTC CACGATGGTT TTCGCGGCTG TCATGCTCCG TGCAGTAGAT GCGGACAAAA ACGCATCCAA CGAAGATGAG ACAAGTAGTA GAGCTATCAC ACTAGCGGAG CTCTTTCGTG TTTTTCCATC TCTACCGGAT TTCCTAGTGT CTGTTTTGCA GGCAAGTATT GCAGGGGAAT TGGGCGAGGC GTCGACTTCT CCTCCCGTTC TTCCTATTCT TTTGCTTCTG GCCCGCTCTC AGCAATTGTC AATGTCGGGT CACGACTCCG TCTCGATTGC TGAACCTTTC ATGCAAGTTG TGTTTCGCTG TCTTGGTCAT GCTCACTTAA GTACCCGGGA GGCGGCCGCT CGGGCTATCG CAAACCTGTC CTCGGCAGAG AAGAAATCGG TCACATCTTT CCATTGTATT CTTCAGACTT GTAACGATCG CCTGAAAGTC AATTTACGGA CGATGGTATG GAATGACCTA CACGGGACGC TGCTCTGTAT TCGTGACATG TTGGCGCTTT GGAAGCTTAT TGTGAAAGAA AGTCTTGGGA AAGAGTTGCT TGCGTGGTTG TGGCGTTTTA CGAGGGTAGA AAACAGTCAG TTTGTTGTGC CTCCCCTATG TGTATCGGTT GCCCTAGAAA TTTTGCACAG AGCTAACCTT ACGGACATTG AGTCCCCCCA GTTGCTGCGT GCCTGTCTCG ACATTGAAAG CAAATTGGCA CGTGAGAGGC ACCATACAAG GGGTGACACA GGTCTATCCA AACTGGGTCT GACCGCAGGC TCAATAGCCT GCTCTGTCGT ATCAGAGCAG CTATGGCACT TTTCCGCCAA ATGTAGAGTA CAAGTTGACG AAAGCGTTTT CTCGAAGGTT GAGACTCTTC TTCGAAGTGA CTGCCCAGAC GTGAAGCTAG CGAGCGTCAA AGTCTTCAAG AAGTCAATAT ACCGTGGCTT AGACTCAATG CTGAAATCGT CTGATGCTCA TGAATGCTCA ACGTTTCTTA GCAGATTGAC GACAACAATT TTAAATGCAC TGAAAGCAGA GCTCGGAATC GGTAGCTCAA AAGCAGAGAA GATCTTTATT CATCCACCAA CTGTTCGTCG ACTCTCCCGG TGCCTTTTGG AAAGCTTTGA CGCCTATGAT GCTTTGGCAA TGGAAGTCCC ATCAACTGTT GCTTGCTCAG TCCTATGTGG AATAGGAGAA AGTCTCCTTA AACTTGGGGG TTTAGAAGCC AATCAAAATT GGAAGTTGAT TGATGTTACG CAACTAGCAG GGAATGGCAT TGAGCTGCTT TCGCGATGTA CACTCTTTAA TGATTCGTAT GAAGATGGAT TTCCGTTCCA AGGTGTGATT GGATTTTTGT GCGATCACAG ACTACCATGG CGCCTCCGAT ACTCTGCCAT CACCGCGATA GGAACGCTTT CTACGAATTT GACGTATCCC CCAGATTTAT GTTCAAAGTG GACGTCTGCC TCGTTGGAGT ACCTACAGGA CGAAGATCCA GACGTCCGTT ACGCTGCTTC CAAAACATTA GTATCGCCAA TCAACTCAAA TGTCCAAATT GCAGATGTTG CTTTGAACGT TCTTTTTACA GGAAAAAGCA ACTCTCGTAT TTTTCTACCG TGGCTCGAGA GATGTATTGA AAAAGTTCTG GAGAACTATT CGAAAATGAA TCGCCGTGTT TGCATGCTCG CGAACGAGTT GACGCAAAGC GGGATCAAAT CAGGAGAGAT ACTGAACGTT GGGAATGTGC GCGAAATCTT CGAAGAGGAA AATCCCAACT CGTATGGCGA ACATTTGCTT TTCGTTCAGC TTGCGGTTCG CGGCACATTG CAAGCGACTG TAGGTACTGT TGGACTCAAT CGAGGGATCA TTGACAAAAT TTTAGCATGT TGCAGTCATA TGCTTTCGCA GCTGCTCCTT CATTACACCC AGCTTCCGGA TTTGGACATT CTTCATGATC CCAGCCGATC CAGCGATATA TTCCCCGAGC TCCATGCCAT CTTTCTCCTA TCTGCCACCG TATTGACGTT CGGTGTTTTG CAAGAGTCGT CGGACATAGT CCAAAAAGCC AATTACTTTG CGCAGCATTC TTCGGTTGCG CGTTTGCAAC ACGCGTCGCG GGCCTTGGCG CAAGCAAACA ACAGCGAAGT TTCGAGACGG CAAATCTGTT CATGCTGCTT CTTGCTTTCG TAA
|
Protein sequence | MGKTSLKQTG AVPLPNDGPI HFVWQQRHKT TGVDDSVSSD VHEAVAFLRS VATPTSTAES RSPTFRAIDS TSLSCLVHDL WCLNDIREQC QRLRVFRSHV LQARQKQSSD AGVVIATLDT RTAKASHRVL LEWSLAVATP VPLRRAVHAH LRERTGLSDD ATLDIAGAVL DSILSSSSQS TSGPSLWWWK DPIATLQEWI AFMSSDPSLA VDWNETRPRI LVFLLTYADT CSMPTLGRFA YTSPSTEMVD QQSQVAIAEA VRVADLFKAL LQPGDRNCYW ETRDEPFLEQ MRDFLWALLS CRAVSEPSLQ IIGIAYGRVC VWPYHTNTHT NTSESVDPDA LVAKLQKDWS RLPELSDLAR AVVVQGLAAT LPDQTLFAHR HEDMETPIDM STLMDTFGDL ATGAIDPDVR LAALKGFRTL TSRSLTALAV DNSDSDRLAT RNLSFALAQQ SLKVVMQAWE SPPTRRLANA IPPLFDSVVN LLRRLDQDWS VDALVGRVLS QPAHCKGRYI ALEALLPISG AQALVFTASR PANKHSMLDD LLSGIGDHGH NTNAIADLWA RILQQYLQEL LLQNGIPTLS SSSQSQRTKG APAVATVMEN METLPSVLTA WCDAWVPSLT EALLVPEVSR RKQIASFCLP RVLKMAGNAR PRASLAIVQI LQHLNATGQN HSTRLNKPSK ERETKKDRVS WAMLEVVRLA AVDQLLREKP VGSTLALDTC VASLISKKCL QSALVHFSPF VRLVAFQAMP HVVKATSSFK DDASRLGHEV FLWKATLPFV VKTSDKEYAL TLVQCLLSFM DRLSSWEAKC VLSDDPNAPQ STENSQLQVS SFAIDFLIND VLLQKLTYPG SVVDKECLAL SLFEALIVFS CRDLKFAQEC RLLPKAGAAF HRKRNFAEEA VCGRIRRALF GFEGLAALFS MLSSNWDGIR DDAYTILNSA LALTTRFHLF VPMEFTSNKA RSKFEARALF LASSPRQREA DAGARILAFL FVSSLTNDQR YDYLERLGSI LQERLTLMKV KLQDILSNDT DFVDGAELPL AHGIIRSIRL IYEHHQTLTD TLQNEHDRLG TLFQNMIPMF CKALQLSLGV VADLRDGEAV DGMDRELEFA SSKVNPGAIG ANGIFSSVQR LSKTEHSRRL ASQRIVIGSW LLTKETCAAV SILLSLGCVN PSKESMQQVG MLLISTLTSL KHTGAAFAAH KALQQISVAC FSIPNLETLP KEWALRLLDE LANSDKIRDS TLRRSTGYGL GFLSIMRSEV ACHSANHSLC SFIMKKILCL SLPSKSMLSS FLSSIAWFDR ERPIPLPFSL GDEVDGIGLC NQYTLRSRVH ALNVLRSILS DAPLAKQVFP AVGDSIVTAM MGYSDTDWSV RNSSTMVFAA VMLRAVDADK NASNEDETSS RAITLAELFR VFPSLPDFLV SVLQASIAGE LGEASTSPPV LPILLLLARS QQLSMSGHDS VSIAEPFMQV VFRCLGHAHL STREAAARAI ANLSSAEKKS VTSFHCILQT CNDRLKVNLR TMVWNDLHGT LLCIRDMLAL WKLIVKESLG KELLAWLWRF TRVENSQFVV PPLCVSVALE ILHRANLTDI ESPQLLRACL DIESKLARER HHTRGDTGLS KLGLTAGSIA CSVVSEQLWH FSAKCRVQVD ESVFSKVETL LRSDCPDVKL ASVKVFKKSI YRGLDSMLKS SDAHECSTFL SRLTTTILNA LKAELGIGSS KAEKIFIHPP TVRRLSRCLL ESFDAYDALA MEVPSTVACS VLCGIGESLL KLGGLEANQN WKLIDVTQLA GNGIELLSRC TLFNDSYEDG FPFQGVIGFL CDHRLPWRLR YSAITAIGTL STNLTYPPDL CSKWTSASLE YLQDEDPDVR YAASKTLVSP INSNVQIADV ALNVLFTGKS NSRIFLPWLE RCIEKVLENY SKMNRRVCML ANELTQSGIK SGEILNVGNV REIFEEENPN SYGEHLLFVQ LAVRGTLQAT VGTVGLNRGI IDKILACCSH MLSQLLLHYT QLPDLDILHD PSRSSDIFPE LHAIFLLSAT VLTFGVLQES SDIVQKANYF AQHSSVARLQ HASRALAQAN NSEVSRRQIC SCCFLLS
|
| |