Gene Information |
Locus tag | PHATRDRAFT_43378 |
Symbol | |
ID | 7197411 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 286832 |
End bp | 292890 |
Gene Length | 6059 bp |
Protein Length | 1934 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177591 |
Protein GI | 219111679 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAGGCTG CGGAGCGCTA TCTCGCTTCG GGGAAGAGCG CCAAGTCGTC CAACACCAGC ATGTCACTTT CGGTGGACTA CGGATACGGT TCGGTATCAT CCAGTAGCGA TACCAACGCA TCCCCGAGCT ATAACGGGCG CTTGGAAATA GAGCTGGATG GTAAACAAGG ACCTCAGCGG CGTGCGCAAC GACGATCATC CGCTTCGTCA TCACGGAGAT CATCGACGAG CTCAATGGGG TATTACAAGC AATCAGATGA TGATCTAGAA GAAAGTCTAC ATTCATACGC TGGGGACAAC GGGTCAGCCG ACGACGAGGA ACAACACCTG AAGGAGCCGT CTTTCATGAA AGCGTTGCGG AGGGGATCTT TGTCAAAGGA TGAGAGTGAA ATCAAACTCA AGCAGTCGCT TTCCAAAGTG AGCGACTACG TAGATCCACA TGACGATGAT CGAATTCAGT TGATGAAGGA ACGCATTAGT CTCAGCAAGC AGCTGCAGAA TGTAGCGGGC ACCCCAAAGA AGCGCGAACA GCGCTTCGGA CGTCGGTCGT CCATTGAGTC CACCAGTAGT AGTATGTGCT CCTCGTTCGC TCCCGACACC CCGACCCGCA CACACTCTGG ATCATTATTT AAAAGCTTGC AAATCTCCTC AGCCGAATTG ACTGGCATCG CTTCCAGCGT CCGACATTCT GATTTATCGT CATTCAATAT GTCCACGAGT ACAATTCCGG TCACGAATGC TGTGAGAACT GATGAAGGCC GGAGACGCAA GCACGTGGTC AAGACGGATC AAATAGTAGA ATCGTTGGTG TGGTTTTCCT TTCATACCCC ACGTGCCGTT TTGGAAGATC TCATCAAGCA CGAAATGGAA ATCTGGCGTC GCCAGAGTCT ACAGCCAAGA AGGGGCAACT CTTTGCCCAC CGAATCTGCG CTTGGCAAGT CGTTACTAAC TCCGTTGGAT GACGAAATGG ATGACCATAG TTTGAGCTCC CTATCTGAGG ACGGCAATGC CCGTATAACG CCAGAACCAG GTGACAAAAC GTTTTCGCAA ACTCTTCAAC GAATGAAACA AGACACGAGG TACGGAAAAT TCGACATGAT AAAGTTACCC AAAGCTGTTG AACGGGAAAG TGCTTTGCTG TTTGTTGACA TGTCAGGATT TACGAAGTTG TCCACAATGT TGGACGTGGA ATCACTTTCG AGGGTCATTA ATAGCTATTT CGACATGATA TTGAGCGAAG TCATCTTGCA TGGTGGCGAC ATTCTAAAGT TTGCCGGTGA CGCGTTCTTC GCTGAGTGGA AAGTTCTGAG AGAAGAAGGT TGCGATGCGG AAAAGGCTGA AACTACGAAC AATCCTCTGG CCGACCTGAA TGCGTCCTTG GCATCAATCA ATGAAATGGC GTGGCATGAT GATATTGACC TCCCAAAACT ATCGACTTGC GTTTTGTCTG CTGCCAAGTG TGCAACCGCT ATTGTGGCAA AGTTTAGCGA CTATCAGGTC ACCTCGGGAT CTGCTGGTGC TACGGGGGCT ATGCTCAACG TCCATTGTGG TATTGGAGTG GGGCAGTTGG TGGGCTTGCA CGTTGGTGAC TATAAAGAAA ATCAGGAGGA AGACGGCGTT GAACTTCGCC GTGAATTTTT GATTCTTGGT GAAGCGATTG ACCAGGTATG CTTTCTACTC GCTTAATCAT CGATTTGTAT GTTTCACCTC TTACACAATT ATTTTGCTCC GTAGGTATCC ATAGCAGCTG ATGTCGCCAC AGACGGTCAA GTTATGGCGT CACCCGAGGC AATGCTACAT 
CTTGCCTTCT GTTGCGATAT GCCCGACTCC GCTCACAATT CTACCGATCC TGTTTGCATC GCATCTCGTG GGCAGGCTTT CTTGCAGTTC GATTCGGATT TTCAAGTGGA CGGCCAATTG CCTTCCGCCT TGCTTCCTTA CGAGTCGCTC CGGCTGCATT GCAGAACCTT GAACCATGAG GCTTTAGCCC GACTACATCT TCAAATGGCT CTATATGTCC ACCCCGTGAT TCGTGCGGAT GAGCTGGCTC TATCCACTGC CATACAGGCA GGAAAAATTT CTCAGCCTAC AGAAGCCCTC GAGAGTCGCC ATCGAGCAGA GGCAGAGCTT CGATCAGTTT TCACAATTTT TATCAAGCCC ATCGTTTTGC CAAGAGTGAC GGGGATGAGA GAAGTCGATG AAGAATTATT CAAGACTCTT GCCGATATCA TGCATATTAC TTCTCGTGAG CTTGATCGCT ACAGTGGACA TCTTCGACAA TTTATTGTTG ACGACAAAGG TGTTGTTCTT CTTGCAACTT TTGGTCTAAG GGGGTCGACG TTTCCAAACA TGGTTGCCAA TAATGCATTG CCAGCTACGT TTGCGATTCA TCGAGCATTG AAAACGGAAA TAAATGTTGA AAGTCGCATT GGCGCTACAT TCGGAAAAGT GTACTGTGGG GTCGTTGGGG GTGTTAGAAG GCACGAATTT GCGGTCATGG GTGCTCCAGT GAACTTGGCA GCTCGACTTA TGAGTTCGAA GGTCAACAAT GGTATTCTTG TCGACGAGGC AGTAAAAGAA CAAGCTGGTG CTAACTTCGC TTTTCGAAGC CTGCCCCCGG TTCAAGCGAA GGGTTACGAC AAGCCGGTTC CAATTCTCGA ACCGTTACAT GCTGTCAACG TCGGAAAAAA GAAGAAAGTC TCATACCCTT TTGTTGGCCG AAGGACGGAG AAGGAAAAAA TTCTATCGAT CACTTCTGCA ATGTACAGCG GCATACCAAG TAGTGCTATG ATATTTTTGA TGGGAGAGAG TGGTACTGGC AAGAGCTCAC TCTCCACGGT CGTTATCGAC GAGATAAAAA AAAGATGCGT TATTTCTTCG AAAACGATCG TGTCTGCAAG ATCAACTAGT ACTGAAACAG AGCAGAGAAT CCCACTCAGG TATGAAAATC GAATTTCAAT AACCAGTTAT TGGCTGATAG TCATCAATGT ACTCACCTTC GTTTCATTGG ATGTACCCTA GTGCCTATCG AAAGATTTTG CTTAGTATCA TAAGAGATTT ATGCGAGCAC GATGGTACAT GCGAAGACTG GTGTGGAGAA AGCCCTGGTT TTGACAACCG AAGCCATCAA AGCGTATCAT CGAAGTCATC GTTTGCTGGA ATACCGCTCA CGTCGCTGAA AGCTCTATAT GCAATGAGAG GAAGCTCACG GTCAGGACTA AAGAACAGGG ATTTTGCGAC TACTTCACAG GGTGAAGGTG ATTATAGAGC TTCAATAATG CGGTCTAGAG GGCTGCGGGA AATCAGTGAT TCAGAAGAAG CATCTTTAGA ATCTTCAGTG CGATCAGAAA GTAGCGAACA ATCACCGAAG CGTGTCAGCC AATCACTTCG AATCCTTGGT GAAGCCCTTG CCAATGGGAC AACAAGACCA ACCAAGAAAG ATTTTTTGGC TCCAAATTTG AATGTATCAC TCCACCGCGC AACGTCCCGG CCTACGAAAG GTGATTTCTT GCCAAGCAAC TTGAGTGCGT CAGTGCACAA CTCAATGGAT GCGAGTAGAC ACAAGAGCAA GCGGGAACCC CTATCCAACT CTCTTCATCG GATTACTTCC 
GAAGAGCCGA AACTAAGTTC CAAAACACGA CTTGGCAGGG ATGATACCAG CATCGCTAGC GGCGAGAACA GCACTCACAA GCATGGGGTT GCCGTTCCAT ATTTCGAAAA GCTGTGTTGG GTATGTGAGA AACTAGACTA TCCATACGAA TATGCTGACC TTGTGGGATC TCAGTTCCTG TCACTGGACG GAGCAAGTCC AGTCACGCAT GTCGACGGTC ATGTGCCGAC AATGGACGAA TTGGTCGAGT TCCTTGCTCT GGCGTTTATC TGCATTACAG AGTACGCTGA TATTTCTGTC ATCTTAATTG ATGACTTCCA GTGGGCTGAT TCGTTTAGCT GGAAGATTTT TCGAGAGCTT TGTCAACGAG GCAACAAACT GCTGTTAATA TGCGCAATGC GCTCTCATGA TAAGCAAGCG CTCAGGCGAC TCTCAACTGC TGTAACTCAA CGAAGCCAGC TCCACTCAAA AATGACCGAA ATATCTTTGA CTTCACTCGA CACAGACGAC ATTCGAGAGC TGATGGCCCA TGTTCTTGGA CACAAAGAAG AGCTTATTCC CGAATCGCTT TGCACAGATG TTTTTCAAAG GACTGGAGGT TTGCCTGTAT TCGTCATTCA GGTTCTCGAA AATATCAAAA GGGCAAAAAC AGTTGAGCTT GGCGAAGATG GACGATTGCA ATGGAACGCT GCAGGTTTGA AAGAAAAGGT AAGGTCACGC GTGCTATGGC CGATAACTTC AGTTAATTCC CGTCTTACGA GCACAATTTG GAATATTTAG AGAGCGATTG GATCAAATAA AGCAGGTGCA GTAATGGAAG AGACTTTTTT GAGCCGATTT GACGGCCTAG ACGTACAGGT GAGGAAAGTC CTTCAGACTT GTGCAGTTCT CGGGTTGACA TTTTCTTTTG CAGACGTCGT TCAAGTTCAT CCAGAGATGG AAGAAGCCGA CATTGAATAC GCGATGTGCT CCGCTGTTGA TGAGATGGTT CTTGTGGAAC AAAATGACGA AGATGAAGAA AGAACTTTAA TCTCTGCTGC CAGCAACGGC GACTTCGATG ATTCAATATC GAATTTCCTT GCCGCCAGTA GAGCCTCGAC AAGCGGCAAA ACGCACGATG AGCGCTTTTT TCAATTTAGC CATGCAATGT GGAGAAAAAG CGTACTTACA ACGATGTTGA AGGAGAGAAA GATTGAAATA CATCGCTTGA TTGCAGAGGC TATGGAAACA GACAAAGTAT TAATCTTGGA AGAAAGCGAT ATAACACGAC TGCTTACTCT TTTTGACCAC TGGAAATCGT GCGGCGATTT TTCGAAGTCC GCGCCACTTG CATTAGTTGT GGGTGCTCGT TTGGAAGAAT GGGATTTGTC TGCACAGAGT TTAGAACTCT ATGAAGACGC TTTAAGTATG GCCTTTGACA GCGTTGAAAC TCACGAGGAC ATGGAAGCTA TTAATGATGA ATGGGTACAG GTTTCCGCAA GACCAATTGT CTTGGATCTA ATTCTCCGAT TACATATTCG AATCGGGCTC TGTCATCAAC GTTTAGGTGA TGAAAGCGAA AGTATCGCTA CGTTCGAAGA CGCTTTCAAC ATTATGAACA CATCTTCAAA GTTTCCTGGT ATAAGCAGAT CCCTCATGAT GCCAATAATC TCCTCACTCT GTGTGCTCAA ACTGGAACAC GATGTACAAG ATAGTGAAGG CAAAGAAGAG CAAGAACAAC TTCTGCAGCA GTTTCTTTGC GAAGCGACTA TAAACGGTAA TCCTGTTCAC ATCGGTCGTG TTTTAGCACT CCAAGCAATG TATCATGCCA 
AAACTGGCTC CCTCGATCGA GCGCTTGACG ATGTTCATCT TTTACAGAAA AAGTATGACA TCCAAGAGAA TTCTTTCGAT ATGATCACTG AATATGGACG GGACTTTGCC CTTGAGTGCT TCTCGGAAAG TGTTCAATGG ATATATTTGC TTGAGAAACA TGACGCTGCA TCGGACCGTG CCGACTTAAT AATGGACCAA TACCTGACTT TGATCGACCC AATAGACGGC GAAAGTATGA TGGCGATCCT TCTGCCTATA CTGAATGTTC TCAAACTTTT GAACCGAGCA ACTGATGCAG ATTGGCTGTT GAAGCGACAT ATAATCAACC CGGCTCACGA CCAGGCCTTT CATTGTGACT TTTGGGTTCC TCTTTTTAAT CCACTAGCTT ATCTTCTCGA AGTTGTCGTG ATGGAAGAGA GCGAAGACTT TGACGAACAG GTGTTGCAAG AGATGGAGCT CTGGGTATTG GAGGAAGAAA ACAGCAAATT TGATTTTGAT CTTGAGTACA AGGCGCACAC GCTCATGGGG GAACTTTGCT GGAGGCTAGC CAACTTTAAA GAAGAGAACA ATCCAACCCG TGGACCGCTC ATTGACAAGG CGCGCGACTT TCTTACCCCG GTAGCGCAAT ACCCTCATG
|
Protein sequence | MQAAERYLAS GKSAKSSNTS MSLSVDYGYG SVSSSSDTNA SPSYNGRLEI ELDGKQGPQR RAQRRSSASS SRRSSTSSMG YYKQSDDDLE ESLHSYAGDN GSADDEEQHL KEPSFMKALR RGSLSKDESE IKLKQSLSKV SDYVDPHDDD RIQLMKERIS LSKQLQNVAG TPKKREQRFG RRSSIESTSS SMCSSFAPDT PTRTHSGSLF KSLQISSAEL TGIASSVRHS DLSSFNMSTS TIPVTNAVRT DEGRRRKHVV KTDQIVESLV WFSFHTPRAV LEDLIKHEME IWRRQSLQPR RGNSLPTESA LGKSLLTPLD DEMDDHSLSS LSEDGNARIT PEPGDKTFSQ TLQRMKQDTR YGKFDMIKLP KAVERESALL FVDMSGFTKL STMLDVESLS RVINSYFDMI LSEVILHGGD ILKFAGDAFF AEWKVLREEG CDAEKAETTN NPLADLNASL ASINEMAWHD DIDLPKLSTC VLSAAKCATA IVAKFSDYQV TSGSAGATGA MLNVHCGIGV GQLVGLHVGD YKENQEEDGV ELRREFLILG EAIDQVSIAA DVATDGQVMA SPEAMLHLAF CCDMPDSAHN STDPVCIASR GQAFLQFDSD FQVDGQLPSA LLPYESLRLH CRTLNHEALA RLHLQMALYV HPVIRADELA LSTAIQAGKI SQPTEALESR HRAEAELRSV FTIFIKPIVL PRVTGMREVD EELFKTLADI MHITSRELDR YSGHLRQFIV DDKGVVLLAT FGLRGSTFPN MVANNALPAT FAIHRALKTE INVESRIGAT FGKVYCGVVG GVRRHEFAVM GAPVNLAARL MSSKVNNGIL VDEAVKEQAG ANFAFRSLPP VQAKGYDKPV PILEPLHAVN VGKKKKVSYP FVGRRTEKEK ILSITSAMYS GIPSSAMIFL MGESGTGKSS LSTVVIDEIK KRCVISSKTI VSARSTSTET EQRIPLRDLC EHDGTCEDWC GESPGFDNRS HQSVSSKSSF AGIPLTSLKA LYAMRGSSRS GLKNRDFATT SQGEGDYRAS IMRSRGLREI SDSEEASLES SVRSESSEQS PKRVSQSLRI LGEALANGTT RPTKKDFLAP NLNVSLHRAT SRPTKGDFLP SNLSASVHNS MDASRHKSKR EPLSNSLHRI TSEEPKLSSK TRLGRDDTSI ASGENSTHKH GVAVPYFEKL CWVCEKLDYP YEYADLVGSQ FLSLDGASPV THVDGHVPTM DELVEFLALA FICITEYADI SVILIDDFQW ADSFSWKIFR ELCQRGNKLL LICAMRSHDK QALRRLSTAV TQRSQLHSKM TEISLTSLDT DDIRELMAHV LGHKEELIPE SLCTDVFQRT GGLPVFVIQV LENIKRAKTV ELGEDGRLQW NAAGLKEKRA IGSNKAGAVM EETFLSRFDG LDVQVRKVLQ TCAVLGLTFS FADVVQVHPE MEEADIEYAM CSAVDEMVLV EQNDEDEERT LISAASNGDF DDSISNFLAA SRASTSGKTH DERFFQFSHA MWRKSVLTTM LKERKIEIHR LIAEAMETDK VLILEESDIT RLLTLFDHWK SCGDFSKSAP LALVVGARLE EWDLSAQSLE LYEDALSMAF DSVETHEDME AINDEWVQVS ARPIVLDLIL RLHIRIGLCH QRLGDESESI ATFEDAFNIM NTSSKFPGIS RSLMMPIISS LCVLKLEHDV QDSEGKEEQE QLLQQFLCEA TINGNPVHIG RVLALQAMYH AKTGSLDRAL DDVHLLQKKY DIQENSFDMI TEYGRDFALE CFSESVQWIY LLEKHDAASD RADLIMDQYL TLIDPIDGES 
MMAILLPILN VLKLLNRATD ADWLLKRHII NPAHDQAFHC DFWVPLFNPL AYLLEVVVME ESEDFDEQVL QEMELWVLEE ENSKFDFDLE YKAHTLMGEL CWRLANFKEE NNPTRGPLID KARDFLTPVA QYPH
|