Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_24186 |
Symbol | |
ID | 7199381 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 82249 |
End bp | 86154 |
Gene Length | 3906 bp |
Protein Length | 1088 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185483 |
Protein GI | 219130672 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCACCCAA CTAGTGTCAT TCCCCAATCC CAACGACTGG CCTCTGGAGT TATAGAGCAT CTTTGGGTCT CGCATTCAAA GGTTAACGTA GTCGGATTCT CCGATCCAAT CTCTCCTAGC TCTAGTACAC GTACACACAC ACACAAACGC GTGTAGAGAA AAAAAGGGTC TTTCCCTCCT CCTTGGTTGC ACCTTGGCGT CCCGACGACT CCCTGTCTGT CCCACGCTCG TTTCGTACAA ATCTACGTAG AACTCTACGT AGAGGCAAAT TCGCTCTCTC GACGACTGTG AGCGATTTTG CGAGACAACG AACCATCGTC TCCACACCCG TTATCTGGAT TGGACTCGTG CGCGCTTGTT CCTCCACGTT CGTGAGAGTG CCTCACAACT CGACAATAGG ACGTTGCAGT TCGTTCGATT GTACCGGAAG GAGACACCAT GACCGACTTT GCCGCCTTGA CGCAAAGCCT GTTGCAGCCG GAAACCTTTG ACGTTACCGC ACTCGATCGA GTCGTTACAG CAGCGTACAG TTCCGGTGAT CCGCATCAAG CTCTGGCCAA CCAGACCCTC ATGCAGTTAC AAGAAGTCGA CGGGCTCTGG ACCAAAGCGG ACGCCATTAT TGAACAAGCG CAGAATGCAC AAGCACGATT CTTTGGACTC CAGGTACTCG ACAATGCCAT ACAAACGCGG TGGAAAATTC TGCCATCCGA ACAGAGAGAA GGCATTCGCA ATTACGTCGT GGGGAAAATC ATTCACATGA GTTCCGGGGA TGAATCGGTG TTGCAGAAAG AACGCGTCTT TGTTGGTAAG CTGAATCTGA CGCTCGTGGA GATCCTCAAA CAGGAATGGC CCCACAACTG GCCAAACTTT ATTACCGATC TCGTGGGAAG CTCCAAAACT TCGGAGGTCC TGTGCGAGAA CAATATGCAG ATTCTCAAAC TCCTGTCCGA GGAGGTCTTT GACTTTTCGC GCGATCAGAT GGTGACGGAA AAGGTCAAGC GCATGAAGGA ATCGTTGAAC GGGGAGTTCG CTCAGGTATA TCAGCTCTGC GAATTTGTTC TGGAACATTC CCAGCGGCCG TCACTCCTGC GAGTGACCTT GCAAACCTTG CAGAGGTTTC TTACCTGGAT TCCGCTCGGA TTTATTTTCC AAACGAACCT CATCGATACA CTGGTGCACA AGTTCTTGCC CGTGCAGGTG TTCCGCAACG ACGCACTCGA TTGTCTGACC GAAATTGGGA GTCTACGAGA TCTGGATCCG ACCTACGATC CCCGGTTTCG CAGTCTCTTT ACGTCCTTTC TTACACGACT GGCGGATATT TTCTCTCCCG AAACCGACCT ACAACCAGCC TACGAGAATG GCACCGAACA AGACTGCGAG TTTCTGCAAA AGCTGGCGCT GTTTCTCTCC GGCTTTTTTC AAGCGCATTT GAGGGTCCTG GAAGTCCCGG AGACTCATCA AGCACTCATT GCGGGTTTGT TGTACTTGGT ACGGATATCG GAAATTAAAG AAACCGAGAT TTTTCGTATT TGTCTCGAGG CCTGGTACAT GCTGGCGGAG GACTTGTACA AATCGGAGCA GATTCCGCAT CACGGAAACA TGACCCGCAG TATGCCCGTC TCTCCCATGG GTTTGCAGCT CAACGGCGGT GCTACCAATA ATGGCGGTAC GCAATCGCGC AAATTCCTGT ACGCTCCGGT ACTGAACGGG ATTCGGCAAG TGATGATCAC AAACATGGCC AAGCCGGAGG AAGTGCTAAT TGTCGAGGAC GAAAATGGGG ATATCGTTCG CGAGACAACC AAGGACACGG ACGTGATTGC TCAGTACAAG ACCATGAGGG ACGCTCTCGT GTACCTCACA CATTTGAACC CTGAAGATAC GGAGACGATC ATGCTATCAC GGCTTGCTGC GCAAGTGGAC GGGTCTGCTT GGAGCTGGAA CAACCTCAAT ACGCTCTGTT GGGCTATCGG ATCGATCTCT GGCGCCATGG CGGAGGACGA GGAGAAGCGC TTCCTCGTCA CTGTTATTAA GGATCTGCTT GGTTTGTGTG AAATGAAGCG TGGAAAAGAC AACAAGGCTG TTGTGGCATC CAATATAATG TACGTGGTTG GCCAATATCC ACGCTTCCTT AAGGCACACT GGAAGTTTCT GAAAACGGTA GCCAACAAGC AGTTCGAGTT TATGCACGAG ACACATCCCG GCGTTCAGGA CATGGCCTGT GACACCTTTT TGAAGATCAC CATCAAGTGC AAACGGAAAT TTGTCACACT GCAACCCGAC GAGTCGGTGC CGTTCGTCGT TGAACTAGTC GATTCGCTGG GATCGATCGT GTCCGACTTG GAGCCCCATC AAATACAGTC CGTTTACGAA TCCGTAGGTA CCATGTTATC GGACAAGGGC CCTTCGGTGA CGATCGATCG GAAAGCGGTA CTCGCCAAAC TGATGGAACA GCCCAACCAG ACTTGGCGGA TGATTATGGA ACACGCGGCA AAAAACGTGG AGACTTTGGT AGAGCCAAAC ACCATCAAAG AAATCGTAAA GATCCTCAAG ACCAACAACC GTGTGTGCGG TGCGGCCGGT TCGCTCTTTA CACATCAGCT ACAAACCTTC TTTTTAGACA TGCTCAACGT CTATAAGGTA TACTCGGAAC GAATATCCGC GGCCGTCGCA CAGCAGGGGG CCATTGCGAC TCAGATGAGT CTGGTTCGAC TAATGAGGAG TGCCAAGAAG GAAGTCTTGC GGCTGCTAAT TGTCTTTATC GACACGAGTG GACCGCCTGA GGCCGAGCCA AAGGCTGTTG CCGAGGGCTT CATTCCACCT GTCCTCGACC CCATACTGGG TGACTACAAG CGCAACATTG CGGGTGCACG CGATCCCGAG GTGCTCAAGT TGTTCGCCGT TGTTGTGGAG AAGCTACGCA ATAATGTAGT GAACGACGTC CCTCGCATCA TGGAGGCTGT CTTTGAAGTG ACACTGGAAA TGATAACCAA GAACTTTGAG GACTTTCCGG AACATCGAAT TCGATTCTTC GAGTTTCTCA AGGCAATCAA CGAGCATTGC TTCCCGGCCT TGTTCAGCAT ACCGCCCGAG CATCAAAAGT TGGTTGTTCA CTCTATTGTT TGGGCGATGA AGCACACGGA ACGAAATATC TCCGATACGG TGAGTCCGGT GGGACGGGTC GTGTGCGATA TAGATTTTGC TGTGTTGTTT GTGTAGGTAT TCTCACTCAC TTACTTTGCT TCTTTGTTGT CATAGGGCTT GGATATATTG AACGAACTGC TGGTTAATGT CGGTAAGGCT CCCAACGTTG CGCAAGCATT CTACCAGCAG TATCTGCTTT CGCTGATTCA GGATGTGTTT GCAGTTATGA CGGATCGGCT GCACAAGTCT GGATTCAAGA TGCACGCTAC GTTGCTGCGG CACATGTTCC ATCTCGTGCA AATGAACCAA GTGACGGTGC CCCTCTTCGA CCCATCCCAG CAGCCGGCAG GAACGACGAA TCCGTCCTTT TTGCGCGAGC ACATATCGAG CTTGCTGCTC ACTTCGTTTC CCAATCTGGC CCGATCACAA GTCGGCAAGT TTGTCGAGGG TATGCTTGAC GTCAAGATGG ATCTACTGAC GTTCAAGACG CATTTGCGAG ACTTTTTGAT CGAGCTGAAG GAGTTTAACG CGGAAGACAA CTCAGCCCTG TTTGCCGAAG AACAAGAGCA GGCTGCGAGA GAGCAGCAGC AAGCCATGAT GGCTGAGCGT AGTGCCGTAC CCGGGATGTT CAGTCCAGCC GAAATCGACA ATGATCTTTG AAAATGGGGT GATTCGGATG GCTACGGGGA AACGTCGCTT CTTTGTGATC GTTTGACGAG CTTATATTAG CCTTTGCTCT ATAATTTAGA CCTTTTTACG CTGTGG
|
Protein sequence | MTDFAALTQS LLQPETFDVT ALDRVVTAAY SSGDPHQALA NQTLMQLQEV DGLWTKADAI IEQAQNAQAR FFGLQVLDNA IQTRWKILPS EQREGIRNYV VGKIIHMSSG DESVLQKERV FVGKLNLTLV EILKQEWPHN WPNFITDLVG SSKTSEVLCE NNMQILKLLS EEVFDFSRDQ MVTEKVKRMK ESLNGEFAQV YQLCEFVLEH SQRPSLLRVT LQTLQRFLTW IPLGFIFQTN LIDTLVHKFL PVQVFRNDAL DCLTEIGSLR DLDPTYDPRF RSLFTSFLTR LADIFSPETD LQPAYENGTE QDCEFLQKLA LFLSGFFQAH LRVLEVPETH QALIAGLLYL VRISEIKETE IFRICLEAWY MLAEDLYKSE QIPHHGNMTR SMPVSPMGLQ LNGGATNNGG TQSRKFLYAP VLNGIRQVMI TNMAKPEEVL IVEDENGDIV RETTKDTDVI AQYKTMRDAL VYLTHLNPED TETIMLSRLA AQVDGSAWSW NNLNTLCWAI GSISGAMAED EEKRFLVTVI KDLLGLCEMK RGKDNKAVVA SNIMYVVGQY PRFLKAHWKF LKTVANKQFE FMHETHPGVQ DMACDTFLKI TIKCKRKFVT LQPDESVPFV VELVDSLGSI VSDLEPHQIQ SVYESVGTML SDKGPSVTID RKAVLAKLME QPNQTWRMIM EHAAKNVETL VEPNTIKEIV KILKTNNRVC GAAGSLFTHQ LQTFFLDMLN VYKVYSERIS AAVAQQGAIA TQMSLVRLMR SAKKEVLRLL IVFIDTSGPP EAEPKAVAEG FIPPVLDPIL GDYKRNIAGA RDPEVLKLFA VVVEKLRNNV VNDVPRIMEA VFEVTLEMIT KNFEDFPEHR IRFFEFLKAI NEHCFPALFS IPPEHQKLVV HSIVWAMKHT ERNISDTGLD ILNELLVNVG KAPNVAQAFY QQYLLSLIQD VFAVMTDRLH KSGFKMHATL LRHMFHLVQM NQVTVPLFDP SQQPAGTTNP SFLREHISSL LLTSFPNLAR SQVGKFVEGM LDVKMDLLTF KTHLRDFLIE LKEFNAEDNS ALFAEEQEQA AREQQQAMMA ERSAVPGMFS PAEIDNDL
|
| |