Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32379 |
Symbol | |
ID | 7196944 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2326359 |
End bp | 2332052 |
Gene Length | 5694 bp |
Protein Length | 1606 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176959 |
Protein GI | 219110415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.069954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTGG AAACAGAAAC GAGCCATATT CCAGAAAGCT CTATTTCGAA CGCAACCAAT GCAAAAGGGG TGACAGTGGA AAAGCAGGTT GTGAAAGAAG CAAAAGAGAA GACCATCCCA TCTTCCGGCG ACGGCATTAA TGACTCTATT ACAGCGGGAA AATCATGTTC GAGCCAAATT CGAACATGGT TTGAATCGCT CTCTGCTGAT GAACTATCAG CTGTGATGAG TTTCTCAGAC AAAGCGTTTT TGGAAACTTT TTTGGACCTT TCTTCATGGT CCGATTCAGA TCGGGTCCCA TACCTTCGTG AAAAGCACGG CAGCCCCTCC GACGTAGGTG AGTACACGAA TGGAACGGGT CCTTTTTTAG TAATTTCGTA GCATTCAAGG TTGAAGGATG GCAAGATTCC CATTGGTTGG CAGACAGACC TTGATCAGAT ATATCTCACA ATCAGCCGTT TCAGTTCAAC AACCGTCCCC TCACGTTTTA TATCCTTTTC ATGCGTTTTG ACTTTCTAGG GTCAAGACAG ATTCAATGGG AAAAGTTGGT TCCGTGGAAG GCTGTCGAAA CGTCCTTGAG TATGTGTAAG CTATGGGTAG AAGAGGTCGA AAATGATTCT GCTTTGAGCA TGCCCAGTGC CTCAAAAACC GAGACGGAAG GTCCTGCTGA TGCATCACTA TCTCTAGCGG AATCGGAAGG GACGGGGAGA GTTGCTGAAG AGATTGGCGA AATTTTTTTG GAAGAATCGT CTTCAATTTT GGAAGTTGAG GATGAATGCG TAGGCGGGGA GGGAATCATC GATATTCAGC CGTTTTTGTC TCTTTTGGAT CGAACCTGTG TGATATATTC ATCCGCGATC CGTTATGTTT CGGAGAGGGA ACGAGAGGGG AAGGAAAGTC CTTTTGTGAC TGTTCATCCT AGTCACTTCG AAGCACTACA AGGTAGTCAG CTTCTATCAG TTTTGGATAC AGCGGCATCG TCCATGTCTG AGCCATTCTT TTTGTCAAAA TCAGTATATG ATAAGGCACC ATGGCAACAT CGCTTTGGGA CCAAGAAAGC TTTAAAATTT CCTCTCTGGG GATTATTTCT TGCGCGTTTC GAAAAGTCTG TCTACAATGC ATTTTCAAAG CATACGCGAC AAAAGAGTTT TCATGCCAAC GAAGCTCTCA ACGCTGATTT CCCATTGCGC GATGTCTTCT CGCATGCTCG GTTGTTAGGA GAAAAGATTG TTGAAATTGA CGTGAAAACG CTGGAAAAAG TTATGTTGCC GTTACATTTA GTCTATTTTG GAGAGGATCG TCCAAGGTCT TGTACGTTGG AAAATCTCAT GTTTTTGCCA TTGAACTGGT TGGTTTACGC CAAGTCTACT TGTAGAAAGG TAAATTTTTT GTCCGGTGAC GTTGCAGTTC AATGTTGTCT CGAAAGAGTG AAGGAACACA GCGGCCTCGA AAATGTTTCT TTTGGAAATG TGGGTGCAGA ATTCGAAGTG ACCACACCGA GAGAGGGCGG CCAATCTCAA GGTACTGTAT TAAAGAAGAG ACGACGGCGG CGCGTCAACA GGAAGAAAAA TTCGAATGAA ACATCTATTG GTCCAATCGG AGGTGAAAAG AGGCTAACTA CAGTCGAAAC CGAGTGTGCA CCGGTCGAGA CTCCACTGAC AGAATTATCG ACATCGCTTG AAGCGGCGGT CGTCCCAAAA GGCCTCGTCT GTGAACTTTC AGTAGCAACA GACGAGATCT CAGCTCAAAG CGATAGATCG GCGGTAAAAC AAGCGGAACA AAGTAATGTT GATGATGGTA GCCGTGAAAT AACTATTTCT CGTGTTCAAG GCACGTATGT GGTCGCCAGT ACGCCGAACA TGGAAAATAT AGAACAAGTA AACTTGTCTG ACGATCGGAA GGGAGACGAT GGGGATTCTT GGGAAAAGGT TGAAGTCCGC GGGCGCGCGA ACCGCAAAAA AGCATTTGAG CGACCACATC ACGGTCAACT ACGGGGTAGC CATCGGGGAT GTGCTTCCCA CTCTATGCAC GGGCACATTG ACGGGTCGAA GAAGGCCAAG ATCTCCAGGT CAAGTGGCGC TCGGAGAAGA AACACGAACC GCAAAGTCGC TCGTGATATA CTTTATAAGG TTCTCGATTC CGTTGAGGAC CAGGTGAAAA ATAGAGGAGA AGAGAAACCA CCTCTACCCT CTAGTAATCC GTGGAAGACT GTTCATTCAG TAGGAAGGAC CCCGGGAGTG AGCACTCCTT GTTCCCAGAA AGGAGCCCAA AAACATGTGA CAATGAGAGA TGTGGTTCTT GGTAAGAGTG TCCGCAAGTC AACTATTAAA TCTTCTCTTT CTTCTTTTGA AGCTTCTCCA TCAGGAACAA ATGCATTGAA AGAATCGATG AAGGTGCAAA ATTCTTTGGT GTCTAGCCAG CAGCGAATGG TGGATCAGAA TACTGCCCCT ACTTATCAGG AGACTGTTTC AGCAGTCTCT GGCAATACCC ATGACCTGTT GCCTCAGAAA ACAAGCATCT TTAAAACGAA AAAGCACGAT CCCAAAAGCG ATTGTATCTC AGACAATAGC GAAGCTCCGC AAAATCGAGC TGGTGAAACG TCTCCATCGA ACGAAAAGAA TATCGCGCTA TCTCCACCGC TGCCAACTCT ACTCAATCCG GAATCCGGAA ATAGTTCGAC CTCCTCGGTT TCATCGAGTT TGGAAGTTCC ACATGCAAGT CATTGCCACC ATCATTCCAC TACCGCTGTC GATGTCAATG ATGTGGGCTA TCATCTTTTG GATGTTTGCG ACCGTCTTAG TCAAGATATG AGATTATTTA TGAGCCGCCG ATCACAGGCA CTAAATGCAC GGCGAAAGGA ACGGGTTGCA TTACTTGGTG CTCTGCAAAA CTCGGTTGCC AAACTCTGGC CCGCCAGTTG CCAAGTCGAG CTATACGGAA GCTGCGCAAC TCATCTAGAC TTGCCTTCCT CTGATCTTGA TGTCGTGGTT GTAGGATTAG ATCGTCGGAG GGGTACTTTG ATGCAAGTGG GTACTCAGAA CAATGGCTGT CTTGACGGTA AGATGGAATC AAAAAGTGAT TTTGATGTTG ACGACGTTCG TCAAAAGATT TTGAACGCAT CACTTCCTCC GTATATTCCT ACATCGACGT CGCTGAACGC GGAACGAATT AAGCGTCTTG CAGCCGAGCT TGCAACTCAA CCTTGGGCGG TCCAGGTGAA GGCGATTCCG ACAGCGTCTG TTCCTGTCGT CAAGGTTCTT GCTGATCCAT ATCGTCTCCA AAATGTTTCA GGAAATGATT GGAAGTTGGA TAAACACCCA AAGATTGCTG CTGAGACTTC CAAAGTCAGC CAAGTTTCAA GTGATTCCGC TAGACTTCAA AGTTTCCAAC CTTGGCGAGG AGCAGATGCA ATGAACGGAC TTCTCTCTTT CGACATCACT TTTGAAGGGC CGGAACATGG TGGAATTGGC TCGACGGAAT TTTCGATTCG GACTGTGAAC GAAGCTTGCC GTGAAACGGG TCTCCCACCG GAGGGTACAC CATTTGTTCA AGTGATAATG GTGCTAAAGG AGCTTCTTGC CCAGCGCAAA CTAAACGAGC CGTATTCCGG GGGCCTCAGT AGCTATGCAC TCTTGCTTCT TGTTGTGTCT CTTCTCCGTG AACGTACCGT TATTCGGGAG GAGCTGGATC GAGTGGAGCA ACAAAGACAG GCGGTGGCTG CCGATACTAT GGATACCCAA TTTAGTCGGA TGGGTCACGA CAGAACGGCA CCTGCTTGCC CAAAATCTCA ACAAGTTACA AAGACGTCAA ATATACAGCG CATGAAGAAG GTTGCAGAGG GAGATATAGG AAAGCAGCAA GTGAGCTCGA CGTGGGCCTC GATTGCAAAA AGCGAAGCGA ACGACTCTGA TCTTGCTACG AATAAAGTTT CTGGCATCTG GAACTCGCGG CAGAAGCGAC ATTCATCTTT TGCCGATGTG ATGCTGCGTC CAGCATCGCA ATGCAAAGCT TCCAATAGGT CCAGTACTCA TATGAACGAT GTTGCCCAAA ACTTTGATGC GCTTCATGAG GCCACGTCTG AATGTGAGAA AGATGGAACT GGCAACAGAG AACATATGCC TGATGTGCTG CCGGCAGATT CTTCACATGT TCCCACTGCT CCGTCCTTCT TTCAACAAGG CTATAACGAT GTTGTTGATG TACTGTGCTC CGGCAATACC ACTGCAGGCA AGCTGTTGAT GCATTTCTTA CTTTTTTACG GGCATCACTT CGAGGCGCAA AAAACGGCAA TAGACGTTTC GGGCACACAT GCACGAGATA TTACCGGACG GTACTTATCG TACTTTTCAC CATTTATTCC ACGTGGAGCA CTGGGATCCA TTGATCCTAT GACAGGCATG TTAACGGTGG ATCCTGTTAT AGTCTACGAT CCTTTGGAAG GTGCTGAAAA CAATAATGTC GCCCGACGGT GTTTCGCTTG GAACAGCATT CGATGGATTT TTGCTCAATC ATACGCAACT CTGTCGAGTG CTGTTGAGCG AAGCACCAAC CCACCGACAT GTCCTAATAA TGGTGCAACG CCAACACCGG TCGCAGATAT GACGGTATTG GCTGATTCGT CGCGTGGCTT TATTGATAGA GCTGAAGGTA GAACCGAGGG AACTGTGGAC GTAGACTTGA TGGATCCATC GTCCCCACTT TTGCGCTGCT TGCTATCGTT TTGAATGGGA TGGGCTCAAT TGACATATTG ATTGAACCAC TTATGCGTAT GGATGTAAAG GAAATTCGCT AATTTGGCCA TTACATATCA CAAGCACAGG GGCGTGCCTC AACGCCACAA GCATAAATAG CCCAAAATGG GTTGTTTTTA ACGCTTTTAA ACTTTAAAAA GCGGTAATTT CCGGTAGGCG TTAAAATTAC GAAGTGAGCT ATTTTCGGAA TCAAATCCAC TTAAGGATTA GAACTCAGTC CCGGATTTAG TGAAGGTTTC TGAATCCTTC ACTAAATTAG TGAAGGATTT ACAGTTAGTG AAGGATTTGG TGAAGGATTT ACAATTAGAT TTTTGAAACG ACCGTGACTT GGGAGACGAA AAAATAGCCA GTAGTCCCTG TCTTTCCTTT ACTGTTAGCA GGCGCTCTTT AAATCTCCAA AAGGCGTCTA TTACAGTTTT ACTTGAACAA TAACCGCAAA GGCACAGGCT GTGTGAGCGT TGGAGTCTCG AAGCGACGTC GGGAAAAAAT TTCGGGTCGT TCAAGCACTG TGAGTTCAGC CTTGGCGAAG GGACGCGGAG GAGGATCTGC AGGAAGGGGT CGTGGCAAAA CAGGAGGGCA AGGACTTCAT ATGCAAAGGC ACAGTTTGTG TGAGCATTTG AGTATTGCCG TGATGTTGGG AATAATGTAG GGTCGTTCGA GCGCTACGAG TTCAGCCTCG GTGACGGGAC GTGGAGGAGG ATCTGCAGGA CAAGGTCGCG GCAAAACAGC AGGGCAAGGT CCCAAGCGCA AGGCCGATGC TTCCGTATCA ACGGTAACAG GAGCTTCCAC TCCTACCTAC CACACTGAGA CAGCAGCAAC GCAAACAAAA TGCTTCCTGA AAAGAATCGA CGGCACACAT TCGGGGAACG GACGTGTACG TTCCTACGGC GGCAGCAGTG TTCGTCGTAG CTTTCAATCA TTACAGATTT CCCAGCACGT TTAA
|
Protein sequence | MSVETETSHI PESSISNATN AKGVTVEKQV VKEAKEKTIP SSGDGINDSI TAGKSCSSQI RTWFESLSAD ELSAVMSFSD KAFLETFLDL SSWSDSDRVP YLREKHGSPS DVGSRQIQWE KLVPWKAVET SLSMCKLWVE EVENDSALSM PSASKTETEG PADASLSLAE SEGTGRVAEE IGEIFLEESS SILEVEDECV GGEGIIDIQP FLSLLDRTCV IYSSAIRYVS EREREGKESP FVTVHPSHFE ALQGSQLLSV LDTAASSMSE PFFLSKSVYD KAPWQHRFGT KKALKFPLWG LFLARFEKSV YNAFSKHTRQ KSFHANEALN ADFPLRDVFS HARLLGEKIV EIDVKTLEKV MLPLHLVYFG EDRPRSCTLE NLMFLPLNWL VYAKSTCRKV NFLSGDVAVQ CCLERVKEHS GLENVSFGNV GAEFEVTTPR EGGQSQGTVL KKRRRRRVNR KKNSNETSIG PIGGEKRLTT VETECAPVET PLTELSTSLE AAVVPKGLVC ELSVATDEIS AQSDRSAVKQ AEQSNVDDGS REITISRVQG TYVVASTPNM ENIEQVNLSD DRKGDDGDSW EKVEVRGRAN RKKAFERPHH GQLRGSHRGC ASHSMHGHID GSKKAKISRS SGARRRNTNR KVARDILYKV LDSVEDQVKN RGEEKPPLPS SNPWKTVHSV GRTPGVSTPC SQKGAQKHVT MRDVVLGKSV RKSTIKSSLS SFEASPSGTN ALKESMKVQN SLVSSQQRMV DQNTAPTYQE TVSAVSGNTH DLLPQKTSIF KTKKHDPKSD CISDNSEAPQ NRAGETSPSN EKNIALSPPL PTLLNPESGN SSTSSVSSSL EVPHASHCHH HSTTAVDVND VGYHLLDVCD RLSQDMRLFM SRRSQALNAR RKERVALLGA LQNSVAKLWP ASCQVELYGS CATHLDLPSS DLDVVVVGLD RRRGTLMQVG TQNNGCLDGK MESKSDFDVD DVRQKILNAS LPPYIPTSTS LNAERIKRLA AELATQPWAV QVKAIPTASV PVVKVLADPY RLQNVSGNDW KLDKHPKIAA ETSKVSQVSS DSARLQSFQP WRGADAMNGL LSFDITFEGP EHGGIGSTEF SIRTVNEACR ETGLPPEGTP FVQVIMVLKE LLAQRKLNEP YSGGLSSYAL LLLVVSLLRE RTVIREELDR VEQQRQAVAA DTMDTQFSRM GHDRTAPACP KSQQVTKTSN IQRMKKVAEG DIGKQQVSST WASIAKSEAN DSDLATNKVS GIWNSRQKRH SSFADVMLRP ASQCKASNRS STHMNDVAQN FDALHEATSE CEKDGTGNRE HMPDVLPADS SHVPTAPSFF QQGYNDVVDV LCSGNTTAGK LLMHFLLFYG HHFEAQKTAI DVSGTHARDI TGRYLSYFSP FIPRGALGSI DPMTGMLTVD PVIVYDPLEG AENNNVARRC FAWNSIRWIF AQSYATLSSA VERSTNPPTC PNNGATPTPV ADMTVLADSS RGFIDRAEGC VSVGVSKRRR EKISGRSSTG RSSATSSASV TGRGGGSAGQ GRGKTAGQGP KRKADASVST VTGASTPTYH TETAATQTKC FLKRIDGTHS GNGRVRSYGG SSVRRSFQSL QISQHV
|
| |