Gene Information |
Locus tag | PHATRDRAFT_48922 |
Symbol | |
ID | 7195349 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 36395 |
End bp | 39615 |
Gene Length | 3221 bp |
Protein Length | 965 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183659 |
Protein GI | 219126846 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCAGA ATTCACCGAC AGTACCCAAA GTAATCACTG GTAAAATAAT GGACGAATCA CTCTATGAGC TGGTTCTTGA GCACTTGAAC AATTATCAAC AAGCTATTGT TGTGGACAAC CGCGGGGAGA ATGATGCGTT GCTCGATAGG CTGTCTCGTT CATTTGGGGA TAAGCTCAAT GAACTCGTCG TGACGGGATC GCTCGAGAAA ATGGGCTTCC GAGATGCTTC ATCCAGTGAC ATGATCCAGT CCGATAAGAA TCAGGATTCC GTGGAAAGCC AGTTGGGATG CATACTTGTA TCCTTCGTAG ACACGGTTGT CTCCACCGCA GAGCTTTCGG CAACAACGAA AGACTCCGTT TTCGCCTTGT TGGAGGCTAT TGCTTGCTTG GCTTCTACCT CTCTCGTTTC CTCACTTTCG GTGTTGACTC GTGCAATTGA GTTCTCCAAC GTTCTATTAG AACGCGTTAG AAACGTTGCT TGTCGACTGA TTGGGAATCT TGTCGGATTC TGGATGCAGA GTTCAGCGCA GCACCTTTTT CACGGTCTTC TCGACCAGGC GTCACAGGCT GTACTGCCTC GCTTTACGGA CAAGACCCAG TCAGTTCGGA ACGCGGCCAT TATCGCTGCT AAACAATTCT TCACCGGAAC CATTGATGAT GCCGATTTGC GGACTGCCTT GGTGTGGAGT GTCCAACACG ACCCGTCCGT TACTAATCGC CTGCAGGCTC TCGAAAGCCT GCCACTAAAC GGCCAAACAT TGGATATCAT TGTTGCCCGA ATTGCCGACG TGAAACCTAA AGTCCGAGTT GCAGCCTTAC ATAAGCTTTC TACCGTTTCC ACTTGGCAAT CACACGAACG AGCGGCGCTC GTGCGAGCTG GTCTTTCCAA GCGGTACGTG CTACGACAGA CACGCAAGCT GATCATGACG TACCTTGCTT ATGTACTAAT CACTTGACCT CCGCTACATC CTTTTGTCTT CATCAATCTA CAGCTGTACT GCCACGCGGG ACGCGACCGT CAAAATGGTT TGCCAGGCTT GGATGAAAGC CGTCAAATAC GAACCACTGG AACTTCTTCG TGGATTGGAC GTGGTGAATT TTGAAGAGGA GGCAGCTCAA GTTGTCAAAC TATTGCTAGA CGCGACCAAG GACCCTACAG CGTACCTGGA AGAGTTGTCG ATGAGTCCAC CCGAAGTACG TGCGTTTGTT GAGAATGTGA ACGCGTCGTC TTCCCTTGTG ATTGCAAATG CTGAACAAGT TTTGACGCCA GAGGCGTTAC TGTGGGCTCG TGTAGCTGTC CAACACACCA AAACATCGCA ATCAAACTCT CGAGCTGAGG CCATGCTTTC CCTCATCATC CCTGAAGTTC CCATTCTTTG CTCTGTTGTC GAGAACCACG CCGCCCAACT CATGACAGTG CTTAGTGAGC AGTACGAAGA TGAGGACGGC GGCTTGGTGG ATAGCCTCGT CACAATTTGC TTGCAGCTCT TACAGCTAGC GACGTGCGTT TCAAAGTCTT CGATGGAAGA AGGATCGCGT CGAGTCTTTA CTGCCGTCAT GAAACGCATG CTCAGTTCCG TCGTTACACC GGACGACCTT GTCGAAGGCT GCGTTCAAGC CCTACACTCC GTTTGCATCC TCGAAAAAGA TCTCGTCGAC GCAACCAGCG AAATAATTGC CGAACTCAGT CGTCTGAGCC AAGAGCACGC GGAGTTGCAA AAACAGCATC ACTTGCGCAT ACTTGCAATT CTCAGCCTAG TCTTGGAACG TGTGTCTCCC GGATTTGGTT CAAATCCCGA 
TGCTTTAACA TCCTGGGCGA CTCATATTAT CCCAGCTGTA ACCAGTGAGA ATCTATTGAT CCGTCAACTC GGCGTTTCTT GCTTTGGCAA GCTCGGTTTG TTTACCCCTG TCGACACAAT CTCGGAACAA TTCCTCCCAC TAATATTGCG CATGGCGTCA AACGAAGTGG AGACGGACGA AATACGCGCT CAAGCCTTGC TTGCCCTCTC GGATTGGGCT ACGCTATTCC CAGTTGTCTT AAAAACTCAA GAGTTAGACG GGAAGATGGT GTCTATTTCA GACGTTGTCC ATTATTGGCT CGATAATCTT CCGAAGGGGT CGGGCAATCA TACGTCTTTG GCTTTTATAG CTGCAGAGGT TGCAACGAAG CTCTTATGGT CGGGACGAAT TGTGGACAGC TCATGGTTGG CCAACCTTGT GGTCATATTC TTTGACCCCA ATCTTTCGAC TGGGGAAATG GAGGAAGAGT ACCACGAAGA GGAATCAAAG GAAATTGGGA GCCTCGTTCG TTGGCAACAG TTGCAGAGCG TGTTCTTTCC CGCATACGTT CGTCGTGGTC CTGTTTATCA AGAGGCTCTG TTGAATTCCA TTTCTCACAT CTTGCAGGTT TTCTCCTCTC GCTCGCAAAA AACAACACGT GGCAAATCGC TGCCCGTTGT AAAAATGATT GATTTCGTCT GCGCCTTGGT AGAGGAAGGA GTAGCCAAAG CAGACTCGAC AAAAAGTGCT ACGGAACTTG ACAATGCGAA AGACGAAGAA TGCATGGCAC TCACGTCTGT CGGACTTTCT TCCGCAGTTC AGATTGCCAC TTTTCTATAC GCTAGTCACG ACAATCTCAA TGCGGCGGGT CTACGAGCGC TTTGCAAATA CTTGGGCAAC ATCAAATTAG ATTTGAGACG AGTTTGTTCA ATAGAACTGG TGAAGCTCAA GGGGTCCGTC GAAGACCTGA CCATGGCAGT CTCTGACTCG ACGTGTCTTC GCGCGTTGGA CTTGCTTACG GAGAAACTGG CCGGTGTCGA GATCCCTGAT AGCGAAGAAA GCTCGGATGG CGAAGAATCC CTAACGGAAG CAATGGGCGA TCTGCAGGTA GGAAAAGAGA ACTCGATTCG GCAGGACAGT ACCGCATTGA AGGGGGATCC TGCAGCCGTA ATCCCCGTAA CTTCCAACAA TCGAGCTACC CTTTCTTACG TAAACTAAGA GTAAAGGGAA CTGTATCTGA AAGCACATTT AAGTCGTCTG TCAGAGTGAG TTGCTTTCGC CGCTGTCCAC GGAAGAACTA TAAAACACCA TCAATCTACC GGACCGAGTT TGAAAACAAG GTACATTGAC TGTGGATATG AGCAGACCGT CTTCTCGCTC GATTTGAGGC TGAATAAGGT CAAGTGCGCC TAACAGTAAA TTACTTCGAG AAGATGGCTA A
|
Protein sequence | MVQNSPTVPK VITGKIMDES LYELVLEHLN NYQQAIVVDN RGENDALLDR LSRSFGDKLN ELVVTGSLEK MGFRDASSSD MIQSDKNQDS VESQLGCILV SFVDTVVSTA ELSATTKDSV FALLEAIACL ASTSLVSSLS VLTRAIEFSN VLLERVRNVA CRLIGNLVGF WMQSSAQHLF HGLLDQASQA VLPRFTDKTQ SVRNAAIIAA KQFFTGTIDD ADLRTALVWS VQHDPSVTNR LQALESLPLN GQTLDIIVAR IADVKPKVRV AALHKLSTVS TWQSHERAAL VRAGLSKRCT ATRDATVKMV CQAWMKAVKY EPLELLRGLD VVNFEEEAAQ VVKLLLDATK DPTAYLEELS MSPPEVRAFV ENVNASSSLV IANAEQVLTP EALLWARVAV QHTKTSQSNS RAEAMLSLII PEVPILCSVV ENHAAQLMTV LSEQYEDEDG GLVDSLVTIC LQLLQLATCV SKSSMEEGSR RVFTAVMKRM LSSVVTPDDL VEGCVQALHS VCILEKDLVD ATSEIIAELS RLSQEHAELQ KQHHLRILAI LSLVLERVSP GFGSNPDALT SWATHIIPAV TSENLLIRQL GVSCFGKLGL FTPVDTISEQ FLPLILRMAS NEVETDEIRA QALLALSDWA TLFPVVLKTQ ELDGKMVSIS DVVHYWLDNL PKGSGNHTSL AFIAAEVATK LLWSGRIVDS SWLANLVVIF FDPNLSTGEM EEEYHEEESK EIGSLVRWQQ LQSVFFPAYV RRGPVYQEAL LNSISHILQV FSSRSQKTTR GKSLPVVKMI DFVCALVEEG VAKADSTKSA TELDNAKDEE CMALTSVGLS SAVQIATFLY ASHDNLNAAG LRALCKYLGN IKLDLRRVCS IELVKLKGSV EDLTMAVSDS TCLRALDLLT EKLAGVEIPD SEESSDGEES LTEAMGDLQV GKENSIRQDS TALKGDPAAV IPVTSNNRAT LSYVN
|