Gene Information |
Locus tag | PHATRDRAFT_50138 |
Symbol | |
ID | 7198842 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 162501 |
End bp | 167882 |
Gene Length | 5382 bp |
Protein Length | 1699 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185061 |
Protein GI | 219129785 |
COG category | |
COG ID | |
TIGRFAM ID | |
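The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below rests on two assumptions that the record does not state: the coordinates are 1-based and inclusive, and the coding sequence ends with a single stop codon. The variable names are illustrative only. Under those assumptions, whatever part of the gene span is not coding sequence is the total intron length implied by the "Is gene spliced | Yes" field.

```python
# Values copied from the Gene Information table above.
start_bp = 162501
end_bp = 167882
protein_length_aa = 1699

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Whatever remains of the gene span is non-coding (intron) sequence.
intron_length_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, intron_length_bp)
```

The first printed value agrees with the "Gene Length" field, so the start and end coordinates are consistent with an inclusive span.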
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGA CGCCCAAGCT GATGGTGGAC CCGTTCACCG AGGCCGTCGA ACGCATCGTG CTCCGCGTCA CCATGGTCGA AGGCCGAGAA GGCGACTACG CGTACAAAGG CACCGCAGTA GATGGACCCT TGCTGCCACA AGAACTTGCG GAATTATCCC TGCTCTGTAC ACGCCAAATG GCACTGGAGG GAGCAAAAAC TGATGCGCAA GGGTTTGCCG CCGTCGAAGT GGAACAGCTT GCTGCACTGG TTGCTCTACT CGATCAGCAT ATTAATTCCG CTATCGGAAT CCAGCTTCTC GCCAATGCGG TGAAATTGCT GGAAAGCGAG CTTTCCCCGA GTAAATCTTC GCAATTGATG GACCAGGTAC GTGTTTAGGA CATTGCCAAG GACACTTGGA TGCTTTGTAT TGCGCACCAT AAGCTACAAC TAACAGTAAT TCCATATTCT CCCGATCGTC AACAGTGGCT CCAAAGGGGT GGAGCCGGGT CCACGCAACT CCAGGTGCTT CGCATGGGAC TCCAAGCTGC CTGCGTGGTG CTTTTGATTG CCGTATGCTC AGGGGTCGAC CGACGAGCTG TCAACGAAGA TGCAATAGAG GCGGCAATTA CCCTTTATCG CTCCCACCTT TCCAAACACA TTGTTCCCGC CTGGAACCAG ACCGGGCACA TGCTACACAT CAAGTCGTTG GATGAGAAGG CAGGGCTTGC TTCTCCATCA AAGAAACGAC GGCTAAGCAC CGACGAGAAC TACTCCTCGG CTAGTAGCGG CCTTGTCAAA GATCTCAAGA AGGTATACAA GCACATTGCT TGTACTGTTC GGCTTCAGTT GACCTTAACG GAACGTTTGG AACTACTCAT TCGAAAAGTG CCCCTAGACG ATCAGCAGAT TCTCATGCTG ACTAATGGAT CGCTGATAGG TCTCGAAATT GATTGCGGGG CGCCGAAACT CGGTACCTTT GCCAAAGACT CGCCTCCACC GCCACAACAG CTTCAGCTCG CTTGCGTCAA TCTTGTCTCA AGTGCTTTCC GTACTTACCC GTCCCATCGG GACACTATAC TGGAAGACGT GTTTCCACTC ATGCTACGAC TTCCCACAGG GAAGAAGTCT CTCCGTGCTT TGGGCGTCAA CTACGGTTCG GCTTCGTCAC CGACAACTCT TGCTGCGCTC AACGCCACGC TGGTTGATGT CACCAACACA CAGCCAAACA TTCAGACAAT CACTTATTTG ATTCTCTCAT GCATCCAAGC CTGTGTTGTT CGACCGGTGC TGGACACCAG CGACGATTCT CCACAAGGAC AGATCGTCTC GGGCTTGAAG GCGTGTCGTG CCGTTAGCGA AGTCTTTGTT TCTCAGCTAC TTAAGAGATG TTCCCGTACC AAAGACGGTG CACTGGAATA TCGACCAATT CTCGCCAATA TAGTCGAAGA CTTGTTGCAA CTTACGCTGA TACCCCAATT TCCGGCAGCG GAGCTCGTGC TGGTTTGCAT TTCTCAGCGT TTGAATCAGG AGCTGTCGGC AGCGTCTTCA TCTGCAAAAA CTAAGAGTCA TTTTCCTCCC GAGCCGACTT TTCTGACGAT TGCTTTCGAT ATCCTAGGGA AGGTTATGGC GTACCAAGCC CGTGTACTAG CCACCAGTAG ATCCAAGCCT CTGGCGATAG CAACCACAAT TGAGAACGAG CCTACAATCC TTCAAAACGA AGTGCATGAA GTGCACCTAG CTTGTCATTG CGGAGATACC AGAGCTGATG CTCTGATGAT TCAATGCGAC CACTGCCGTA CCGCCTCGCA TTGCTCTTGC 
GTCGGTGTTG CCCCCGACGA TATTCCAGAG GAATGGTTTT GCGATGGTTG TCGTCTGGGA CGCATAGCCT TACGAGAACG ACGACAGTTG GGGGTTGAGG CAGCGACGGG GATCGTGGAC GAAGTGTTTA CGTACCGTCA TTCCTTTTTG TCCAGACTTT CGCACCGCAT AGGTGTGGCA GAACTGGTAG ATGCGACACA ATTTCACTTA TCGCGATGGG TGGATGAGAT AGAGCGCAAT AGTAGATCCA CCATTGACCA GCAAGCAACT TTTCGCCAAC TTGTTCAGGA GTTGTTAACG CGTTGGGAGG CACCCGTCGG AGCTCTTGGT CCAGGTACGG ACTCGCTGAC GGAAGAAGGC TCATCGCGAA TCACCTTGCA CCTTCTTGCC CGGACATCGC CACTGTGTTT ATCATTCCGC CATCAAGTTT CGCTTATTCT GAAACTCATG GCCGACGAGT CTGTACCGAT GCTACGCAAG CTGGCGGTGA AGACTATTGA AAAGGTATGA AGTCCTGTCT ATTGACGCAT GACTTGTTAA GTTTGACCTA GAATTCCTCA TATTACACAT CTGTTTCTGT TTTTTCCCTT CAGGTGGCTG ATGGTGACCC GCAGCTCATG CTTCTCCCAA TTGTTACGAA GGCTGTCTCA CGCAGACTCA CAGATGACAG CATATCAGTC CGCGAGGCCA CAGTCTCCCT CGTTGGTGCT TACGTAGTAC AGTTCCCAGC TGTCGCGAAT GCTTTTCATT CATCGTTGCT TGACTGTCTC GTGGACGTTG GGGTCAGCGT TCGCAAACGA GCCGTGAGAA TCTTTCAGGA GATTCTTATA TCGAACCCGC GCTATCGAGG TCGCTCGTCC GTTTGTGATT CCATGATTAC CCGAGCTATA GATCCAAAGG AGGAAGACGG AGTCCGAGAT CTGATTTTTG AACTTTTTAC CAAACTCTGG CTAGAGTACG ATGATGATGT CATCACCAGT CCTACAGTAC CTTTGCCCAA ACCTGCTTCG ATTCCAAGCT CTCCGGATGG AAATACCAGA GCACTCCTAT TAGAAGGGTT TGCACTAGGG AGCTCCGTCG TCACCCCCAC TCCGCCCGCA CTGAGCGAAA GGTACAAACA TTCTCGCTCA AGTCAGAAAC GGGTCGACAT CGCTGCTGAA CAAATGATGG AAGTGGTCAG AGCTAGTGGT TCTGGAGAAC GCTTGCATTC TCTAATTGTC GAGCTTCTTG CTGGTTCTAA CAATGCGCGA GCTTGCAAGT CTTCGGAACG CAAGCAGCGG TCGGCTCTCG ACCAAAAGCA GTGTTCTTGT TTAGTGGAAT CTCTGTTCGA ACTTTTGCTT CAAGTAGAAG AGCAGCGATC CATCCGAACC TCTCGTGTTG GGAAGGATGT TGCTGCTACC TTACAAACGA TTGCCGTCTT CGCAAATTTG GCTCCTAACT CAGTTTTCGA GCATTTCGAT ACAATAGGTC CCTATCTCAA GGCTGACAAC GGCGTTTCCT TTGACGACGA GTCCAAAATT GTCGGTGCAG TTTGTGATAT AATTGTGTGT CTGTCTCCAA ACCTGAAGTA CGAGAACATT CAGCAAATGG CCTCAGGGAC GCTGGCGAAG GATATCGTGC TGGTCATTTA CAAATTCGGT TCTTCAGCTC TGGGATCTGC TATCAAGGCT CTTTCGTCTT TGGGGCACCA TCCGGATGGT GACGAGAATA GCGTTTTCCG AAAAAAGCTT CTCGAGATGG CACGAACATT CTACTGCTAC CTTTTGCGGA AAGAAACGGT GGAAGACTTC TCAAATACCG ACGTACGTTC 
CATTTTAGAT TTGTAGCACA ATCACGTGGA TCAATCTCAC CGGTGCGTAT ATTTCTCTTG ACAGGACAAA ACTCGTAGCA ACACCCACCG GGCCTTAACA GTTCTTGGTT TGGTGTGTCG TTATCATGAA CGACCTTACG GGGTAACGGA AGAAGAAAGT GAAGTGGACG ACAGTGTGGC TGAAATTTCC TCTTCTGAAT TGACTTACGC AAACCTCATC GTGGGTTGCT ATAGAATTTT TTCAACGTAC TTGCAAACGC TAGATGCACC GACAAAGTGC TCAGCCCTTC GTGCACTCGG AGGCCTTTTT GTTTCACAGC CTAGGCTGAT GCTAGAATTG GAGCAAGTCG GTCTCATCGA GCACGTCATG TCGGAAGAGT CTCACATTAG TCTTCAGCTC GAATCACTTC AGTGCTGGAA GACAATTTTG TTGGTACGTT GGGTTGACTG ATTTCTGCTG CTTAGTGCTA TCATACTTAC AGTTTCGTTT TTCGTAGGCG GAAGAACGGC GTATCGACGG CGGAGTCGCC ACGGAGAAGC TGGAAAAGAA CGAAAGAGTT ACTTTGTCAA ACCGTATCTC CGGGGATCAA GATGGCGACG CCACGCTATT TGGAGGGGTG CTTACGAACC ACGCAGACCG GCTCTTTGAA ATGAGCCAAT CCAAGGATCG GCGTGTAAGG TATGCCGCGT TGGATCTAAT TGGGCTCCTA CTGCGACAAG GGCTCGTAAA TCCAAACGAG TGCATCCCTT TCTTGTTTGC GTTACAAGGC GACGTAGAGA ATGCCGCCAT CCGTAGTTTG GCCCTTCATT TGTTGATGAA AGAGGGAGAG CGCCGGCCTG ATGCCTTGCG TCAGCGCATA TGTGTTGGAG CGAAGCAAGC CTATGACTTC CAACGCCGTA TTTACTCTCA AAAGGACGCA GCGTCTGCAT TGATCACTGT CCGACGAGGC CGCGTACAGG GAACCGAATG TATTTTCGGC AGCGTTTTTA GAAATTGCAT TTCCAAGAGC CAAAAGCAAC GTCGTGGACT ATTCAAGAAC CTGCTTTCTT TCTTCGAAAC CGCAGAAGTG AATGTCGAGA CACCCTTTGC CAAAAATGTC CACCTCCTAA AGATTTCTGG AGGTGGGCAG GCAAACGGAA GCGACTTGTC CCTTCTATCT TTCACGTCAC AAATTCTTGC GTATCTGCCA TATGCCGCAG CCAGTGATCC TTTGTTCATC ATCCACCACA TTGGCTCAAT TGTCACAATT CAAGGGACCC AAATCGTCGA CGCATTTGCT GCTTTATTGC GCCCTGCTGG GTTAGCAAGC AACGATGAAT ATGATGAAGC TAACGTGACA GAAGATGCAC TCGAAAAAGT TGCCCGTAGC AAGTTTCCTA GCCGCACCCA GGAGGCGAGT GCTTTGTCCA AAATCGATCA ATTGAAATTC CTACATCTAT GCCGCAGGGG TGCAGCCATA GTCTTGCTAC TGCGCCTCAA GGCGCACCTT CGCCGCTCGT ACAATTTGAG CGAATTTCGA TGCCTTGAGT ACGACCCAAA TGCTAAGGAT CGCATAGCAG AGAAAGGAAT TTCAAAAGTC GACAATTCTC CGCCATTCGA TGCATCGGTG CCCGCCGATC TCATCCACTC CTCTGACAGC TTTATTCACT GGGACACTAT GATTCGAGAG TATGCCGAGT TCCGTCAACT AATGCGCAAA GAAAACAGCT TTGATATCCC CATGGGGGAC GTCTCGGATG AGCAGCGGAT ATTTGAAGAA GTCGTTGACT GA
|
Protein sequence | MSTTPKLMVD PFTEAVERIV LRVTMVEGRE GDYAYKGTAV DGPLLPQELA ELSLLCTRQM ALEGAKTDAQ GFAAVEVEQL AALVALLDQH INSAIGIQLL ANAVKLLESE LSPSKSSQLM DQWLQRGGAG STQLQVLRMG LQAACVVLLI AVCSGVDRRA VNEDAIEAAI TLYRSHLSKH IVPAWNQTGH MLHIKSLDEK AGLASPSKKR RLSTDENYSS ASSGLVKDLK KVYKHIACTV RLQLTLTERL ELLIRKVPLD DQQILMLTNG SLIGLEIDCG APKLGTFAKD SPPPPQQLQL ACVNLVSSAF RTYPSHRDTI LEDVFPLMLR LPTGKKSLRA LGVNYGSASS PTTLAALNAT LVDVTNTQPN IQTITYLILS CIQACVVRPV LDTSDDSPQG QIVSGLKACR AVSEVFVSQL LKRCSRTKDG ALEYRPILAN IVEDLLQLTL IPQFPAAELV LVCISQRLNQ ELSAASSSAK TKSHFPPEPT FLTIAFDILG KVMAYQARVL ATSRSKPLAI ATTIENEPTI LQNEVHEVHL ACHCGDTRAD ALMIQCDHCR TASHCSCVGV APDDIPEEWF CDGCRLGRIA LRERRQLGVE AATGIVDEVF TYRHSFLSRL SHRIGVAELV DATQFHLSRW VDEIERNSRS TIDQQATFRQ LVQELLTRWE APVGALGPGT DSLTEEGSSR ITLHLLARTS PLCLSFRHQV SLILKLMADE SVPMLRKLAV KTIEKNSSYY TSVSVFSLQV ADGDPQLMLL PIVTKAVSRR LTDDSISVRE ATVSLVGAYV VQFPAVANAF HSSLLDCLVD VGVSVRKRAV RIFQEILISN PRYRGRSSVC DSMITRAIDP KEEDGVRDLI FELFTKLWLE YDDDVITSPT VPLPKPASIP SSPDGNTRAL LLEGFALGSS VVTPTPPALS ERYKHSRSSQ KRVDIAAEQM MEVVRASGSG ERLHSLIVEL LAGSNNARAC KSSERKQRSA LDQKQCSCLV ESLFELLLQV EEQRSIRTSR VGKDVAATLQ TIAVFANLAP NSVFEHFDTI GPYLKADNGV SFDDESKIVG AVCDIIVCLS PNLKYENIQQ MASGTLAKDI VLVIYKFGSS ALGSAIKALS SLGHHPDGDE NSVFRKKLLE MARTFYCYLL RKETVEDFSN TDDKTRSNTH RALTVLGLVC RYHERPYGVT EEESEVDDSV AEISSSELTY ANLIVGCYRI FSTYLQTLDA PTKCSALRAL GGLFVSQPRL MLELEQVGLI EHVMSEESHI SLQLESLQCW KTILLAEERR IDGGVATEKL EKNERVTLSN RISGDQDGDA TLFGGVLTNH ADRLFEMSQS KDRRVRYAAL DLIGLLLRQG LVNPNECIPF LFALQGDVEN AAIRSLALHL LMKEGERRPD ALRQRICVGA KQAYDFQRRI YSQKDAASAL ITVRRGRVQG TECIFGSVFR NCISKSQKQR RGLFKNLLSF FETAEVNVET PFAKNVHLLK ISGGGQANGS DLSLLSFTSQ ILAYLPYAAA SDPLFIIHHI GSIVTIQGTQ IVDAFAALLR PAGLASNDEY DEANVTEDAL EKVARSKFPS RTQEASALSK IDQLKFLHLC RRGAAIVLLL RLKAHLRRSY NLSEFRCLEY DPNAKDRIAE KGISKVDNSP PFDASVPADL IHSSDSFIHW DTMIREYAEF RQLMRKENSF DIPMGDVSDE QRIFEEVVD
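The sequences above are displayed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names introduced here for illustration. It is demonstrated on the first three blocks of the gene sequence only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used as a small demo input.
demo = "ATGAGCACGA CGCCCAAGCT GATGGTGGAC"
print(len(clean_sequence(demo)), round(gc_percent(demo), 1))
```

Applied to the complete gene sequence, the same helpers could be used to compare the cleaned length against the "Gene Length" field and the computed percentage against the "GC content" field.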
|
| |