Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45319 |
Symbol | |
ID | 7199963 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 829056 |
End bp | 832318 |
Gene Length | 3263 bp |
Protein Length | 1013 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179302 |
Protein GI | 219117015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.678444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGTACTA TAGGATACTT CGTCCCGTTC TACCATAGCC TCGAGAGGAA GTTGAGTAGT GAATACCCTA TACCCGGTTG CGTCTACCGT CCTTGTTGCG CTTTTTCTAT CGCACAGCTT TGGCAATCTC TAGAGAATAT ATCTTCTGTT GCGCCATGTC CGACAACGAA GAGGAGCTCT ACGACGAGTT TGGGAATTAC ATCGGTCCCG ATCTGGATTC GTCCGACGAC GACGACGAGA ATGCGTTGGT GCCTCCCGGG ACTGTCGCTC CGGACGACGC TTCGGATGTG TCCGGTGACG ACCAAGACGA GAATGCTCTC GTGATGCGCG ACGACGAGAA CGTGATGACA ACAACGGCAG CAACCACGGC CGATCCAATG CACGCCATTG TCTTGCACGA AGACAAGGAA CACTACGCGT CGGCGGAACA AGTGTACGGT GACGACGTGC GCGTCGCCGT CTTAGATGAA GACGCCATGG AACTCGAAAC ACCCATCGTG GAACCAGTAC TCACGAAATC GCATCACGCG GATAGCGACG ATCGGGACAA ACAAAACGTC TTTGCGCCCG AGGATTGGTT GTACACGGAA GACTATCTCG GCGTGCAATT GTCTAACGAA ACTACCCGCA CTCGGCGGTT AGTGGCCATC GTCGGACACT TCCATCACGG AAAAACCTCA CTGGTGGACT TGCTGCTGGA ATCTACCTAT CGAGTCAAGA AAAATCACAA GAATGCTGTC GTCGACGAAA GTCGCCAAGC GAATACACAG GCAGGACCGC GGTATCTCGA TACGCTGCTG GCCGAACAAG CCCGTCAAAT GAGTCTCGTC AGTACGCCCT TGACGACACT TTTGCCGGAT ACACGCGGAA AGACCTTCGC CATTTCCATG CTGGACTGTC CCGGTCATGT GCAATTCCAT GACGAGTCGG TGGCGGCCTT GAAAGCGTCC GATGGTGCCG TGGTGGTAGT GGACGTTGTG GAGGGAATCA TGATGCATAC GGAAATGGTA GTCCGACAAG CCATATCCGA AGGACTTTCT CTGACCTTGG TCTTGAGTAA AATGGATCGT TTGATTGTGG AATTGAAGTT GCCCCCACGG GATGCCTACT ACAAGTTGTT GCACATTGTG GATAGTCTCA ACGAATTAGT AGGCATGGTG TCGAGGGGAC GCTATCCAAA AATCTCACCC GAACGAGGTA ACGTGGCCTT TTGTTCCGCA CAACACGGCT ACTTGTTTAC GTTACCCAGT TTTGCGCAAG TGTACATGGA ACACTTTGAT CGTTTGGGAG ACAACATTGC CGTCGATGGC TTTGCTCAGC GTCTCTGGGG AGATGCCTAC CTGGATCCGG AAACGCGGAC GTTTCACCGG TCGTCACGCG ACTGCTTGAC TCCGAACGTG GAACGCACAT TCTGCGTGTA CGTACTGGAG CCACTCTACA AGATATACAG CGCCTGTCTG GGGGAGCGCG AACCCGACGT CAATGCCCTT TTGCGCGGTG TCGGCGTTCT CCTTCACAAG GACGAATTGC GAGCCAATTC GACGGTCCTT TTGAAAGCGG CATTGTCGCG GTTTTTGCAA ACCGCCAACC ACGGGTTTGT TGATATGCTC ACGCAACACG TACCCTGTCC GGCCGTGGCC GCTGCCGGAA AAATCGCTCG GTGCTACACT GGCCCCCTGC TCGACGATGA TGCGGACACT GCGGATTCGA AACAAAGGCT GGTGCAGGCC ATGCGCAACT GCGACCCCCA CGGACCACTG ATCATTCACG TCGTCAAACT GTACGCCTCC CGGGATGGGC AATCGTTTCA AGCGCTCGGA AGAGTTTATT CAGGAACGGT TCGACCGGCA ACTCCCGTCA AGGTCCTCGG CGAGGCGTAC GTCCCGAATG TAGACGACGA AGATGTCGGT ACGGCGACGG TGGAGAACGT TGCGATACCT CGGGGACGCT TTCATACAAG CATAAGCCTG GTCAAGGCGG GGAACTGGGT TTTGCTGGAA GGCGTGGACG CCACCATTGC CAAGACAGCG ACCATCGTAG GTTTGGAGTG CCCCGAGAAT GTGCACATTT TTGCCCCGCT CAAATTTCCA CATACGGGAG GAGAGTCTGT GATGAAGCTT GCGATCGAAC CGCTAAATCC GGCAGAGTTG CCCAAAATGG TGGAAGGACT TCGGCGGGTC TCCAAAGCGT ATCCCATGGT TCAGACCAAA GTGGAGGAAA GCGGCGAGCA TGTTTTGCTG GGTACTGGAG AGCTGTATTT AGATTGCGTC ATGTACGATC TTCGTCAAGT GTATTCGGAC ATTGAAGTCA AAGTTGCGGA TCCGATTGTT TCGTTTCGAG AAACGGTGAT TGAAACGAGC AGTATCAAAT GTTTTGCGGA GACTACCAAC AAAAGAAACA AATTAACATT TCTTGCTGAG CCTTTGGATG ACGGTCTGGC GGAAAATCTA GAGGCGGGCA AGGTCAAGAC GCAGTGGGAC CAGAAGAAAC TGGGTCGCTT TTTTCAAGTA AATTACAATT GGGATTTGTT GTCGTCACGC TCAGTTTGGG CGTTTGGGGA CTCGCCGACA CACGGAACCA ACATACTCAT GGACGACACT TTACCTAGTG AGGTCGATAC ATCTCTTTTG AAAACATGCA AATCCAGTAT TGTCCAAGGC TTTCAATGGG CCACTCGGGA AGGGCCGCTT TGCGAGGAAC CCGTACGAGG TACGAAGATC AAAATCCTGG ATTGTGTCCT CGCCGATAAG GCTATCCATC GAGGTGGGGG TCAGGTCATT CCTACAGCTC GCAAAACTGT ACATTCTTCG TTGCTGACCG CCACGCCTCG ATTGATGGAA CCAGTGTATC GTTTACAGAT ACAATGTCCC GGTGCAATTG TTGATGCGAT TCAACCCCTG CTGACGCGTC GCAGAGGCCA CATGGTGCAA GATCGACCGG TTTCGGGCTC GACGCATTGC ATCGTCAAAG CTTACATACC GGTACTAGAC AGTTTTGGAT TCGAAACGGA TCTTCGTACC TTTACTCAAG GTCAGGCAAT GGTCTTCTCC GTTTTTGACC ACTGGTCGGT GGTGCCCGGC GATCCACTCG ACCGGAGCAT TATTTTGCAT CCGTTGGAGC CTAGTCCGGC GCAGCATTTG GCTCGAGAAC TGTTAATTAA GACCCGTCGA CGGAAAGGTC TGTCCGAAGA TGTTCCTGTG AGCAAGTTTT TCGACGAAAG CATGAAGGCG CAATTAGAGC AAGTGAATGC CGTGCTACAA TAA
|
Protein sequence | MSDNEEELYD EFGNYIGPDL DSSDDDDENA LVPPGTVAPD DASDVSGDDQ DENALVMRDD ENVMTTTAAT TADPMHAIVL HEDKEHYASA EQVYGDDVRV AVLDEDAMEL ETPIVEPVLT KSHHADSDDR DKQNTISACN LAIVGHFHHG KTSLVDLLLE STYRVKKNHK NAVVDESRQA NTQAGPRYLD TLLAEQARQM SLVSTPLTTL LPDTRGKTFA ISMLDCPGHV QFHDESVAAL KASDGAVVVV DVVEGIMMHT EMVVRQAISE GLSLTLVLSK MDRLIVELKL PPRDAYYKLL HIVDSLNELV GMVSRGRYPK ISPERGNVAF CSAQHGYLFT LPSFAQVYME HFDRLGDNIA VDGFAQRLWG DAYLDPETRT FHRSSRDCLT PNVERTFCVY VLEPLYKIYS ACLGEREPDV NALLRGVGVL LHKDELRANS TVLLKAALSR FLQTANHGFV DMLTQHVPCP AVAAAGKIAR CYTGPLLDDD ADTADSKQRL VQAMRNCDPH GPLIIHVVKL YASRDGQSFQ ALGRVYSGTV RPATPVKVLG EAYVPNVDDE DVGTATVENV AIPRGRFHTS ISLVKAGNWV LLEGVDATIA KTATIVGLEC PENVHIFAPL KFPHTGGESV MKLAIEPLNP AELPKMVEGL RRVSKAYPMV QTKVEESGEH VLLGTGELYL DCVMYDLRQV YSDIEVKVAD PIVSFRETVI ETSSIKCFAE TTNKRNKLTF LAEPLDDGLA ENLEAGKVKT QWDQKKLGRF FQVNYNWDLL SSRSVWAFGD SPTHGTNILM DDTLPSEVDT SLLKTCKSSI VQGFQWATRE GPLCEEPVRG TKIKILDCVL ADKAIHRGGG QVIPTARKTV HSSLLTATPR LMEPVYRLQI QCPGAIVDAI QPLLTRRRGH MVQDRPVSGS THCIVKAYIP VLDSFGFETD LRTFTQGQAM VFSVFDHWSV VPGDPLDRSI ILHPLEPSPA QHLARELLIK TRRRKGLSED VPVSKFFDES MKAQLEQVNA VLQ
|
| |