Gene Information |
Locus tag | PHATRDRAFT_45697 |
Symbol | |
ID | 7200749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 965815 |
End bp | 969018 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179755 |
Protein GI | 219117940 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.920354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGGG ATCCACGCCA GCCACCGCAG CCGTCACCGG AATCGGCGCA TATCGGAGAC AAAAAACCCG AGACGGCACG ATCCATTCTA TGGAGTGCTG ACGATCATCC CGACGAGGAA CTGGAAGATG TGGATTTGCC CTTGCCAGTC GTCGATCATC CGTTGACCGT GCCGTTGTCG ACGAACGCGT CGCACCCCGT ACCGGCGGCG GATCACGAAC TGATATCCCG GATTCTGGTA CCGTCTCCGT TTCCTCATCA CCACGCGGGC CCCACCAGCG CTCGTTTCAG GAATGCCTTG AATCGGGTAC AGACTCAACC CACTGCGGAC GTCGAAGCCT GGCAAGCCCT CTTGACGGAA ACGCAAACGT GCTACAAACA GATTGTGGCC AACAACACAT TGACAAACTC CGTTACGCAC CCGTACACAA CCGACGCCGA GACGGATGTT GTTGCTCGCG TCAAACAACA ACAATTGGAT TGGGTGGAAT CCTGCTACGG TGCCGTCTTA CGCTACTTCC CCTACGCCAG TAGCCACGTA CACACCGTGG CAGAAATTCT GTGGACACTG TCGTCGCACG GCGTCGCGGA AGAACAGCAG TTACTTGTAT TCGACGGCAC CAACCATAGT CACCACAGTA CCATCAACGC CTCGTCTGTG TCTCCCCAAC GAACTCAACT GTACGAAGCC AAACTGGAAC GACTCTTGTC TCGCTATCTC GGTGTCACGT TCGACCATTC CTCGTCCGGT AGCAATTATC CCAACGGTCG GGACGCCGAT ACGCCACCGT CCCCACCCGA AAACACGGCA CTACCCGGAA TGTGCGATTG GATGGTAGAA TTGTGGTTGT TGTACGCTCG GAAAAAACGG CGTGACGCTC TACGGCACAG TAACCTCGCA CAACAACAAC AACTACCCGA CGCGCGTGTA TCTTACGTTC GGGACCAAAC GTTACAAGCC TACGAGCAGG CACAACCCTT TGTCGGACAC GGTGAAAATA ACGTCATCTT TTGGAAAGCA TATTTGGACT TTGTCCGTTC CTGGACGGCC ATGGCCAATG AGGACGCGAA AAACCATCAC GCGGTGGCGC AACAGCAAAT GGTGCGACTC CGGACCATTT ATCAGGCCTT GATCAAGTAT CCCATGACGG GATTGGATCA GCTGTGGCAA GAGTACGAAG CCTTTGAGCG GGGTCAGAAC GAAACACTCG CGCAGGCGTT GACGCAGGAA CTGTTGCCCA CGTACCAACA CGCCCGGACC GTGTATCTCG AACGACACCG TGTGTACGAT ACCAACGATC TACAACTAGG TCGACTCGCC ACGCCACCGG CGGACAATGC CGTCACTCAA GAAGAAGATT ACGAAACGAA ACGAGCCGAA GAGCAAGCAT TACTGCGTGC GTGGAAAGTG CGGGTTGCCT ACGAACGGAC CAATCCGGAA CGTTTGAATT CGAGTGAGTT TGCACGACGC GTCCGACAGG TCTACCAGGC GATGGTTTCG GTGCTGACGC GGTACCCAGA AGCTTGGCAC ATGTGGAGTA CCTGGGAATT GTCCGTGGCC ACCGGTACTA CCACGACGTC TGATGTCACT GCTGATGGAC GACACCACGA ATCAACCATC ACGTTGGCCC GTGCCGTCTT ACAACTCGGA CAAAGTCATA TTCCAGACTG CACATTACTC GCGCATACCG AAGCCATCCT AGTCGAACTT CATGCAGTGG ATCCCAAATC ATGCTTAAAT GTCATGGAAC GGTTTGTGGA TCGTAGCCCC AATACTCTCG GCTTTGTTCT ATATCAGCAA 
CTGACGCGAC GATATCAAGG TATGGAAGCT GCGAGAAAGG TGTTTGCACG TGCACGACGC GTATTGGTAA ATCCGGCCGA AGCCGCGGCC GCTGCAGCGA AACAAGATGT CCGGACTGAG GACGGGGTTG ATGCAGAGAA TCATCCACAC GACGAGGGAA GCGGCGGCAA ACGCTGGGTA GTGACAAATA GATTAGATCC TAACATTGGA CCGACAAATG GGCAACAGGT GCAAGGTGCT ACCGAAACGA CGACAGGACA AGAAGGGGTT GTAGACGGAA GCGAAAAACA TCCACCGGGT GTTATTACTT GGCACCTATA CGCCTCGCAC GCAAACATTG AGCACAGGGT CAACAAGGCA CCGGAAGTTG CAGCGCGAAT TTATGAGTTG GGTTTGCGGA AGCACGCGGC CTTTTTGACC GTACCGTCCT ACGTAATGCG ATACGCCCAA CTACTCCTGG AATTAAACGA TACAATGAAT CTACGGGCCT TGTTAACTCG CGCTGTCGCT GCTTGTGAGG CCCAAGAAAA GGAAAACTCC CTGGCGTTGC TCTGGAACAT GACTTTGCAT TTTGAGTCGG TCATGGGAGG GTCCGATCCA ACTAGTGCCG TAACGATGCA GAAAATAGAA CGACAACGTC GTGCAGCCCT GATGGGTGCT AACGTGGAAG AGGTAGCCAC TGGAGGGTTT GTTGGAATTA ATGAACCAGC CTTGATTGGT GCTCAAAAAT CCACTATTGC AGACCAGTTG GTGCGAACGG AAAGCTATGA TACGAGTTCC TCTATTGTAA ATGGAATGAA CCGTGCGGTG GATGTTTTGG AAATAATGGG GTTGTGGGGA AGCGGGGAGT CATCAGTGGA TCAAGCTCGT CGTCGGATCA AGCAAAGCAA AAACCGGGAA AGCGAAGTTG ATATTTCCGG TGGGAAGAGC GACACAAGTT TTCAAAAGCG ACTCGAGTAC CAAAACGCAG TATCGGCAGG GTTCTCACCA GAGGCAGGAA CGACTGATGG CACCGCCATA GGCAACAAGA TTATGTCAGC TCGTGAGCGC TATCAACAGG GAGCTATAGC GGTCGCCTCC GGTGGTGCGG TTGGCTCGAG TGCAATTATT ATGGCAATTC AGCAAATGCC AGATTGGCTG CGACCACTTC TCATGACATT GCCAGCGACA CGACTACGTG TTCCAGTTGT GCCAAAACCT CCGCCGCATA TGGTTGAAAT CGCTTTGGCC GCACTGAAGG CGAATTCGCT TCCTGCCGAG CGGCCTGAGG GAGAAGTATC TACGAGTGGC AGCAAGCGTA AATTGGCTGC GATTGACTCC TCGGACGAAG AGAGTGATGT GCAAGGCGGA GGGTATGGCA GCCAGTTTCG AAACAGACAG CGTGCTAGAA TGAACGCGTC ATAA
|
Protein sequence | MAGDPRQPPQ PSPESAHIGD KKPETARSIL WSADDHPDEE LEDVDLPLPV VDHPLTVPLS TNASHPVPAA DHELISRILV PSPFPHHHAG PTSARFRNAL NRVQTQPTAD VEAWQALLTE TQTCYKQIVA NNTLTNSVTH PYTTDAETDV VARVKQQQLD WVESCYGAVL RYFPYASSHV HTVAEILWTL SSHGVAEEQQ LLVFDGTNHS HHSTINASSV SPQRTQLYEA KLERLLSRYL GVTFDHSSSG SNYPNGRDAD TPPSPPENTA LPGMCDWMVE LWLLYARKKR RDALRHSNLA QQQQLPDARV SYVRDQTLQA YEQAQPFVGH GENNVIFWKA YLDFVRSWTA MANEDAKNHH AVAQQQMVRL RTIYQALIKY PMTGLDQLWQ EYEAFERGQN ETLAQALTQE LLPTYQHART VYLERHRVYD TNDLQLGRLA TPPADNAVTQ EEDYETKRAE EQALLRAWKV RVAYERTNPE RLNSSEFARR VRQVYQAMVS VLTRYPEAWH MWSTWELSVA TGTTTTSDVT ADGRHHESTI TLARAVLQLG QSHIPDCTLL AHTEAILVEL HAVDPKSCLN VMERFVDRSP NTLGFVLYQQ LTRRYQGMEA ARKVFARARR VLVNPAEAAA AAAKQDVRTE DGVDAENHPH DEGSGGKRWV VTNRLDPNIG PTNGQQVQGA TETTTGQEGV VDGSEKHPPG VITWHLYASH ANIEHRVNKA PEVAARIYEL GLRKHAAFLT VPSYVMRYAQ LLLELNDTMN LRALLTRAVA ACEAQEKENS LALLWNMTLH FESVMGGSDP TSAVTMQKIE RQRRAALMGA NVEEVATGGF VGINEPALIG AQKSTIADQL VRTESYDTSS SIVNGMNRAV DVLEIMGLWG SGESSVDQAR RRIKQSKNRE SEVDISGGKS DTSFQKRLEY QNAVSAGFSP EAGTTDGTAI GNKIMSARER YQQGAIAVAS GGAVGSSAII MAIQQMPDWL RPLLMTLPAT RLRVPVVPKP PPHMVEIALA ALKANSLPAE RPEGEVSTSG SKRKLAAIDS SDEESDVQGG GYGSQFRNRQ RARMNAS
|
| |
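The coordinate, length, and GC fields in this record are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used in this record)
    and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(965815, 969018)   # matches "Gene Length | 3204 bp"
length_aa = protein_length(length_bp)     # matches "Protein Length | 1067 aa"
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the listed GC content of 54%; that result is not computed here.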