Gene Information |
Locus tag | PHATRDRAFT_48210 |
Symbol | CYCP1 |
ID | 7203333 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 522442 |
End bp | 525442 |
Gene Length | 3001 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182703 |
Protein GI | 219124841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
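The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which matches the stated gene length of 3001 bp) and that the gene sequence includes the stop codon. The function names `span_length` and `coding_length` are illustrative and are not part of the record or of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 522442, 525442
protein_aa = 935

gene_len = span_length(start_bp, end_bp)   # genomic span of the gene
cds_len = coding_length(protein_aa)        # coding bases, stop codon included
non_coding = gene_len - cds_len            # bases not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the genomic span is longer than the coding length, which is consistent with the record listing the gene as spliced: the difference would correspond to intronic sequence.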
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCG ACATTGCCAA CACCTGGCTC GCCATGGGCT TTCGTCGCTC CCGACTGAAT CGGCACTCGC AATCGGCGGC CAAGCTCTTA CGGGATGCGG AAAGTGTCGA CCAGACACAC ACTGCGCATC GGCGGGATCC TAACGGCCGC AATGCTGGCA TGGTTCTGGG TAGGAACCAC ACGGCCAGGG ACGATGAGGC CGGATCCGGC ATCGAGGGCT CGATGTCCTA CCACGATGTC GTATTCCGTG TCACTGCTAT CCCGTACGAC TCGAGCATCT CGCGTGTCGA CTTGGGACTC CGACCTTCCC GTGTACGGCG CCAGCGGACG CCGCGTCGCC CGTCCCCCGT TCGTATCAGA CTGAATCGTA CCCCGGCGCG GGAACGAAAA CGGGAATGGG CACCTACAAC GACCGAGGTC ATCGGAAGCC GCCGGCGGGT CCTACCAGCG AAACAGCGTA GGACATCGAT TAGGTCGCCG ACCCGTCGAA CGAAAGCTTG TTTCCGATCC AGAGAAACCG GAGTTGTATT GCGACAGCAG CATCCGCGAG ACTCACCAGT TCCTCCAGAT TACACCGGAT TCCCGGTCAA TAGTATGAAT TCGGCATGGC GGACGGCTAC GGATCCCAAC ACCGGACGGA CGTACTATTA TCACGTGGAA ACGAGGGAAA CGCAATGGCG GAAACCCATG GAATTGGCTA GTGAAACGGA ACGCCAGGAG ATGGAAGAAA AAGAAAGGCG ACAGCGTGAT TTCTTTGCGG CAATGGAAGC CAACATTCTA CGGAATCTGT CTCAAGAACA GCCTAACAGC AGCAGCGTTT TGTCGATTGT CGCTGAGAAC GAACGAGGGG AAACCAAACA TCCCGAAGGC TTGGCGGCTC CAATCCAACC TTCCAAACTG TCCCGAGGTG CGTTTGCCCA ACGGACTTCT TCCTTGCTAC GACCCAATCT TGTACGAACC ATTTCTACCA TGGACGAAGC CGTCATCACG GACCTCGTCA AACGCGTTCC TTCCCATCGT AGTATTCTAC ACAGTGATCC CGACGAAATC TCTTTTCTAC CCCACGACGT CATGGCAAAA AATGAGTCCT TTCGACTGGC ACGGAATTCC TCGCAAATGC TATCGATCGA CGAAGTTATG CAAGAGTCGC AATCGCACGG ATTTAATTTA GAACAATTGA ATTTGCAAGA TTCCGGAGTG GTGAAGCGGC GGCAATCCGA CGTGAGCTTG GGACCCGCCA ATGCGGTAAC CCTTGCCAAA GAACCTTCCT TTGGTACCAT ACTTGGGACG CTACCGGACG ATCACGATCC CAATTCGAGG CAGAGCAGTT TTTACGGAAG CTCGGCACTG GACGAATCTA GTTTCAACTT TGGGTTGTCA ACCGAAGAGA CCCTAGCTCT ACAAAAGCTT GCCGAGGTTT CCAATAGCAT GTCGCGTCTC AGTTTCGGAG CCTCGCGCGA CTTTTTGGGA GAAATTGGGG AGGAGGAAGA CGAAGACGAT TCCGAATCGA GCGAAATAGT CATGACCCGA GGCTCCGCAA TGGTGTTGCC GGAACGACGA CTTTCGTCGG AGGCCTGCCT TGCTCCCCGA AATCGTAACA ACCTCTCCAG TTTTCACACG GCACGTAGCC ATTTTGACGC CAGTCAATCC TCCATGCCCT CTCTAGCCGA AATGAACCCG CTGGCGTCTG GTCCGTACTC GTCGTCAACA CGCTTTCAAG ATAGCTTAAC GGCCTCACGC AACGAGAGGG AACGAGCACT TTTGGAAGGA GACGGTTCGG GACAGCAGTC ACCAACGGAA 
TGGAACGAAT CTACGGCGAC GAATTTAGAG TGGGATCCCA CCGTTGAAGA AAAAGAAGCG GAAGCGTCGC CCAACGCAGT CCGACCGAAA ATTACCCGTG CCATTAGTAA TAAAAAACCT GCCGAGCTTT TAGCATCGCG TCCGGGAATT GGTTCGCGAC GAAACACGTG CGGAACGCTC TACGTTGGGA GTACCATGTC GGATCCCGAC AAGGACGCCT CCATCAAGGT ACGTGGATTC CTGGCGACTT GATCACTTTT GTGTTGGTGA ATCTGGACCA TGTGCTGACC GATCGAGTAC TGTTGTGTGA TGTGTCGATA GTGCGTGTGT GGCGTACTTC GGGCTCATAT TTTGCAATCG GAACTGGAAG AGAATGCCGC GGCTGCTGCT GCGACTGACG AGTATCGAAT CTTTAACGAC CTCGAATCGC AGCAGAGATC ACTCAAGAAA AAGTTTCGAC CGAATGTTGA CTTTGTCGTG AAGCCGCCTC CGCCATCCCT CGAGGACATA AGCACGTTCT ACCGGGATGT CTTTACCCGG GCCCAAATGG AAACGGATTG TATTATTATG AGTCTAATTT ACGTGGAACG GTTGGTCAAA GTCACGGATG GAAAGCTTCG GCCACACCAG AGCAACTGGC GCTCCATTCT GTTTAGCTGC ATGGTGCTCT CCAGCAAAGT CTGGGACGAT ATGTCAATGG TACGTCGAGT CTTGCACGTT TGGTTGCTGT CTAGGTATGA ATGCTTTGGA ACGATTAGGT GGAGCTCACG TGGATTCAAT GCTTTGTTGT CATTGGCAGT GGAACGCCGA CTTTAGTCAG ACCTGCCCAG CGGGCATCGA ATTCACTTTA CAGCGGATCA ACGCTTTGGA GGTGGCCGTG CTGTCTGCGC TGTCGTACGA AGTGAAGGTA CCGGCTTCGG AATACGCCAA GTACTACTTT TTGCTACGAT CCATGATAAT CAAGAGCGGT TTGGGGGGCC AAGATTTGAT GAAAAATCCG CTCGACATTG AGGGCGCCCG GCGCTTACAG GCCGTCTCGG AGCGCTACCA AGTCGGTGTT TCCAAACCGG GGGGCCTCGC CAACTTTCGA TCGAAGAGTG TGGGTGCCAC CGTCGCGTTG GAAGGCAGCT CGAACAAGGT CACCGCAATC GAACAGCCCT CACAGAAGAA GATTGGCTTG GAGCACGTAA TGCGCATGTA A
|
Protein sequence | MTVDIANTWL AMGFRRSRLN RHSQSAAKLL RDAESVDQTH TAHRRDPNGR NAGMVLGRNH TARDDEAGSG IEGSMSYHDV VFRVTAIPYD SSISRVDLGL RPSRVRRQRT PRRPSPVRIR LNRTPARERK REWAPTTTEV IGSRRRVLPA KQRRTSIRSP TRRTKACFRS RETGVVLRQQ HPRDSPVPPD YTGFPVNSMN SAWRTATDPN TGRTYYYHVE TRETQWRKPM ELASETERQE MEEKERRQRD FFAAMEANIL RNLSQEQPNS SSVLSIVAEN ERGETKHPEG LAAPIQPSKL SRGAFAQRTS SLLRPNLVRT ISTMDEAVIT DLVKRVPSHR SILHSDPDEI SFLPHDVMAK NESFRLARNS SQMLSIDEVM QESQSHGFNL EQLNLQDSGV VKRRQSDVSL GPANAVTLAK EPSFGTILGT LPDDHDPNSR QSSFYGSSAL DESSFNFGLS TEETLALQKL AEVSNSMSRL SFGASRDFLG EIGEEEDEDD SESSEIVMTR GSAMVLPERR LSSEACLAPR NRNNLSSFHT ARSHFDASQS SMPSLAEMNP LASGPYSSST RFQDSLTASR NERERALLEG DGSGQQSPTE WNESTATNLE WDPTVEEKEA EASPNAVRPK ITRAISNKKP AELLASRPGI GSRRNTCGTL YVGSTMSDPD KDASIKCVCG VLRAHILQSE LEENAAAAAA TDEYRIFNDL ESQQRSLKKK FRPNVDFVVK PPPPSLEDIS TFYRDVFTRA QMETDCIIMS LIYVERLVKV TDGKLRPHQS NWRSILFSCM VLSSKVWDDM SMWNADFSQT CPAGIEFTLQ RINALEVAVL SALSYEVKVP ASEYAKYYFL LRSMIIKSGL GGQDLMKNPL DIEGARRLQA VSERYQVGVS KPGGLANFRS KSVGATVALE GSSNKVTAIE QPSQKKIGLE HVMRM