Gene Information |
Locus tag | PHATRDRAFT_48202 |
Symbol | CYCP4 |
ID | 7203521 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 499776 |
End bp | 501815 |
Gene Length | 2040 bp |
Protein Length | 585 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182699 |
Protein GI | 219124832 |
COG category | |
COG ID | |
TIGRFAM ID | |
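The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, which the record does not state explicitly. The helper names `span_length`, `min_coding_bases`, and `gc_fraction` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 499776
END_BP = 501815
PROTEIN_LENGTH_AA = 585


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_bases(protein_length: int) -> int:
    """Bases needed to encode a protein: three per residue plus one stop codon."""
    return protein_length * 3 + 3


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace (the 10-base grouping) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


gene_length = span_length(START_BP, END_BP)      # 2040, matches "Gene Length"
coding = min_coding_bases(PROTEIN_LENGTH_AA)     # 1758
print(gene_length, coding, gene_length - coding)
```

Under that assumption the span is 2040 bp, matching the Gene Length field. The 585 aa protein accounts for at most 1758 of those bases, so the remainder would be non-coding sequence such as introns or untranslated regions, which is consistent with the gene being flagged as spliced. `gc_fraction` can be applied to the Gene sequence field to compare against the reported GC content.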
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.145009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCTTGTGCG GTTGAACCTC CTCACACACG GACCTACACT GTGCACCTTT CCAACCAAAG AGATCGACGG ACTCCCGCAA GGCCTTTTCT TTGTCCCGTT GGTCTCTAGC GATCATCCTG TACGGAGAAG AATAGAAAAT AGCTCAGTAG GATTCCGTAC CCATTCGCAG TATGGCATCG CGGCGGGACG AACGTTCGTC CCTCCCACTG CTCGATCAGC ATCGCAACGA CTGTTTAAAG CTACTGCTCG GCGATTTGTG CTTTTGGGAA GACCCTCGTG GCGAATTCCG AGGTGAAGAC CGGATGGTAT ACGCCCACAA GCGGCAGCCC TGCAGTGATG CCGGAACAAG TCCGCTGCCG ACGGAGCCCC GGAGAATGGA TCTCCAACGT ACCCAGAGTG CGGAAATTAG CCCTCCACCG GCGGCACCGC ATTCCTGGAA ACCGGCCATT GATCCTCAAT CCGGACGTAC CTACTACTAC GATGCGGTAT CTCGAAAGTC TCAATGGGAA AAGGTACGTA CCGCTTTGTG ATTTGGCGCT GCGATATGTC GACATTGCGC GACGTGGACC TCCATTCTCA TAGATCCACA TCTACCGTCC TGTCTCACCA TCGTGTCTTC TCAGCCCGCC GAAATCCGTG CGGATGAAAA AAGGGCACGC CGCGAGCAAC GCCAACGTGA CAAACGGTTC TTTAAAGACA TGGAGGCGAA TGTGCGGGCA AGCTTGGCAC GGAACGAACT TATTCCTGGA ATATGCGCAC TCGACATTGC CAAAGAGGGA CCGCCGCTCG AACCGTTGCC GTTGCTACCG CCCCAATTGC GCGTCCGTAC AATATCGGCC ATGGATGAGC GTATGCTTAT TGAACTCAAC AGCCCGTCCA ACATGGACTG CGAAAATCCG GCAACCAATT ACAGTGTTGC TCGGACGACA ACCTCCAATC TAGGCCTTCC GAACGAAGGT CGACCCCCGC TGCCCCAACG TCCTTCTGTC GTGCTGAATC GAGAAAATTC GTCCGATATT TCGCGGGCAT TCGAATCAAT GGAGTCACTT ACACTATCTC CGGATGCAAG GGACGATTCG TTGCAAGGAG AACACTTGCT GGACGGTCCG ATTTTGGATG AAGCAGACGA TCCGGAAGGT CTGGCTGTGC CACAGCAAGT GTGTGCCGGT GCCACTGCTC ATGTGCGACG AAACACGGGT AACACTATCT ATTTACAAGC CACCATGACC AATCCAAATA TTCAAGCCAC CATACAGTGT GTTTGTGGTG TATATCGGGC ACACATTGTT TCGTCCACCA AACGAACCGA GCGTTCTCCA GTGGCCGTAC ACGCCATGCA AGTCAATATG GACGTCTTCC AAGATCTATC TTGTCGGGCC CATGACGACG CAGCATCGGT TCCAACCTTG ACGGAACTGG AAAGTTTTTA CCAGGACTTT TTCAAGCGAT CGCAAATGGA ACACGATACG ATCATTATGA GTTTGATTTA CGTAGAACGA CTCATCAAAG ATACGAACGG ACACCTGTGT CCGTCGTCGA CTAATTGGAA GTCGGTGTTA TTTAGTTGCA TGATATTGGC GAGCAAAGTA TGGGATGATC TGTCCATGTG GAATATTGAT TTTTCCAACG TGAGCGCCGC TTCTGGTCTT TCTTCGTTCT CACTTCAACG AATCAACGAC TTGGAGATTG CCGTCCTGCA CTGCTTGAAC TTTAACGTGC GAGTCCCAGC GTCCGAATAC GCCAAGTACT ACTTTTTGAT CCGCACCATG CTCATACGTA GTGGTTTGCT GGAAGACAGC 
CAACTTCCGC TGGGCAGGGA CTCTGCCGAA GTTTTGGAAC GTCGTACAAA TCTGTATCAA GATTCTAAGC TCCACTTGCA TCGTGGGCAG AACAGGCGTG CACGGAGTGT TGACTGGAAT TGGATTCAGC AGACGGATTC ACTGAACCCC ACCACCAATG AATTTGCGAC CGTCGGGCCC GTTCTAAAAG ACCAGATTTG CTTGGAGCAA ATCGTCTCCA TGGATCGCAC GACTTTTTAG
|
Protein sequence | MASRRDERSS LPLLDQHRND CLKLLLGDLC FWEDPRGEFR GEDRMVYAHK RQPCSDAGTS PLPTEPRRMD LQRTQSAEIS PPPAAPHSWK PAIDPQSGRT YYYDAVSRKS QWEKPAEIRA DEKRARREQR QRDKRFFKDM EANVRASLAR NELIPGICAL DIAKEGPPLE PLPLLPPQLR VRTISAMDER MLIELNSPSN MDCENPATNY SVARTTTSNL GLPNEGRPPL PQRPSVVLNR ENSSDISRAF ESMESLTLSP DARDDSLQGE HLLDGPILDE ADDPEGLAVP QQVCAGATAH VRRNTGNTIY LQATMTNPNI QATIQCVCGV YRAHIVSSTK RTERSPVAVH AMQVNMDVFQ DLSCRAHDDA ASVPTLTELE SFYQDFFKRS QMEHDTIIMS LIYVERLIKD TNGHLCPSST NWKSVLFSCM ILASKVWDDL SMWNIDFSNV SAASGLSSFS LQRINDLEIA VLHCLNFNVR VPASEYAKYY FLIRTMLIRS GLLEDSQLPL GRDSAEVLER RTNLYQDSKL HLHRGQNRRA RSVDWNWIQQ TDSLNPTTNE FATVGPVLKD QICLEQIVSM DRTTF