Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43226 |
Symbol | CYCP5 |
ID | 7196950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2382998 |
End bp | 2384994 |
Gene Length | 1997 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176969 |
Protein GI | 219110435 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTCCATTCC TTTGGTTGTC ACTCATTGCT GCACTACACA ACACTAACGA GCTCGGTTTC CTCTTTTGGT TGGCGGATTC GGTGCAAGCT CCAACTGTGT CTGGCCGACT GGATTGGACG GACAAGATCA GGTCCGCCAA GGAGAAGGCC GCTCCTTCCT CTGTATTCGT CGTTGCCTGT CTCCTCGTGT TTGTTCTCTA GAGTTTGCGA GACGGGTTTG CTCGTCCCAC TCGTCTTCTT ACTTGAACAA GACCAAAGTG GTAGACCGCA CCGTTGTCCC GTCACATACA CACAATGGGC CGGAGAGCCT GCGTGACTCC CTGTTTGGAA ACATCCTGTA CCGCGGATAC CCACACCAAC AATCCATCCA CACCGAAACT GAGCTCCGCG CCGGATACGC CCAATTCGCT AGTAGACCAA GACGTTTCCA TTCGCAGCGT CCTGCAGTTT GCACACCAAC ATACGCAACG CGCAGATTCA CAAGGAATGA AATTGCAAGC ACCCGCAGAA GGCAGCGGTC CACCGGAAGA CTCGATCGAA GATTCTTCCT TGCCCCGAGG CGACGTCGAA TCTCTTCGGG AATCGCAGAC GTCTGCGCCA TCCTTGTCCA TGCAATTCCC GGGGGCGCAG TTGTCTACTA CAGACAATGT GGGAATACCC ACCGATGGCG GCCGCGTGAG TGACTCTGTG TCGTCCCGAT GCATCCCATC GAGGGCTTCC CACACTCGGT ACGCCCATAC CAACGCGCCG CAGACCCCCG CCACGGTACC CGAAATACAA GCTCAGACTC CGCAACAGTT GTATCCCGAC ACTGCTGCTT TGATCCGCAC AAACAACGAA GCAGCTTTTC CGCACGACGT TTCCAATCTC AGTATCCGCG ATATATTCAA CGAAAATACG TCGAATCATC AAGACTACTA CGGCGTTGAC GACCCTACTG CTTCTTGGAC ATATTCCGAC GCCTGTGCTA TCCCGTATGG TACCGATCGC CACCACACGG AAAATGATAA CAGGGGGATG GCTACTCCAC CTTCAACCCT GCACACTCCG GTGGCGCCTC TATGCTCACC GGACGAAGAT TCGTCCACTG GTCTCCCGTT ACCTCCTCCC GTGATCACGC CCAGCACCCC CACAGCGGAA TCGGTCGGTG GAAGCGTAGT CACACTCGCG TCGTCAACCT CTTCGATGCA ACCACAGTCC GCACGCAGCA AACCGGTTTT GTACAGCGCG ATTGGTAGTT TGGAAGATCC CAACAGTACT TCCTTCGCTT CGTGGAGAGA CAAGAAATCG TGGCGAAATC GACCCAAGAT TGACTTTCCA TCCACGCACT TTTTAAAACC GGTCAACTCG GTAAATTGTG TCACTACGCA ATCTCATCGG CCAGTGTCGT CATCTGGTAT GGTGTACGGA CTCAAGCAGA TTCGACACGT GTACGAGATT CCTACCGAAG CGGAGATTTA TGAATTTGGT CACCAGCTTT TCAAAACGGT ACAGCTGTCT TCCGAATGTT CGATTGTATG TTTGATTTAC ATTGAACGTT TGATGGAGCT TGCCAAAGTG CCGCTGTTGG CCAGTACCTG GCGACCAATT TTCATGTGCG GCCTGCTACT AGCGTCAAAG GTCTGGCAGG ATTTGAGTTC GTGGAACATC GAATTTGCCA GCGTCTATCC TCAGTACTCC TTGAGCGCGA TCAATCGATT GGAGCATACC TTTTTGCGCA TGATCAAGTG GGAGCTTTAC ATTTCCAGCT CAAGTTACGC CAAGTACTAT TTTGCGCTGC GATCCTTAAC CGAAAAGTCG GACTTTCGAC AACGGTACAA TCGTATGGTG GGTGGTGTCA ATTGCGTGCA GGCGGCAGAA GCACGCAAGA TTGAGCAACG GAGTACGCAA GTGAAAGAAG AGGCCCTGCT ACAATTAAGC CGGAGCTGGA ATGGATGATG ACCAGAAGAA ATCGCTTATG TACAGTAAAA ACTTTAAAGT TACAAAACAA CAGCAAC
|
Protein sequence | MGRRACVTPC LETSCTADTH TNNPSTPKLS SAPDTPNSLV DQDVSIRSVL QFAHQHTQRA DSQGMKLQAP AEGSGPPEDS IEDSSLPRGD VESLRESQTS APSLSMQFPG AQLSTTDNVG IPTDGGRVSD SVSSRCIPSR ASHTRYAHTN APQTPATVPE IQAQTPQQLY PDTAALIRTN NEAAFPHDVS NLSIRDIFNE NTSNHQDYYG VDDPTASWTY SDACAIPYGT DRHHTENDNR GMATPPSTLH TPVAPLCSPD EDSSTGLPLP PPVITPSTPT AESVGGSVVT LASSTSSMQP QSARSKPVLY SAIGSLEDPN STSFASWRDK KSWRNRPKID FPSTHFLKPV NSVNCVTTQS HRPVSSSGMV YGLKQIRHVY EIPTEAEIYE FGHQLFKTVQ LSSECSIVCL IYIERLMELA KVPLLASTWR PIFMCGLLLA SKVWQDLSSW NIEFASVYPQ YSLSAINRLE HTFLRMIKWE LYISSSSYAK YYFALRSLTE KSDFRQRYNR MVGGVNCVQA AEARKIEQRS TQVKEEALLQ LSRSWNG
|
| |