Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45345 |
Symbol | |
ID | 7200034 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 909410 |
End bp | 911320 |
Gene Length | 1911 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179534 |
Protein GI | 219117479 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.824285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTTTTGTTC CACACACTCC AGTCCATCGT TGAAGGCCCC CACTTGTTGG TTTGGTACCT TCGCTCTATA CCCTTTTCAG TTCTGTGCTA TTGTTCACCA TTGCCTCGGT TTTGCAGACT CTTGGTTACG AGGCGAGGCG AGTTTACGAA CGCGGTGCAT CATGAAGGAT AGTAGATGTA CGAGGCTGGG CGCGCCGTTA CTCGTACTCT CGTTGTCGTC CTGGTCCGGG GTGGTGAAAG CCTGGACGAA TCATTTCGTC CACAATCCCC AACGTCACTC CTCGGCACGG AATCCTGGCG TAGCCTGGCG TGGAAGACCT TCGTATCGCG TCGGACCGTT GCGGGAACTC GTAGTGACGG ACCAATCGTT CCGACCGACA CGGGGAGTAC CACTCCACGC CACGGGGAGC CGTGATGGCC GTGCCACCCA AGCCGACATT GAATGGGACG AAGACGAATA CGATTTCGAT CAGCAAAAGA GTATGGACGA GTATCGGACC CAGTTTCAAG CACTCGCCGC CGAAACGTCC CAAAATCCCC ACGCCGTACA GCAAGCCCAG GATCTCTTTG ACGAGCTCTA CAAGGCGTAC ATCATGACGG AAGACGCCTC CTACTGGCCC GGGACCGATA TTTACAATCT TCTGCTCGAA ACACACGCGT ATTCTCCGCA CAAGAATGGT GCAGTGGAGG CAGAAGCCAT TGTTGCCCGT ATGGAACAAC AGGAACACGG GGTGGCCAGG CCCAACGTCG CAACCTACGC CAAACTTATG GAGGCGTGGA CGCAACGCAA ACGCTTGGAT AAGGTCACGG CCGTATGGGA ACGCATAGCC GAACAGGGAT TGCAACCCAA CATTTCTACT TACAACAAAC TCATCAAAGC CTACGGGGTT GCGGGCAAGG CGGAACAGGC TTTGCAGGTA CTGGAAGATT TGTTGCTGCA ACACCAAGGG AAAGACAACA ACAAAGAAGA GGACGACCAG ACGGATGCGG AGACCAGCAC AAAACCCACG CAGAAAACCT GGGTACAAGT GTTACGGGCC TTTGCCAGCA AAAAATACGT CCGGAATGGA GAGGGCGTGG ACCAAATACA AGCCTTGCTG CGCCGCATGG CACAGGCCTA CCGACAGGGC GAAGCAGACT GGAAACCAGG TGTGGATGCC TACAACTCCT TGCTCAAAGC CATGAGCTTT CAAAAGGGAT CCGGTAAAGA GTCGGAAAGC GTTCTGTACG GCATGCTGGA ACAGTTCCGG GAAGGAGAAG AAGCGTTGCG GCCGAACGCG GGTAGTTTTT ATCACGTGTT GCACGCGTAC CGGGGTGACA AGGATGCGGG TGTTTCCTTC AAGGTGGAAA AGTTGATACA GTTACAGGAA GCTTTGGCAG TCGAGCGCAA CGACCCAAAC GATCCAGCTC GAACGACAAC TCGCGTCTAC AACGCCGCCA TGGCGGCTCT ATCGCGGACC AAAGATCCGC AAAAGGCCGT CCGCGCCAAG AGGTTTATGG ACCGTATGAA TCTGCAGCAT AACGATCCAG ACATGCGTCC GAATGAAGCC ACATACACGA GTTTGCTGAA CGCGTGTGCG TACACAACCG AAGGTGAACC AGCCGACAAG CTGGCCGCGT TTCAAATTTC CGTGGACGCG TTGAAAGAAA TCCGCCAATC CCCATCCATT TCGACAAATT CAAAAATGTT TGGGTTGTTT TTGAGAGGAT GCGCCAATCT CATGCCGCAT AGTCGAAAAC GCGATGCGGT GGTAGAAAGT GTATTCGCAT CTTGCTGTGA CGAGGGCTAC ATTTCAGACT ACGTCCTTGA ACAGTTTGAA AGGGCGGCCT CGGAACAATT GCAGCTCAAA GTGTTGGGCG GCTTTCTTGT AGATGGCGTC GAAACTCCAG CCGCATGGAG ACAAAATGTA G
|
Protein sequence | MKDSRCTRLG APLLVLSLSS WSGVVKAWTN HFVHNPQRHS SARNPGVAWR GRPSYRVGPL RELVVTDQSF RPTRGVPLHA TGSRDGRATQ ADIEWDEDEY DFDQQKSMDE YRTQFQALAA ETSQNPHAVQ QAQDLFDELY KAYIMTEDAS YWPGTDIYNL LLETHAYSPH KNGAVEAEAI VARMEQQEHG VARPNVATYA KLMEAWTQRK RLDKVTAVWE RIAEQGLQPN ISTYNKLIKA YGVAGKAEQA LQVLEDLLLQ HQGKDNNKEE DDQTDAETST KPTQKTWVQV LRAFASKKYV RNGEGVDQIQ ALLRRMAQAY RQGEADWKPG VDAYNSLLKA MSFQKGSGKE SESVLYGMLE QFREGEEALR PNAGSFYHVL HAYRGDKDAG VSFKVEKLIQ LQEALAVERN DPNDPARTTT RVYNAAMAAL SRTKDPQKAV RAKRFMDRMN LQHNDPDMRP NEATYTSLLN ACAYTTEGEP ADKLAAFQIS VDALKEIRQS PSISTNSKMF GLFLRGCANL MPHSRKRDAV VESVFASCCD EGYISDYVLE QFERAASEQL QLKMASKLQP HGDKM
|
| |