Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21873 |
Symbol | |
ID | 7202907 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 661654 |
End bp | 663039 |
Gene Length | 1386 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182112 |
Protein GI | 219123603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCGAAGGG TATCCACGAG ACGACACTCC TTGCCACTGT CGCCGTTTGC CTCGTCAGGG TCGTCCTCCA TTTACACGCA ACTATTCAGT TCACAAAAAC CAATGTCGTC TTTGTCCACC GCTTCCACAT CCTCCACAAC CACATTTACG GATACCGAAC TCACCGAGGT AAAGCGAGAT TTTCAATCGG CAACCGAATT GTATCGAAAG AATCTAGCCT CCACGACCAA ACTACCGGCA CTACAAAGTC AAATTGCCGA TTTGGAAACG GAGCAGTCCC AACCCGATTT TTGGGACGAA GCCAACACGA GTCGCGCCGC CATCGTCAAC GCTCAAGTCT CCACCGCCAC GAGACTCCTC ACCCGGATAC AAGCTTGGCA AGAGTGGCAC GGAGACGCCC AGGCGGCGCT CGAAATATTG TCCCAGTCGT GGACCGCATC CTCCGACGAC GCCACCGCCG TTACCAGCGG GGTGAACAAT TCTTTGAGAT TGGCTTCCGA GGAACGTGCC ATGCTCTTGG ACGAGTTCCG TTCCGCCATT GCGCGCTTAC GCGAAGACAG CGATCGATTC GAATTGGAAT TACTCCTGAG CGGCCCGTAC GATCACGCCC CGGCTCGTCT ACTCCTGACG GCCGGGGCCG GAGGTACTGA AGCCAACGAC TGGGTGGGCG ATCTGAAACG CATGTACCAA CGACACTGCG AGGCAATGGG ATTGTCGTGC GTCGTACAGG ATGAACAGGC CGGGGAAGCG GTGGGCTACA AGAGCGTCGA ACTGCTCGTA TCCGGTGACA ACGCCTACGG TTGGCTGCAG GGTGAAAAGG GGGCGCACCG CATGGTCCGC CTCAGCCCGT TTAACGCCAA CAACAAGCGG CAAACGACCT TTGCCGGAGT TGATGTGGCG CCGGATATTT TGAATCAAGA TGATGATGCT TACTGGAATA CGATTGATGT TCCCGAATCA GAGTTGGAAA TTACTACCAT GCGCGCCGGG GGCAAGGGTG GACAGAATGT GAACAAAGTC AACTCGGCAG TACGCATTAA GCATTTGCCG TCAGGTTTGC AGGTAAAATG TGCTCAGGAG CGGAGCCAAA GCATGAACAA AAATATTGCC TTGAAGCGTC TAAAAGCGCA ACTCTTGGCC ATTGTCCAAG AACAGCGCGT GGCCGAAATC AAGGAGATTC GAGGGGATAT GGTGGAAGCT TCGTGGGGTG CGCAGATCCG GAACTATGTT TTCCATCCTT ACAAAATGGT CAAAGACCAA AGGACGGGTT GGGAAACGTC CAACGTACAG GCCTTTATGG ATGGTGACCT CCTAGAAGAG TGCATTGGCT CTTTCCTACG ACATAAGGCT GAAGAACAGC GAAAGGAACA AATAGCTAAC GAGTAG
|
Protein sequence | MSSLSTASTS STTTFTDTEL TEVKRDFQSA TELYRKNLAS TTKLPALQSQ IADLETEQSQ PDFWDEANTS RAAIVNAQVS TATRLLTRIQ AWQEWHGDAQ AALEILSQLA SEERAMLLDE FRSAIARLRE DSDRFELELL LSGPYDHAPA RLLLTAGAGG TEANDWVGDL KRMYQRHCEA MGLSCVVQDE QAGEAVGYKS VELLVSGDNA YGWLQGEKGA HRMVRLSPFN ANNKRQTTFA GVDVAPDILN QDDDAYWNTI DVPESELEIT TMRAGGKGGQ NVNKVNSAVR IKHLPSGLQV KCAQERSQSM NKNIALKRLK AQLLAIVQEQ RVAEIKEIRG DMVEASWGAQ IRNYVFHPYK MVKDQRTGWE TSNVQAFMDG DLLEECIGSF LRHKAEEQRK EQIANE
|
| |