Gene Information |
Locus tag | PHATRDRAFT_12095 |
Symbol | Fru5_1 |
ID | 7200689 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 194726 |
End bp | 196806 |
Gene Length | 2081 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | frustulin 5 |
Protein accession | XP_002179600 |
Protein GI | 219117616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCTA GCGCTGGCGG TATCTCGACG GATGATGACG GCATCGATCA CGTACAGGCG GACGCTCTCG CCTCCGCCGG TACCATTTGT ACCTTTTCAG CCTCGCGCAA AGGCGTCGAC GCTCGGTCCC GAGATATTAA TGTTCAAAAC TTCACCCTTC AACACATGGG AGCCGTGCTG CTGGATGAAA CCGAGATTGT TCTTAATCAC GGCAATCGCT ACGGTTTGGT AGGACGCAAC GGATGTGGCA AGTCAACGCT TTTGCGAGCG CTGGGAGCCC GAGCCATTCC GATTCCGCGC GGTATTGATA TCTTCTTTTT GTCTGAAGAA GTGGAGCCAT CTGATACTAT GACGGCACTC GACGCGGTCA TGGCAGTTGA CGAAGAACGA TTGCGGCTCG AACAACAGGC CGACGAACTC AATCACTTGC TCGCGGCACT GGCTGACGCC AGTGTGAACG ACAGCGGTAA CAACGGTGTG AGTAGAGAGG ATAGCGAGGA CGATAAGACT CCGGAAGAGC AACAAGAAGA TGTCATGGAA GTGCTGAATG CGGTATATGA GCGTTTGGAC GCGCTGGACG CGAGTACTGC GGAGGTCCGT GCGCGTTCTA TTCTAAAGGG CTTGGGCTTT ACACACGAAA TGCAGTCCAA GTTGACCAAG GACTTTTCCG GAGGATGGCG TATGCGTGTT TCCTTAGCGC GGGCCTTGTT CATTCAACCG GTATGCCTGC TACTCGACGA GCCCACGAAT CACTTGGATA TGGAGGCTGT TATTTGGTTG GAGGATTATC TTTCGAAATG GAACCGTATT CTGTTGCTGG TTTCACATTC GCAAGATTTT CTCAACAACG TTTGTTCGCA CATGATTCAC TTTACGAATA GAAAGCGATT GCAATACTAC GATGGCAACT ACGACCAGTT CATCAAAACG AAGTCGGAAA AGGAAGAGAA TCAGCTCAAA CAGTTCAAAT GGGAACAGGA ACAGATGAAG TCCATGAAAG AGTACATTGC GCGATTTGGC CACGGAACGT CCAAAAACGC TAAACAGGCG CAGTCGAAAG AAAAAGTTTT GCAAAAGATG ATCCGTGGGG GTTTGACCGA CAAACCAGAA GAAGAGAAAC CGCTTAATTT CAAATTCACT GATCCGGGAC ATTTACCACC ACCTGTTTTG GCCTTTCACA ACGTTTCTTT CGGTTATCCA AATTGTGAAA AGCTCTACAC CAACGTTAAT TTTGGTGTGG ATTTGGATTC TCGAGTGGCC TTAGTCGGTC CGAACGGAGC TGGAAAAGTG CGTCTGGGTG GAAGCTGGTC CTGAGCATCG CTCGCTGATA TATTACGCCT ACTAATACGA TTGTGCTTCT TCCTTTCACA GACTACGCTA ATGAAACTCA TGTCCAGCGA ATTGCAGCCA TCCATGGGTG ACATTCGACC CCACGGACAT TTGAAGCTCG GTCGATTTAC GCAGCATTTT GTGGACGTTT TGGATTTAGA CATGACGCCG CTCGAGTTTT TCGAGAGCAA GTATCCCAAC GACCCGCGGG AAGAACAGCG CAAATATTTG GGTCGTTTTG GAGTGTCGGG GCCGATGCAG GTGCAGAAAA TGAGGGAACT GTCCGACGGA CAAAAGTCTC GCGTCGTCTT TGCCAAGGTA GGATCGGATA CGGCATGAGC GGGACGCCGA GCAGTTCGAC GAATTCGGGA TGACGCATAT CTCTCACCTT TACTTATTTA TTTGGCTCAG CTTGGTCGCG ACGTTCCGCA CATTTTGCTG CTCGATGAGC CGACGAACCA CTTGGATATG 
GTACGCTGTG GGAGCTTGGA ATTGGGCAAA GGCGATTCCA AACGCATTCC AGCAACACTC ACGGTCGGTC TATCGTTCTC GACATTTTAC AGGAAAGCAT TGACGCGTTG GCTAAGGCAG TGAATGAATT CCAGGGTGGT TTGGTATTGG TGTCGCACGA TATGCGTTTG ATTGGTCAGG TGGCCAAGGA GATTTGGATA TGCGACAACA AAACGATAGC GATTCACCGG GGAGACATTC AGTCGTTCAA AATGGACATG CGCGCTGCCA TGGGCATTGA T
|
Protein sequence | SRDINVQNFT LQHMGAVLLD ETEIVLNHGN RYGLVGRNGC GKSTLLRALG ARAIPIPRGI DIFFLSEEVE PSDTMTALDA VMADSEDDKT PEEQQEDVME VLNAVYERLD ALDASTAEVR ARSILKGLGF THEMQSKLTK DFSGGWRMRV SLARALFIQP VCLLLDEPTN HLDMEAVIWL EDYLSKWNRI LLLVSHSQDF LNNVCSHMIH FTNRKRLQYY DGNYDQFIKT KSEKEENQLK QFKWEQEQMK SMKEYIARFG HGTSKNAKQA QSKEKVLQKM IRGGLTDKPE EEKPLNFKFT DPGHLPPPVL AFHNVSFGYP NCEKLYTNVN FGVDLDSRVA LVGPNGAGKT TLMKLMSSEL QPSMGDIRPH GHLKLGRFTQ HFVDVLDLDM TPLEFFESKY PNDPREEQRK YLGRFGVSGP MQVQKMRELS DGQKSRVVFA KLGRDVPHIL LLDEPTNHLD MESIDALAKA VNEFQGGLVL VSHDMRLIGQ VAKEIWICDN KTIAIHRGDI QSFKMDMRAA M
|
| |
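The derived fields in this record can be cross-checked against the raw values. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates (which is consistent with the reported Gene Length of 2081 bp) and that the sequence is written in whitespace-separated blocks as shown above. The function names `gene_length` and `gc_fraction` are hypothetical helpers for illustration and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a sequence written in spaced blocks."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Start bp and End bp taken from the Gene Information fields.
    print(gene_length(194726, 196806))  # 2081, matching Gene Length
    # First two blocks of the gene sequence, for illustration only.
    print(gc_fraction("ATGCAGGCTA GCGCTGGCGG"))
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field. The record reports that field as a whole percentage, so only agreement after rounding should be expected.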