Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26980 |
Symbol | |
ID | 7200054 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 1011821 |
End bp | 1013918 |
Gene Length | 2098 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179336 |
Protein GI | 219117083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.24942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGTCTTTG CTGTTAAGAC GCACCAAATC GTGTACGCTT GTTGTTCTCT GACTGATTGA AGTACAATGA ATAACGCAGC GGCCTCAGGA CTGGTACGTG TTCCGCAGGT TGTCGGCTAG TCGTGTAGGT CGCGAACGTT CCGTTTGGTG CGGGTTTATC CGTTTCGTCG TTGAGTGCGT TGTGGGTTTG CGCCTTGATT TGCCTGATGC AAGTATTCCA AAAATAGAAT CTTCCGTGTT GCTGGCACAT GTCATCAAGC CCTGGAAAAA AATTTAACTC CAAAAGTGTA CCTTTGCAAT AGATTTGCAC AAACGTTCGC ACACACCCTC ATTGTTGAGA TTTTCATTGT CTGCAGTTTC TCGGGGGTAC CCGCGAATCG GGCGAAGACG TCCGCGTTGG GAACGTGACA GCCGCAATTG CGGTTGCCAA CGTTGTAAAG TCGAGTTTGG GACCCGTCGG TTTAGATAAA ATGCTCGTGG ACGACATTGG GGATGTTACG ATTACCAACG ATGGTGCGAC TATTTTGGCT CAGCTCGAAG TCGAGCATCC CGCGGCGCGC CTTTTGGTGG ATCTGGCACA GCTCCAGGAT AAAGAGGTTG GTGACGGAAC GACGTCGGTT GTCATTATCG CCGCGGAGCT TTTGCGAAGA GGAAACGATC TCGTGAAAAA TGGAATTCAT CCCACTACCA TTCTCTCGGG TTATCGTTCT GCGTTAAAGG CCGCGGTGGC TTACATCAAA AGCACTATGG TGGTACCGGT ATCCAAGCTG TCGGACGAGC ACCTTTTGCA AGCTGCTCGC ACCTCCATGT CCAGCAAACT CATCGGCAAG GAAGGAGACT TTTTTGCACA GCTTGCTGTC GATGCCGTCA AGAGTGTAGC TACGATAAGT CCCTCCGACG GCAAGGCCAA GTACCCCCTG TCGGCTATCC ACATCCTCAA GGCACACGGC AAGTCAAGCT TGGATTCACA CCTTATGCAA GGTGGTTTTG CCCTCCTGGG CACTCGAGCT TCCCAAGGTA TGCCCTCGAC GATTGATCCT GCTGACGGTG AATCCGATGT CAAAATTGCC ATGTTGGACA TGAACTTGCA GCGTCACCGC ATGGCAATGG GCGTACAGAT CCAAATCACG GATCCCAAAG AAGTCGAAAA CATCAAGAAA CGGGAGCTCG ACATTACCAA GGAAAAAATT CAAAAAATTC TGCAAACGGG AGCCAAGGTC GTCCTCACCA CCAAGGGTAT CGACGACACG TGTATGAAAT ACTTTGTCGA AGCAGGTGCT CTGTGCGCAA GGCGGTGCAA CAAGGAGGAC TTGAAGCGCT TGGCCAAGGC AACCGGCGGT AAGCTAGTTG TCACCCTGGC CGACATGGAG GGTGAAGAGT CCTTTGATGT GGACTCGCTT GGTAAATGTA CGTCGGCTGC TGAAGTCCGT GTAGGCGACG GCGAGATGCT ACATTTTTAT GGTTGCAAAG GGGCCGGGGC ATCCACGATA GTACTGCGAG GAGCGAACGA ATACATGCTA GATGAAATGG ATCGGGCCTT GCACGATGCG CTTTGCGTCG TTAAAAGAAT GCTCGAGTCG TCTACCCTGG TTCCAGGTGG TGGAGCTGTG GAAGCGGCCC TATCCGTATA TTTGGAGCAA TTTGCGGAAA CACTGGAAAC ACGAGAGCAA TTGGCTATCC AAGAGTTTGC CGACGCATTG TTGGTGATTC CCAAAACTCT CGCTGTCAAT GCGGCGAAAG ACAGCTCCGA ACTTGTCGCC AAGCTTCGGG CGGTTCATGC CAAGCACCAG AAAGCTGAGA ACCCCACGGA TACCGATTAT CAAAATTTTG GACTGGATCT AATAAATGGT GAAATTCGCA ACAATCTTTT GGCCGGTGTT GTGGAGCCCG CGATGTCAAA GATCAAATCC TTACGCTTTG CCACCGAAGC TGCGATAACG ATTCTACGTA TTGACGACCG CATCACAGTG TCGGAACAAG GATAACCTAA TGGCTAAAAA AGATGAAGAG ATCCAGCAGC GTATACATTT CCACCATTTT GCTAAATGGT TGTCCATTTC GACGGCACGG TCATAAAGTA GCGCTCCATT GCATTTGT
|
Protein sequence | MNNAAASGLF LGGTRESGED VRVGNVTAAI AVANVVKSSL GPVGLDKMLV DDIGDVTITN DGATILAQLE VEHPAARLLV DLAQLQDKEV GDGTTSVVII AAELLRRGND LVKNGIHPTT ILSGYRSALK AAVAYIKSTM VVPVSKLSDE HLLQAARTSM SSKLIGKEGD FFAQLAVDAV KSVATISPSD GKAKYPLSAI HILKAHGKSS LDSHLMQGGF ALLGTRASQG MPSTIDPADG ESDVKIAMLD MNLQRHRMAM GVQIQITDPK EVENIKKREL DITKEKIQKI LQTGAKVVLT TKGIDDTCMK YFVEAGALCA RRCNKEDLKR LAKATGGKLV VTLADMEGEE SFDVDSLGKC TSAAEVRVGD GEMLHFYGCK GAGASTIVLR GANEYMLDEM DRALHDALCV VKRMLESSTL VPGGGAVEAA LSVYLEQFAE TLETREQLAI QEFADALLVI PKTLAVNAAK DSSELVAKLR AVHAKHQKAE NPTDTDYQNF GLDLINGEIR NNLLAGVVEP AMSKIKSLRF ATEAAITILR IDDRITVSEQ G
|
| |