Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20740 |
Symbol | |
ID | 7201621 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 80273 |
End bp | 83218 |
Gene Length | 2946 bp |
Protein Length | 837 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180750 |
Protein GI | 219120004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.478562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAAAACAG TCCGTTTCAT TGCTCCCGTT TGCGCCTCCG CCAAAATGCC ACCGTCTTCC AAGGCTCGGC TTCCCACCCA AGAAGCGTTA CCGGCGGGAA CAACTCGCGT TTGCCTCAAG AATTTGCCCC CCTCGTTCTC GTCGCATGAT TTACGAACCT TTGTCCGCGA ACGGTTGTTG CCGCTCGATC CCCACGTGAA ACTTACGGAC TGTCGAGTGC TGCTCAAGAA GGACGGAAAG TCTAGACGCA TGGGATTTTG CGGGTTTGCT ACTCCCAGTA CGGCTCAAGT GTGTGTACGG CAACTGGACA AGGCCTATTG CCGGACCAAC AAACTCGTAG TGGAATTTGC TACGCTCCCC GCATCGTTGG CCACGACCAC TGCGAACGAT CCGACGCCTG CCGTTGAAAG TTTCAAACAG GAGAAAAGCT CCGAACCGAT AATAACGGAC AAAGATCGCA AATTGGAAAA GAAAAAAGAG GAATTCCTGG CGGTAATGGG TGTGGGTAGC AATGCCGAGT CCAAAAACAA ATTCTGGGCC AACGATGATG GTCATTCGGG CACAAATACA AGCGGCGATC AGATTGCAAC GAAAGCAACT ACTGGAAACC ACGATGATTC CGAATCCGAT TCCGACAGTG ATGACGCTAC GGATGAAGAC AATGCAGATC CTCTGGAACG CAAACTTCCA CTACCCGCTA CGGAAGAAAA GAGTTCGTCT GCTCAAACAT CCGATCTAGA CTTCTTGAAG TCCAAAAAAG TGCAAGTTCA GGATCTGGAC GATGCAGAAA CCGATAGAAT GAATGAAGGA CCACACGATG ACAGTGAATC TGGCTCGTCG TCCAGCAACA GTAGCGAAGA TTGCGATATT GTAACCCAGT CCAAAGAGGC CCCGAAAGGA CAGCCCCAAA TACAGGCAGG CTATACAACA GAAAACAACG ATAGTGTGGG CGATCATCAT TTATTGGCTG GCGAAGACGA CGCTGAGGCT AAGAACATAG CTGCAAATCG CTTGTTCCTA CGAAACCTGC CGTTCACAGC AACCGAAGAC GACTTGAAAA CTCATTTTGA AGCCTTTGGT AGTATAGTGG AATGCCACGT CCCCGCTGAC GATCAGAAAC GGAGCAAGGG CTTTGCATTT GTGACTTTTG TCAAAGCAAA CGATGCCATA GCCGCGAAAA CTGCTCTGGA CGGCACGGAC TTTCAAGGTC GTCTTTTGCA CGTCTTACCT GCTCGTCAGG CACCTTCTCT AGGAGACGGC AATGGCACCA ATCTCACGTT CAAAGAAAAA CAGGAACGAT TGCGGAAGCA ACAAGCTGAG TCGCAGACTG GATGGTCGGC CTCGTTCGTC CGTGGGGATG CTGTCGTGGA CAATTTGGCT TCGAGGCTAG GACTACAAAA AGGTGAAATT CTGGCCGTGA AGGATGGACT GTCGTCGGGT GATGCAGCTG TGCGCTTGGC TTTGGGGGAG ACAGCGGTCA TTGAAGAGAA CCGCGACTAC TTCCGATTAC ATAATATTGA CATGGATGTT CTTGTATCGG CCACATCCGA CAAAGACGCC AAGCTGGTTG AGCGAAGTAA GACAATGATA CTCGTAAAGA ACTTGCCTCA CGATACTACT AAAGAGGACC TTGTCAAGGT ATTCAGTGGA GCTGGGGATA CACCTTCTCG TATTCTCTTG CCCCCCTCTC GGACAATCGC GGTCGTGGAG TATTCTCACC CGAACGATGC CAAACGTTCT TTCCGGAAGC TGGCCTACCG AAGATTTAAG AATGTGCCTT TGTACTTGGA GTGGGCACCC CTGGCTTCCA AGCGAATCGA CAATGGATCC GAAGAAACGA ACGATGAGAA CATAATACAG ATAGAAAACT CAGAGGATGC CAACCGTGAG ACGGATGACT TGGTGGAAGG TCCCACGCCG ACAATCTACG TCAAGAATCT AAATTTCCAC ACAACCGAAG ATCAGCTCCG CCAAGTGTTT TCCAAGCATG TGAAAGATGT TCGCACTGTA CGTATACCCA AGAAGATTGC TCCCGTAAAG CGTTCTGGAG GCAAATTTGG TACCGAAAAC GAGACAACGA GAGAAATGTC GATGGGATTT GGTTTTGTTG AATTTGGTTC GAATGAGTCG GCGCGGACAG TTCTGAATAA GCTGCAAGGC TTCACGGTTG ATGGGCACAT ACTGGAACTC AAACCATCGT CCAAAACTGG CAATCAAGGA GTGTCATCCA CGGCGGCTAA GAACACTACT TCAAAGAGTA CAAAAGTAAT GGTTCGCAAC GTGCCGTTTC AAGCAACCCG AAAAGAGCTG CTTCAGCTTT TCGGCTCGTT CGGGCCTCTC AAAAAGGTTC GGTTGCCGAA AAAATTCGAC GGAAGCCATC GGGGATTCGC GTTCGTGGAA TACATGGCGG CGAAAGAGGC AGCTGCTGCT ATGCATACTT TATCCGCTAC TCATTTGTAC GGACGGCATT TGGTTTTGGA ATGGGCTGCT GCCGATGAAG AAGCCGAAAA CTTGGACATT CTGCGTGCCA AAGCCAAAAG GAATATTGGT CTTGACGCCT TGTCCGCGAG AATGGAAAAC AAGAGAATTC GTTTTGAGTA GTAAATTGAT GTTCAGTATA TTGATTCAGT CTGCTTGCTT CGCATCGAAG GGGGCGAATC AACCCACAAT GTAACAGTGC ATTTCCTATA ATCAATCGCA TCCCACGTTT CTTTCTCGGA TGGGATTGTA TACTGGATAA TATGAGATAT ACTGCCTCCA ACGAGAAATA TCATGGAAGG ATGAGATTTT TGGCTGGCAA GGTCAAAACT AAACGTATCC AATTCTCTCC TCTAGATACG AGGACGCCAC CACGACGAAA ATACGTGGTT GTATCGAGCC AAAGCGTAGT GGCAGCAATA ATATTCCTAT ATAAGTCCTT TGACGTGTAA AATCCATTTG TCAAAC
|
Protein sequence | MPPSSKARLP TQEALPAGTT RVCLKNLPPS FSSHDLRTFV RERLLPLDPH VKLTDCRVLL KKDGKSRRMG FCGFATPSTA QVCVRQLDKA YCRTNKLVVE FATLPASLAT TTANDPTPAV ESFKQEKSSE PIITDKDRKL EKKKEEFLAV MGVGSNAESK NKFWANDDGH SGTNTSGDQI ATKATTGNHD DSESDSDSDD ATDEDNADPL ERKLPLPATE EKSSSAQTSD LDFLKSKKVQ VQDLDDAETD RMNEGPHDDS ESGSSSSNSS EDCDIVTQSK EAPKGQPQIQ AGYTTENNDS VGDHHLLAGE DDAEAKNIAA NRLFLRNLPF TATEDDLKTH FEAFGSIVEC HVPADDQKRS KGFAFVTFVK ANDAIAAKTA LDGTDFQGRL LHVLPARQAP SLGDGNGTNL TFKEKQERLR KQQAESQTGW SASFVRGDAV VDNLASRLGL QKGEILAVKD GLSSGDAAVR LALGETAVIE ENRDYFRLHN IDMDVLVSAT SDKDAKLVER SKTMILVKNL PHDTTKEDLV KVFSGAGDTP SRILLPPSRT IAVVEYSHPN DAKRSFRKLA YRRFKNVPLY LEWAPLASKR IDNGSEETND ENIIQIENSE DANRETDDLV EGPTPTIYVK NLNFHTTEDQ LRQVFSKHVK DVRTVRIPKK IAPVKQMSMG FGFVEFGSNE SARTVLNKLQ GFTVDGHILE LKPSSKTGNQ GVSSTAAKNT TSKSTKVMVR NVPFQATRKE LLQLFGSFGP LKKVRLPKKF DGSHRGFAFV EYMAAKEAAA AMHTLSATHL YGRHLVLEWA AADEEAENLD ILRAKAKRNI GLDALSARME NKRIRFE
|
| |