Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42824 |
Symbol | |
ID | 7196428 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1241180 |
End bp | 1243129 |
Gene Length | 1950 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177246 |
Protein GI | 219110989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.138741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCACAGCGC TTGTTCTACT AGTAGAAATA TATTGGCTTA CCCGCAGTTT CTGACAGGCC GATACAAACT GTTTCGCGAA AAGCTTGAGA CGAGCACCGT AAAGTAACTC ATCAGCAGTC ATGGCATCTG TTTCCGGACC TTCTGTTTCT GGAAAGCTTA CCTTCCCTGG ACCGATAGGG GCGAAGCGTC GGCGAATGCG AAGTGGAGGT AATGCATCGA CAGTAACGAA CAACTCTCCA GGGAAAACCG ACGGCTCGGA AAGCGAGCCT AAAACTTCGG GTAAATGTAA AGGGGGTACT ACGATTCCAA TTTTTCTGAA AAGTAAGTAT CACGTGGCAA CCAAAAAATA TGATTGTATT CCGGCCAAGT ACGCTTCGCT CGCGGTTTTG TCCATGCAAC GAATGTGAAG TTTTGAATGA AATTCCATAT TACTTTTTAG ATTTTACCCG TGATTCGCTA CTCTGATTGT TACCAACCAT TCTAACGCAT CTTTCTTTTC CAGAGACGTA CAAGATGATC GACAGTTGCG ATCCATCGAT TGCGTCATGG TGAGTTCCAC TCGTAATTTC TTGTCTGTGC TCTGAGATAC AAATTAAGGT TTTTGTTCGA TTGTGTTCCA CGCATTGTCT ATCTAGTACG ATAACCTGTT GATTCTCTCG TTAAGGCAAA TGGTGAAGTT AGCCTTAGTG TACTTTCAAA TCCGCGAAAT TTTTGATCGG GAAAAATCTG TCTTGACCTC ACCATTTTAT AATTTGTACT TTGATCCATT AGGACAGAGG AGGGCGATAT GTTTATTGTT AAGGATCCTG ATGTTTTCGC AACGCAAGTG ATCCCGCAAT ACTTTGACCA CAACAAATTT TCAAGTTTTG CTCGCCAGCT CAACTTTTAT GGGTTCCGCA AAATGCAATC GAAACCAATC AGAAACAGCG ATTTTGACAC AGGTACCGCT AAGCACGTCA CTTTTTATAA CGAAAACTTT AAGCGTGGTC GATGTGATCT GCTGAAGAAG ATTCAGCGCT CGACTCGTGG AGGGGGGAAC ACGACGGGGC AAGACAGTCA TCGTGATGTC CAAAATCTTC GTGATCAAGT TGCAATGCTC GAACAAAAGA TGGACGAAAT GAGCAGTCAG GTAGAAGATC GCGTACGTCG CCTCGAACTG GAAATGTTGG CTCGCCTGGA GCAGATGATG CTTGCTATGC AGCAGCAACA GACCACACAG TTGCACCTTC AGACTGCGAC TTCAGTTGGT TCAAACAGCG GATCTGGTAG TTCCAACGGT ACTGGCAATC ATATGCCTGC GCCTTCCAGC AATCAATTGA GCTGGGATAA TAACGGGCTA TCCTTTCCTC GCGGAAACTC TATCAACTCC AATGTGAGCT CAGTAACCTT CCAGCAACCG CGGCAACAGC AGCAGCCACT CCAACAAATG ATTCATCAGT CACATCAGTT AAACCAGTTG GACAACACCG GAATGGCTCC TCCGACCTTA CCGCCTCATC CTAAGCAGAA GCAACTCCCA ATGAATGGAT TTCCTGGGAA CATGGCCACT CCTCCGGAAC GAATGAATTC TCTACGTGGA ATTTCCACCT TGTCGCGTGG TCTTTCCGGA TTGTCGCGTG GTGCGTCGAT TGAATCTAGT GCTTCTGCTG TATTGATGCG CAACTCTTGG GAGGATAAGT TTTTTTCGAT GCTCATGCTA GATAGCGAGC AAAATGGGAG TAGTTCGAAC CCTCACGACT CTAACGTGAT GCCAACTCCT CTTGCTCCCG GTATTTCGTC TAACGCAAAT GCACAAGCAC CTGTCACTGT TTCAGATAGA TCGGCTGATA TCAATGGACA GCTCTCGGCG CACCACGAGA ACAACGACGA TGATCTTAGT TCTGTATCGA CTTCAGACAT GCCATGAGGG GCAAGAAACT TGTTTACATA AACCATATGA CTTATTTTAC
|
Protein sequence | MASVSGPSVS GKLTFPGPIG AKRRRMRSGG NASTVTNNSP GKTDGSESEP KTSGKCKGGT TIPIFLKKTY KMIDSCDPSI ASWTEEGDMF IVKDPDVFAT QVIPQYFDHN KFSSFARQLN FYGFRKMQSK PIRNSDFDTG TAKHVTFYNE NFKRGRCDLL KKIQRSTRGG GNTTGQDSHR DVQNLRDQVA MLEQKMDEMS SQVEDRVRRL ELEMLARLEQ MMLAMQQQQT TQLHLQTATS VGSNSGSGSS NGTGNHMPAP SSNQLSWDNN GLSFPRGNSI NSNVSSVTFQ QPRQQQQPLQ QMIHQSHQLN QLDNTGMAPP TLPPHPKQKQ LPMNGFPGNM ATPPERMNSL RGISTLSRGL SGLSRGASIE SSASAVLMRN SWEDKFFSML MLDSEQNGSS SNPHDSNVMP TPLAPGISSN ANAQAPVTVS DRSADINGQL SAHHENNDDD LSSVSTSDMP
|
| |