Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50885 |
Symbol | |
ID | 7200500 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 70759 |
End bp | 72923 |
Gene Length | 2165 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179573 |
Protein GI | 219117560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGATGCGATG AGTATTGTGG AACGACACAT CCATCACTCC TCGTGGCGTG ATGACAACAG TGAACGAACA AGAAGCCGCG GACCTTCCGC TGTAAATAAA GCAGCACAAG CCGGACTTTC CTTCGAGTCG TTGATTCTGG TTGTCAACCA AAAGTCTGCG GAAAACTAGA CTAGCTTGTT CTTCACATTC CGCTGGTAGC ATGCCACTCT TTCGTTCATC GTCTCTTATC ATGGCTGCCA TTCTGAGTAG TTGCTGTTGG ATGCGGTCCT GTCGTGCGTT TGTTGGTCCT GCTTCTAGCA TTAGTACCAG TGCGGCGATC CAAAGACTGA CTACGCCTTT GTCTCGTCAC GGCCCGCTCT TTTCGACGGC GTCCAAGAAG GAGGCCGAGG CTCCCAGCGA TGTAGTCGTT GCCGAAGCCT TGACGTATTC CATGGAAGAC GTCGTCGGCT TGTGCAAACG GCGGGGAATC ATTTTCCCAT CGTCCGAAAT ATATAACGGT TACGCCGGAT TCTATGATTA CGGCCCGCTC GGCAGCGAAC TCAAAAAGAA CGTCAAGGAT GCCTGGTGGA AGAATTTCGT CATGATGCGC GAGGACGTCG TTGGTGTCGA CTCCTCCATT ATTCACAATC CGGAGACCTG GAAATCGAGT GGTACGAGAA CACTACAGAA AACGGAAAGG AATGTCTGCC TATTGTTGCG CGCCACTCAT GAATTTGTCT TGTCATGTTT CACGACTCGC GGTAGGCCAC GTCGACGGCT TTTCCGACCC CATGGTGGAT TGTAAGGAAA CCAAACTCCG GTACCGGGCC GACCAGCTAT TCTACGCCCC CGTCATGGTG AGTGGCGAAG CAGAAGTTCT CGGATACTTG TGTGTCCAGG AAGCTAATGA AGCGGACATG GCCAAGGAGG CGAAAAAACA AGCCAAGGCG CTGCTCAAAT CGTTGGACCG CAAAGGCGAA ACCGTCCAAG AGCCGTTCGA CTTTCGTGAA GTAGTGGAAG CGACCGAAGC CGAAATGGCC CAAATACCGT CTCCTGGCTC CGGTAAACCA ACACTCACCA TGCCACGAGC CTTCAACCTC ATGTTCCAAA CGCAAGTGGG GGCCTTGTCC GACGCGGCAT CCGTGGCTTA CTTGCGACCG GAAACGGCTC AAGGCATCTT TCTGAATTTC AAAAACGTAC TGACCACCTC GCGACAGAAA ATACCTTTTG GAATTGCCCA GATTGGTAAG GCCTTTCGCA ACGAAATCAC CCCCCGAAAT TTCATTTTTC GGTCACGCGA ATTTGAACAA ATGGAAGTCG AATACTTTAT CCCACCCGGC GATGAAGTTT GGCCGGCGTT TCATCAGCAA TGGATAGATG ATTCAAGGGC GTTCCTGCTT TCCATTGGAT TGCAGGAAGA ATTGCTCGGA TGGGATGTGC ACGAGGGGGA CAAGTTAGCG CATTATGCAC AAGCCTGTAC CGATATCACT TTTAGATTTC CGTTCGGTGA ACAAGAACTT ATGGGAATTG CCGCGCGTGG CAATTACGAT TTATCGCAAC ACTCGGAGGG ATCCGGCAAG AGTAAGTTAC AAAGCGTGGT CGTATTTATT GGCAGCCTGC TCCGGCGTAG GAACGCTCGA GGCGTTTCGG GCTTTCTTAA ACTCATATGC CATTTGCTTC AATTCTGTAG GTCTGGAATA CTACGACGAA CAGACCAAAG AAAAATATAT TCCGCATTGC ATTGAACCGT CGCTCGGTGT CGATCGTCTA ATGCTGGCCT TGATTTGCTC GGCGTACGCA GAAGACGAAG TAGGCGGAGA GAAACGTAGT TTGCTCAAGT TTGACCCAAA GATTGCACCC ATCAAGGTTG CCGTCTTGCC TTTGTTGAAA AACAAGGAAG AGCTAGTGTC GGTTGCCCGG GACTTATTCG ATAAACTTCG TCGTAGGTGG AACTGTCAAT ATGATGCTGC GGGTGCTATT GGACGGCGGT ACCGGCGAGC GGATGAAGTC GGTACACCTT ACTGCGTTAC GATTGACTTT GATACAATTG AAACGGATAA CGCCGTCACA ATTCGCGATA GGGATACGAC GGATCAAGTC CGAATTCCGC TTAAAGATGT GATTTCGTAC TTGAGCGAAC GTATCGACGG ATACTAAAGA TAAAAAACGC TTGTCTTATT GCTTC
|
Protein sequence | MPLFRSSSLI MAAILSSCCW MRSCRAFVGP ASSISTSAAI QRLTTPLSRH GPLFSTASKK EAEAPSDVVV AEALTYSMED VVGLCKRRGI IFPSSEIYNG YAGFYDYGPL GSELKKNVKD AWWKNFVMMR EDVVGVDSSI IHNPETWKSS GHVDGFSDPM VDCKETKLRY RADQLFYAPV MVSGEAEVLG YLCVQEANEA DMAKEAKKQA KALLKSLDRK GETVQEPFDF REVVEATEAE MAQIPSPGSG KPTLTMPRAF NLMFQTQVGA LSDAASVAYL RPETAQGIFL NFKNVLTTSR QKIPFGIAQI GKAFRNEITP RNFIFRSREF EQMEVEYFIP PGDEVWPAFH QQWIDDSRAF LLSIGLQEEL LGWDVHEGDK LAHYAQACTD ITFRFPFGEQ ELMGIAARGN YDLSQHSEGS GKTCSGVGTL EAFRAFLNSY AICFNSVGLE YYDEQTKEKY IPHCIEPSLG VDRLMLALIC SAYAEDEVGG EKRSLLKFDP KIAPIKVAVL PLLKNKEELV SVARDLFDKL RRRWNCQYDA AGAIGRRYRR ADEVGTPYCV TIDFDTIETD NAVTIRDRDT TDQVRIPLKD VISYLSERID GY
|
| |