Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34943 |
Symbol | |
ID | 7200146 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 718507 |
End bp | 722127 |
Gene Length | 3621 bp |
Protein Length | 1005 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179277 |
Protein GI | 219116965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00018875 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTTG AACAGCTCCC ACTTGGTAAG GAAAGTGTAA GTGTGAGCCA ATCCGATGGA CGCAGCGGTT ACATGTCCCA TATGGAGTTT GTGTCGGCCT TATTGCGCAA ATCTACCGTA TTTGTTGATC AAACTCCTGG TAAGGACAAC GTCTCACCAG AAGTTCAATC GCCTACTGCT AAAAGACAAG TCGCTTCCGT CCCACACACA ACGAATCACA GCCCCACAGT CCCTACGAAT ATCCAAGAGT ACAATATAGA CTCATAATCT TCGTAGTTCA TCGTTTTCCT CAACTGTGAT CTCCATCGTC GTTTTGTGTT CTTCAATAGA ACTACGAAGC AACCCCACCG TCTTTACGCC CCTAACCCTG CCCTCCCACC CATCCGCAAC CTCTTCGGTC CCGATGTCGA CCTCGGCTCA TTTCAAACTG AGCGACTTTC CTCACAAAGT CCTCGACCCG ATCGCCACCC TCACCGTCCC ACCGACCTAC GCAACCATTA AGCGTGCCCA ACGCCAGCTC ATGACTAACG CCGCCGCCAT TCCCACACTC AACGGAGGTG GCGCCCACGG CCATATGGCC TTGACCCTGA CCGCCCTTGC CTACGCCGAC ATCAGCGACG TCCCGTTTGT CATTCCCGTC GCCCCTCCGG CCAATCCGCC TCCCGGCGCC ACGCAACCGC AAATCACCGA AAACAACCGC ATTCATCAAC ACGACGCTGA CATCTACAAC CTTTATGTTG CCGTCAACAA CGCGCTTCGC CAGCAACTTC TCGACGCAGT CCCCCGCATT TATGTCCGCG CCCTCGCCCA TCCCATGTTC GAGTTTAGCA ACGTCACTTG CCTTGACTTG CTCTCGCCCC TCTGGACCAA ATACGGTACC ATCAAGCCCG CCGAGCTCCA GAAAAATTTC CAGTCCATGT ACACCCCTTG GAACACAACC GAGCCGATTG AATCAGTTTT TCTTCAGCTC GACGAGGCCA TCGCTTTCTC TGTTGATGGT AACAACCCCA TCTCGGAAGC TGCTGCTGTT CGCGCCGGCT ACGAAGTCAT TGCGCACTCG GGCCTGCTCC CCCTGGACTG CAAAGAATGG CGCAAATTGC CTACTGCTGC TCACACCCTT GCCCATTTCC AGCAGCACTT TTCCCTTGCC GACGAAGACC GGCGCCTCAC GGCCACCACC GGTTCCCTCG GCTATGCCAA CGTGCTTGCT GCTGCCCCCT CTCTCGCTCC TGCCACGACC TCCGACACTC TCAGCCTTCC TTTCTCCGCG CTCTCTGTGT CCCAGACTTC TGTCTCTTCG CCGGACATGA CCTATTGCTG GACCCACGGT ACCAGCAAAA ACCGGTGCCA TACAAGCGCC ACGTGCAAGA ACAAGGCCCC TGGCCATCGC GACGACGCGA CCGCCACCAA CACTCTCGGC GGCTCCACCA AGGTTTGGAC CGCTCCCAAG CCCCCTGAAT AGGAAAGAGG GACGGCTACG CCAATGGTTA ACTCTAGTAA TACCGATTAT TTAAATCATA TTACTAGTCT TAATTCATCT GTAGTCCCCT CCCCGCCTAG TCCCCATACC TCGGCCATTG CCGACACCGG TTGCACCGGC CATTACATCA CCGTCAACTG CCCCCACACC CACAAACGTC CGGCAAGCCC CAGCCTTGCC GTACGTGTCC CTAACGGCGC CGTCCTCCGC TCAAGCCACA TTGCCACCCT GGCCCTCCCT GGCTTCTCCC CTTCTGCTTG CCAGGCCCAC ATCTTCCCCG GGCTCACCTC ACACCCACTC ATTTTGATTG GACAACTTTG TGACGACGGC TGCACCGCCA CTTTCTCAGC CACACGCCTC GAGATCCACC GCGACACTAC ACTACTCCTC TCCGGCACTC GTGCACCCAC CACCGGCCTC TGGCACCTTG ACCTTACCCC TGCCAAGCCT CCTGCCACAG CCCACGCTCT TGTTCCCAAC ACTCCCCTTG CTGACCGCAT CGCTTTTGTT CATGCCTCGC TCTTCTCCCC GGCTATCTCC ACATGGTGCC AGACCCTCGA CTCCGGCCAT CTTGCAACCT TTCCCGAACT TTCCTCCCGC CAGGTCCGCA AGCATCCACC TCATTCCCCC GCCATGGTCA AGGGCCACCT CGACCAACAA TGCGCAAACC TTCGCTCCAC CAAGCTTCCC CCTGTAGGTT CCCCCATCAC GACGGAACCC CTTGCCGCCG CTGTGCCCGA CCTTGACCCT CCCGACGCCC ACGACGTCAC ATGCACACAC CATGTCTTTG TTGCCCACCA ATGGGTTACC GGTCAGATCT ACACGGACCA ACCGGGCCGC TTCCTCACTC CCTCCAGTGC CGGCCACAAC GATATGCTTG TTCTTTATGA TTATGACAGC AATGCTATCC ACGTCGAACT CATGAAGAAC AAGTCCGGCC CCGAGATTCT AGCCGTCTAT AAGCGCGCTC ATGCTCTTTT CACCCAGCGA GGCCTTCATC CCCAACTCCA GCGTCTTGAC AACGAAGCCT CTGCAGCCCT CCAGTCCTTC ATGTCCTCCG AGCACGTGGA CTTTCAGCTG GCACCCCCCC CATCTACACC GCCGTAATGC AGCCGAACGG GCCATCCGCA CCTTCAAGAA CCACTTTATT GCTGGTCTCT GCACCACTAA CCCGGATTTT CCCTTGCATC TTTGGGACCG CCTCCTCCCA CAGGCCCTCA TTACCCTCAA TCTTCTTCGT CGCTCCCGCA TCAATCCCAA GTTGTCCGCC CACGCACAAC TTCACGGTGC CTTTGACTAC AACCGCACCC CGCTCGCTCC TCCTGGCACT CGCGTCTTAG TTCATGTCAA GCCCGCTGTT CGCGAAACCT GGGCCCCCCA TGCTGTTGAA GGTTGGTATC TCGGCCCCGC TCTCAACCAT TATTGCTGCC ATCGCGTCTG GATCACGGAA ACACGTGCCG AACGTGTTGC TGACACCCTT TCCTGGTTCC CGACCCGCAT TCCCATGCCC GCCGCTTCGT CCACCGACCG CGCCCTGGCC GCCGCCCGTG ACCTAGTCCA TGCCCTCCAG AATCCTTCCC CTGCATCTCC GTTCGCCCCC CTTGATGCCA CCCAGCACCA GGCCCTTACC GACCTCGCCA ATCTCTTTGC CACTGTGGCC GCCCCAGCCG ACGACGTCCC TGCACCCGCT CCGGTGCCTC CGGTCCGTCC CCCTGCCCCA GCAACTCCCG GTCCGTCCCC CTGCCCCAGC AACTCCCCTT GCGCAGGTCC GTTTTGCCGT TCCTCTTGTC ACGGCCGAAC ATGCCCCGGC ACTTCCGAGG GTGCCCATTC CTGCCCCAGC ACTTCCGAGG GTGCCCACCC TGGCCACCTA TCACTCTCGC ACCGGCAACC CCGGCCGTCG CCGCCGCACC GCACGCAAAC AACCGGCAAC CCCAACCCTA GTCCCGGCGC ATCCCCTGTT ACCGAGTACT ACCAAAGCAC AAGCGTAGGA CAGATCGCTG TTAGGACAGT GAGTGGTAAA CGAAGGTGGG CGTTGAGACG AACGACCTGC AATACACCTA AGAGACTTAG GGACTGTAAA ACTCAACTCG TTGACGACGC TCCCCACGGG TTAGACAGCG GATACGAGGA CCTATGCGTA G
|
Protein sequence | MSLEQLPLGK ESVSVSQSDG RSGYMSHMEF VSALLRKSTV FVDQTPELRS NPTVFTPLTL PSHPSATSSV PMSTSAHFKL SDFPHKVLDP IATLTVPPTY ATIKRAQRQL MTNAAAIPTL NGGGAHGHMA LTLTALAYAD ISDVPFVIPV APPANPPPGA TQPQITENNR IHQHDADIYN LYVAVNNALR QQLLDAVPRI YVRALAHPMF EFSNVTCLDL LSPLWTKYGT IKPAELQKNF QSMYTPWNTT EPIESVFLQL DEAIAFSVDG NNPISEAAAV RAGYEVIAHS GLLPLDCKEW RKLPTAAHTL AHFQQHFSLA DEDRRLTATT AKTGAIQAPR ARTRPLAIAT TRPPPTLSAA PPRFGPLPSP LNRKEGRLRQ CLNSSVVPSP PSPHTSAIAD TGCTGHYITV NCPHTHKRPA SPSLAVRVPN GAVLRSSHIA TLALPGFSPS ACQAHIFPGL TSHPLILIGQ LCDDGCTATF SATRLEIHRD TTLLLSGTRA PTTGLWHLDL TPAKPPATAH ALVPNTPLAD RIAFVHASLF SPAISTWCQT LDSGHLATFP ELSSRQVRKH PPHSPAMVKG HLDQQCANLR STKLPPVGSP ITTEPLAAAV PDLDPPDAHD VTCTHHVFVA HQWVTGQIYT DQPGRFLTPS SAGHNDMLVL YDYDSNAIHV ELMKNNPSCP PSTWTFSWHP PHLHRRNAAE RAIRTFKNHF IAGLCTTNPD FPLHLWDRLL PQALITLNLL RRSRINPKLS AHAQLHGAFD YNRTPLAPPG TRVLVHVKPA VRETWAPHAV EGWYLGPALN HYCCHRVWIT ETRAERVADT LSWFPTRIPM PAASSTDRAL AAARDLVHAL QNPSPASPFA PLDATQHQAL TDLANLFATV AAPADDVPAP APVPPVRPPA PATPGPSPCP SNSPCAGPFC RSSCHGRTCP GTSEGAHSCP STSEGAHPGH LSLSHRQPRP SPPHRTQTTG NPNPSPGASP VTEYYQSTSV GQIAVRTTAD TRTYA
|
| |