Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42537 |
Symbol | |
ID | 7196086 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 352279 |
End bp | 355509 |
Gene Length | 3231 bp |
Protein Length | 1030 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176566 |
Protein GI | 219109623 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGAATACA TAAGTAGTAA ATTTTCAAGA ACCTCTGTAT TGATTTGGCT TCACAGGCCA CCATGGCACC TGCCACTCGG CAAATGACGG GCGGAGCGGT CTATGCGCAC CTTTTGGATA ACGTGCTTCT TCTTCCCCAA GGGCACCCTA TTTGTCTCAG TTTTGCACAA CAAGGATACG AATCGGCTGA TGACCTCCTA TGTATTTTTG AGAATGAACT TGAAACTCTT GAATTCATTC CTCTTGCCCC TGCTGACGGC CCCGAAACTA CGGCACCGGT TGCCTTACTC ATGGCACATT GACAGATCAT CCGTCATTTC CTCCGGTGGC AAGCGTCCCT TGAGCGTCAA AAGGGAACTC CTTTGAAGAA CTCCGAGCTT GCAGCCCTGA ACAACAAAGA CTTTGTCCTG TACCGCCGAT CCGCTCTCGG CCAGGTCTCT TCGACTGTTG CTCCAATAGT CACAAACCCC AATGCTGCAA TTCCCACCGC TAAAACTCGA TCTGCTGTGG AAGATTTCAA GCGTGGGATC AAACGAGACA AAACCCATTA CCCCGTGCTC AAAGACGACC GGTACTGGGA TAATTTCTAT CGATCCTTCG TCGTCACTGC CGTCTCCCAT AACGTTGAGA AGGTACTTGA CCCATCATAC TTGCCTACTG ATCCACTGGA AAAGTCGTTG TTTGAAGAAC AAAACAAGTT TGTATACTCA GCCTTGGAGC ATACACTTCA GACGGACATG GGCATAAATA TCGTTCGAGA ACATAGTTTT GATTTCAATG CCCAGGAAGT TTTCCGTAAA GTGGTCAAGC ACTATACAGA GTCCGCCTCT GCAAAGATCA GCTCCTCTAC CACTCTAGGA TACCTGACCA CGGCAAAGTA TAGCTCATCA TGGACTGGCA CAGCGGAGGG ATTTATCCTA CACTGGAAGA ATCATTTGCG TATATATAAT GATACCGTCC CTACGGGTGA GCAGCTCCCA CAACAGCTTT GTCTCAGTCT ATTGGAGAAT GCTGTCCATG ATATACCCGA ACTTCGTCAG GTTAAAATCA CGGCAACTTT AGACTTAGCA AAAGGTGGCA GCCCTATTAG TTATGACGGT TATCTCAGTC TACTACTTGC ATCAGCATCG CTCTATGACA ATGGCAACAA CCTATCTAAT GCTTGTAGCA ACAAGAACAA ACGTCATGTT TATTCTACTG ACTTAGTCTA CCATCCAACT GACTTTGACA GCGATCTAGA CGTAAGTTAC GATATAGATG TGTCACCCAC AGCAATCTAT GAAGCCAATG CCCATGCACG CAACTCCGGT AATAGTGGCA ATCGCAGTCG CAACGCAGCT AGCCCCAGAG ACCGACCTTA TATTCCCCGG GAAATGTGGA ATCAACTCTC AGAGGATGCA AAAGCCATTC TCCAAGGCTT GTCTGCTCCT AGTAAGAGTA CATTACCGGC CGCGCAACCT TTTTCACAGG TGCTACAAGC CAATACGCAT AGCCATGGTA GCAGCGAAAC CGCGGACACT TTCCATGATT GCGCACCGGA GACTGAGTTG TTGGCTCACC TTACTGACCG CGTCAGTCGT ATGAACGATG GTGATATTCG TAAAGTCCTT GCAGCATCAC GTGACAACGT CTCCCCACAA CCAGGAGCGA GACCCAAATT CATGCAATCC AATATGCTAC GTTATCAAGT CTCTCGGCAT AATGTCAACG GTACCACTGC AGCTCTTGTC AATCGTGGTG CTAATGGCGG ACTTGCCGGG GCGGACGTCA TGGTGCTCAA CAAAACAGGA CGTTCCGCCA ATATAACTGG TATTAATGAT CACACATTGT CCGATTTGGA TATTGTCACC GCTGCAGGAT GTGTTGAATC CCATACCGGT CCTATCATTG TAATTATGCA TCAGTATGCG TATCTTGGCA CTGGTAAGAC TATACATTCC AGTGCGCAAC TCGAGCATTT CCATAACAAC GTTGAAGACC GTTCACGTAC AGTTGGTGGA GACCAGCGCA TTGTGACCTT AGATGATTAT ATCATCCCCT TGCACATCCG CCAAGGTCTT CCATATATGG ATATGAGGTG CCCAACAGAT GCCGAATTTA CCTCTCTCCC GCATGTGATA TTGACCTCTG ATGTCGATTG GGACCCGTCA GTCCTTGACA ACGAGATTGA TCTGGCCACC GATTGGTACG ACACTGTACA GGATTTACCC CACTACCATA TGTCGAACCG CGTTTTGACC ACATGGGCAA ATATCTCCAT CGTCATATTT CGCTTTGTGA CACTCGCCAC CATGCCGTTG ACTGTATCCT TCAATGTCAG CAGCATGAAA TTCAGCGTAA TGACCATGAC TACGAAACCC TCCGTCCTTG TCTTGGTTGG GTATCCGCCG ATACCGTTCG TAAGACTATA CAGGCCACCA CCCAGTATGC ACGAGAGGTA TACCACGCAC CGTTACGCAA GCATTATAAG TCGCGCTTCC CGGCCTTAAA TGTCCATCGG CGTAACGAGC CAGTTGCCAC CAATACCATT TGGTCAGATA CTCCTGCTGT TGATAGTGGT GCCAAATTTG CGCAACTTTT CGTGGGCCGC CGATCCCTTG TCACTGATGT TTATCCCATG AAAACCGACA AAGAATTTGT TAACGCTCTC GAAGACCATA TTCGGTTTCG CGGCGCTATG GACAAGCTCA TCAGCGACCG TGCACAGGTC GAGATTAGTA AAAAGGTCAT GGATATCACC CGTGCTTACA ACATTGACCA GTGGCAAAGC GAACCACACC ACCAACACCA AAATTTTGCT GAACGTCGCA TTGCCACTAT CGAGGCTAAC ACCAACAACA TTCTCAATCA CACCGGTGCC CCTGACTCCA CATGGCTTCT TTGTGTCACG TACGTGTGCT ATGTATTCAA TCATCTCGCC CATGAATCCT TGCACAACCG TACACCCTTA GAAGTCCTTA CTGGTTCCAC TCCTGATATC AGTGTTCTTC TTCAGTTCCA TTTTTGGGAA CCCGTCTATT ATCGACTCGA AGATGCGACC TTTCCGTCTG ATGGTACTGA ACAAACGGGA CGTTTCGTAG GCATTGCTGA CTCCGTTGGC GATGCTCTTA CTTATAAGAT CCTCAACGAT ACTTCTAATA GAATCCTCTA TCGTTCCAGC GTGCGCTCTG CAAACCTTCC CGGTGAAACC AACCTACGCC TTACATCACA GGATGGGGAG AATGGCCCTA A
|
Protein sequence | MAPATRQMTG GAVYAHLLDN VLLLPQGHPI CLSFAQQGYE SADDLLCIFE NELETLEFIP LAPADGPETT APIIRHFLRW QASLERQKGT PLKNSELAAL NNKDFVLYRR SALGQVSSTV APIVTNPNAA IPTAKTRSAV EDFKRGIKRD KTHYPVLKDD RYWDNFYRSF VVTAVSHNVE KVLDPSYLPT DPLEKSLFEE QNKFVYSALE HTLQTDMGIN IVREHSFDFN AQEVFRKVVK HYTESASAKI SSSTTLGYLT TAKYSSSWTG TAEGFILHWK NHLRIYNDTV PTGEQLPQQL CLSLLENAVH DIPELRQVKI TATLDLAKGG SPISYDGYLS LLLASASLYD NGNNLSNACS NKNKRHVYST DLVYHPTDFD SDLDVSYDID VSPTAIYEAN AHARNSGNSG NRSRNAASPR DRPYIPREMW NQLSEDAKAI LQGLSAPSKS TLPAAQPFSQ VLQANTHSHG SSETADTFHD CAPETELLAH LTDRVSRMND GDIRKVLAAS RDNVSPQPGA RPKFMQSNML RYQVSRHNVN GTTAALVNRG ANGGLAGADV MVLNKTGRSA NITGINDHTL SDLDIVTAAG CVESHTGPII VIMHQYAYLG TGKTIHSSAQ LEHFHNNVED RSRTVGGDQR IVTLDDYIIP LHIRQGLPYM DMRCPTDAEF TSLPHVILTS DVDWDPSVLD NEIDLATDWY DTVQDLPHYH MSNRVLTTWA NISIVIFRFV TLATMPLTHE IQRNDHDYET LRPCLGWVSA DTVRKTIQAT TQYAREVYHA PLRKHYKSRF PALNVHRRNE PVATNTIWSD TPAVDSGAKF AQLFVGRRSL VTDVYPMKTD KEFVNALEDH IRFRGAMDKL ISDRAQVEIS KKVMDITRAY NIDQWQSEPH HQHQNFAERR IATIEANTNN ILNHTGAPDS TWLLCVTYVC YVFNHLAHES LHNRTPLEVL TGSTPDISVL LQFHFWEPVY YRLEDATFPS DGTEQTGRFV GIADSVGDAL TYKILNDTSN RILYRSSVRS ANLPGWGEWP
|
| |