Gene Information |
Locus tag | PHATR_33116 |
Symbol | |
ID | 7204251 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 81317 |
End bp | 85372 |
Gene Length | 4056 bp |
Protein Length | 1057 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185995 |
Protein GI | 219112823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
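The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive (this reproduces the listed Gene Length) and that the coding sequence includes one stop codon. The helper names span_length and min_coding_bp are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 81317
END_BP = 85372
GENE_LENGTH_BP = 4056
PROTEIN_LENGTH_AA = 1057


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def min_coding_bp(protein_aa: int, with_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if with_stop else 0))


gene_span = span_length(START_BP, END_BP)
coding_bp = min_coding_bp(PROTEIN_LENGTH_AA)
# Assumption: the remainder is non-coding (e.g. intronic), since the gene is marked as spliced.
non_coding_bp = gene_span - coding_bp

print(gene_span, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the listed Gene Length of 4056 bp, and the 1057 aa product accounts for 3174 bp of it. The remaining bases are consistent with "Is gene spliced | Yes".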
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.51252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTCGG ACGCACACAA ACCGCAGTCT GTGAGCGAAA AAAAGGGAAA CAAGGACCCG ATGCCGCAGC GGCAACGAAA AGCGCTTGGA TGCTGCCGGA GCGAGAACGC AATCCAAGGA AGTGGTGTCA CTGTCACGTC CATCCCCCCC AATTGACCTT GACACCACGA GTTCGCACGC TCCATACAAG TTTCCAGAGT CCAGTGGTGA CGCTGTGCCC TTTCCCGGCA CTCGATGCAG TCGTGTGTGT GTGTGCGTAT TTACAGTCAA TACTACAAGC GTTCCCCCGC CATTCCCATA TCCCCTAGCC CTACAGAAAC GCCAGTCCCG GTCCCGTTTC CCCAACGAGG ATTCTCCCTC CTCGAAGAGC AGCTCAAGCC CCCTCTAGTT AGACATGAAT CTACCCACGC ACACGATCAG TGCGATTCCA ATGGAATATG GTATGTCCGA TTCTCACAGT CAACGACAAA ACTCTGCGTC GGCCTTTGCA GAAGACGAAA AGTACCCGGC GGGCGTTTCC CAGACATGTT GGTGGATGTG AACGTCCCCA GAGGTACACA ATTCACGGTC AGTCCATAGT CTGTAGGTCC CACGCTCTGA TGTCCTCTAC AAAGAACTGT GTACGGAATC TAATAGCAAT GATATTCATT TACATTGCTT TTCTGTTATT ATCCCCCCAA AAAAGAACAC TGCAGTCCTG CCAAATTAAT TACACATTTA GCATTCTCAA CGAAATACGC CATTTTGACA CGTATTCGGT CCTCTCGAAT TTGGCAACAA ACAACATTCG CCGTGTACCA TAGGAAGAAG TGTACTGCTG CTCCCGACAT CCATCGTTGG ATTAGGGCAC TTATCGATCG CTTCGAAAAC CTCTCTTGCA TTGCAGTAAG AAAAGAACGA CTCATGGTGC ACGAAAGACT TTGGCGTTCG AAACGGACCG ATCCTCGGAA AACGGATATC CGACTAGTGA TGACAACGTG CGCATTGCTA GCTTTTGGGA GTCTGTACAC GTATTACATG AGTTTGTTGA ACAATTTCGA CTACTCCATT CTGTACAACG ATATCGGAGT TTCCGTGCCT TTGCCTGTGC TACCCAACTT TACCTCTCTA TTGATCGCGT CGGGATCGGA CAAGTACAAT AGGCATCACT ACGAGCGATA TTACGAGCGC TGGCTCGAAC CGTATCGCGA TGTGCCGGGC GTGAAAGTAC TCGAAATTGG CGCTAATCAA GGACACTCCT TGAAGCTATG GGAAGACTAC TTTGCGGATC CAGACATTAT TCTGGGATTG AAGTATGGGA ACGCCGCCAA CGGTATTGAG AACAAGATTG TGAACCTCAC CAAAGTCTCC CTCTATACTG GTGACCAGTC CTCCAAGCCC ACCATGGACT ACTTGAATGA GCGCGGACCT TGGCACATTA TTATTGATGA TGGCTCGCAT GTCCCACAGC ACGTGATATA TTCGTTGGTG CATTTATGGG ACTCGGTGGC GCCAGGAGGC ATGTATATTG TGGAAGACCT GGAAACAAGC TACTGGCGCA ACGGTTCCAA CGTCTACGAC TATCCTCTTG CTAATACTGG AGTACTCGCG GACGCCAATC ATTCCGCAGC GGCCAAGATT ATGCAACTCC AGCATATCTT GGTCCGCCAC CAAATTGGGG CTCGAGACAT GTCGATCTTT CCCGGAGATG ACACCATTTG CTCAATCGAA TGGGGCATGA ACCTGCTGGC CATTCGTAAG TGTGGGCTAC CAACGGATGG TGTGGGACCT AAGTATTTCA GAGAGAGATT TGACGCCTCT 
GAAATGCGTA CCTGGCTCAA GCAATCCAGG TCCTCCAATC CGAAAGATTA ACGAATGTCT GTGTCGTAAA GTGCATAGAG ATAAGAATAA TTGGACCTTG CAATCAATAC TGTAAGTATA GGAATAAAGG GACAGGGGCT ACAGGACTAT GTATGGTCGA AAGAGCGTTT TGTCTCACAG TCAGAGGAAG TTCAGGATGC CGAGCATTCG AAATGTCAAT AGTTTTACAA GTCGTCACAT CAGACATCCG ATGGGTTATC GCTTGTAATA TCTAATTTTC GTCTCACTGT CAATCCCATA TGTCATTCCT GGACATCGAC CGTCGATCAT GAACATAATA AGACTTCATT TTTTTCGTTT GCTTTACAGT TCATCCCATG CCTTGCCTAC TTTCCTCACC ACAGTCTGCT CTAAAACGAG ACGCCTTTTG AACAATTGCA CAGTAGATAA GCCAGTCAAA GCTATGCCGC GGCAAAACCT AGTCGTTGAG ATTCAAGCGA TAGTTCCGGA GGAACCGAAA GGGTTTTTGG ACGACCCAGA CGTGGACGCT GCTGCACCGG CAATGGATGC TGGCGCCCCG CAACACTATT CGCTTCCTCA GCAGAACCGC ACCACCGACT CCCCGGTCGT TCTTCATACA ACCATGTTTG CGTCTACCGC ATCCCTCTGC CTTTGCATGT TGACGCATAG TTTCTTACTG ATTTCGGTAT TTCCGTATTC CGGTTTCATG GCCGTAGAAC TGATCGAATC GGTAGACGAA GAAACAGCCG GTGCCTACGC TGGATTGCTC GCCTCGTGTT TCATGTGGGG TCGGGCGACG ACGGCGTACG GCTGGGGTCA AGTGGCCGAC GTATACGGAC GTACCACGGT ACTGTATTGG TCTTTCGCAC TTTCGGGGAT CCTCTCGATC GCCTTTGGAC TGTCACCAAC GTTTGGAAGT GCCTTGTTCC TCAGATTCGC TCTCGGTTGT GCGAATGGGA TCATGGGAAG TATTAAGACG ATTGTTTCCG AGATATCGGC GGGGAATGAG GCCTTGGAAA CTAAAACCAT GACAATGGTG ATTGGCATGT GGGGGTGGGG CTTTCTCGTG TCGCCCGCCT TGTCGGGAAT ATTAGCAGAA CCAGTCAAAC AGTACCCCGG CGTAGAATGG CTACAGCGCG AGGGAATATG GAACGCAGTG TTAGCCAAAC ATCCCTTCTT GTTACCGAAT CTACTCGCGG CGATTTTTTG TTTGATAGGC GTCCTGGTAA TTCGAATGTT TGTTCCCGAA ACCTTACCAT TCGGACAACG ACGCGACCCG CGACTCTTGC TATACGATAT TGGAGCTTGG TGCCAACGGT CGGCCGGGTA TGCAAAAGTG CCGTTGAATG TGACGCGATA CCAGCTTGTA CCGACCCTCA AAACGCACCC ATCCGACTTG GATCTGAGCA GCAGAAACAC AAGGTTTTCA GTTTCTTGTC ACAACGCAAT CGACGAGGAT GATCTTGACG CTGTCCAAAC ACTAGAGTCC AATGAACAAG TTGTATCCTT ATCGACAAAC ATTCCTGAAA AGGCGACCAT TTTGTCCTTG CTCTCCCGCA AACCAACACG CACTTGCTTA CTCATATATT GGGCCTATTC GTTTGTCGGT CTGACCGTAG ATGAATCGTT TCCACTCTTT TGTATTTCCA AACAGGCAGG GTTTGGACTG TCTGAATATC AAATTGGCCA GATCTTGTCG CTTTGTGGAT TGTTCTTTGC CGTCTCTCAG TACAGTGTTT ACACGACCAT CTACAACCGC TTTGGTCTGT ACGGATCGAT ACGCTTTGGA 
AGCTGCTTTA GTGCACCCGT AATGTTCCTA ATGCCCTTAT CGGTACTGCT GAATCGAGGC GCGCCAACCG GTCATCTCCG CACTTCCGCC TTGGTGTTTT TGTCCACTTG CATGGCGGCC TACCGGGTGT TTGGGCTCGT GTTTTTCTCG AGCGTTTCCG TGACTATGAA CCGAACCGTG CCTCGTTCGC ACCGGGCTAC CATGAACGGC TTATCCGTCT TGGGAGGGAG CGTCGCAAAA GGCTTGGGGC CCATTTTTGC CGGCTTTCTC GTTTCGGGGT CCGTGGCGCT TTGGGGAAGC CTGGGAGGAT TGCTCATTTT CGGCACCATT GGATTGATTG GATGTGCCGT GGCCGCGACG ACTTTTTTCT ACCTTCAAGC CAGCGATTGT GAAGGTTCTA CTGACGATTT AGAGCAGAGT GTGGTCGGGG ACACCGACCA AGTTAG
|
Protein sequence | MYSDAHKPQS VSEKKGNKDP MPQRQRKALG CCRSENAIQG SGVTVTQYYK RSPAIPISPS PTETPVPVPF PQRGFSLLEE QLKPPLVRHE STHAHDQCDS NGICQRQNSA SAFAEDEKYP AGVSQTLRKE RLMVHERLWR SKRTDPRKTD IRLVMTTCAL LAFGSLYTYY MSLLNNFDYS ILYNDIGVSV PLPVLPNFTS LLIASGSDKY NRHHYERYYE RWLEPYRDVP GVKVLEIGAN QGHSLKLWED YFADPDIILG LKYGNAANGI ENKIVNLTKV SLYTGDQSSK PTMDYLNERG PWHIIIDDGS HVPQHVIYSL VHLWDSVAPG GMYIVEDLET SYWRNGSNVY DYPLANTGVL ADANHSAAAK IMQLQHILVR HQIGARDMSI FPGDDTICSI EWGMNLLAIR PPIRKINECL CRKVHRDKNN WTLQSILSSH ALPTFLTTVC SKTRRLLNNC TVDKPVKAMP RQNLVVEIQA IVPEEPKGFL DDPDVDAAAP AMDAGAPQHY SLPQQNRTTD SPVVLHTTMF ASTASLCLCM LTHSFLLISV FPYSGFMAVE LIESVDEETA GAYAGLLASC FMWGRATTAY GWGQVADVYG RTTVLYWSFA LSGILSIAFG LSPTFGSALF LRFALGCANG IMGSIKTIVS EISAGNEALE TKTMTMVIGM WGWGFLVSPA LSGILAEPVK QYPGVEWLQR EGIWNAVLAK HPFLLPNLLA AIFCLIGVLV IRMFVPETLP FGQRRDPRLL LYDIGAWCQR SAGYAKVPLN VTRYQLVPTL KTHPSDLDLS SRNTRFSVSC HNAIDEDDLD AVQTLESNEQ VVSLSTNIPE KATILSLLSR KPTRTCLLIY WAYSFVGLTV DESFPLFCIS KQAGFGLSEY QIGQILSLCG LFFAVSQYSV YTTIYNRFGL YGSIRFGSCF SAPVMFLMPL SVLLNRGAPT GHLRTSALVF LSTCMAAYRV FGLVFFSSVS VTMNRTVPRS HRATMNGLSV LGGSVAKGLG PIFAGFLVSG SVALWGSLGG LLIFGTIGLI GCAVAATTFF YLQASDCEEC GRGHRPS
|
| |