Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36007 |
Symbol | |
ID | 7201348 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 322052 |
End bp | 325416 |
Gene Length | 3365 bp |
Protein Length | 1106 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180413 |
Protein GI | 219119300 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.287397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC GTACTTCTTC TTCCCCAAGG GCATCCTATC CGCCTCAGTT TTGAGCAACA AGGATATGAA TCGGCTGATG ATCTTCTGTG TATTTTTGAG AATGAACTTG AGTCTCTTGG ATACACTCCT TCTGTCCTTC CCGACGGCCT GGAAAACCCG CCAACTATAC CCCTTCTCAT GGCGCACCGA CAGATCATAC GTCATTTCTT GCGCTGGCAG GCATCTTTGG AACGACAAAA GGGGACACCC TTGAAGAACT CCGAACTTGT TGCACTTAAC AATGAAGATT TTGTCCTTTA CCGTCGCTCA GCCCTTGGTC AAGTCTCGAC AGCAACTGCA CCGGTTAATG CTTCCCCAAC TGTCCAGAGC CCCATAGGAA AGACACGTTC GGCTGTCGAG GACTTCAAGC GTGGGATCAA ACGTGACAAA ACTCACTATC CCGTGCTTAA AGATGATCGG TACTGGGACA ACTTTTATCG GTCGTTTGTT GTTACTGCCG TAACACATAA CGTTGACAAA GTTCTAGATC CGACGTACAT CCCTACCGAT CCCTTGGAGA AATCCCTTTT TGAAGAGCAG AACAAGTTCG TATATTCTGC TCTAGAGCAT ACTCTCCAGA CGGACATGGG CAAGAACATT GTACGCGAGC ATAGTTTCGA CTTCAATGCC CAGGAAGTTT TCCGTAAGGT TGTGAAACAC TACACAGAGT CCGCTAGCGC GAAGATTAGT TCGTCTACTA CCCTGGGATA CCTTACAACT GCAAAGTACG GATCGTCATG GACTGGCACA GCAGAAGGTT TTATTCTTCA CTGGAAAAAT CACTTGCGCA TCTACAATGA CACTGTTCCT GCTGGTGAAC AGCTTCCTCA GCAACTATGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT GTACCTGAGC TTCGACAGGT AAAAATCACT GCAACTCTTG ACTTAGCAAA GGGAGGTAAT CCTATTAGCT ATGATGGTTA TCTCAGTCTA CTACTCGCAT CGGCATCGCT CTACGACAAC GGCAATAATC TATCTAATTC TCGTAGTGGC AAGAACAAGC GCAACATCTA TGCTAATGAA CTAGAGTACA ATCCGATGGA TTTTGAGAGT AAACCGGATG TAGACTATGA TATAGATGTG TCGCCTACCG CAATCTACGA AGCCAATGCT CATGCCCGTA ACAGCAGTTC CCGGAATCGT AGTCCGGCAG CTAATCGCGA GCGACCTTAC ATCCCTCGTG AAATGTGGAA CCTGCTCTCC GACGATGCCA AAGCCATCCT CCAAGGCTTA ATAGCCCCCG GGAAGCAGGC CCCGTTGAAT AATACGCCAC ACCAATCGTT GCAGGCCAAT ACGCACGATA CCATTGGCGC GGAACGAATC ACAACGGACA CCTTCCATGA TTGCGCACCC GAAACTGAAT TGCTTGCCCA CCTGACTGAG CGTGTTAGTC ACATGAGCGA CGGCGACATA CGTAAGGTAC TTGCCGCATC TCGTGATGGT CCCGCCTATG ATGAGCCCAC ACCACTGCAA TCTAACGTAC TTCAATATCA AGTGTCTCGT CACAACGTCA TTGAAACTAC GGCAGCCCTC GTCGACCGTG GAGCCAATGG AGGTCTTGCC GGCAGTGATG TCATGGTCTT GCATAAAACA GGTCGTTCTG CAACCATCAC AGGTATCAAT GATCATACCT TGTCCGATTT GGACATTGTC ACCGCTGCTG GCTACACTGA ATCCCAAAAT GGCCCCATCA TTCTCATTAT GAACCAATAC GCCCATTTGG GACAGGGTAA AACTATCCAC TCCAGTGCAC AGCTTGAACA CTATCGCAAC CATGTCGAAG ACCGTTCCCG TACTGTAGGA GGTAACCAGC GAATTGTAAC ATTGGATGAC TACATCATCC CATTGCACAT TCGACAAGGA CTCGCGTACA TGGATATGCG GCGTCCTACC GACAAGGAAC TTGCGTCCCT TCCACACGTT GTCCTAACCT CCGACGTAGA CTGGGATCCC TCCGTACTTG ACCACGAAAT TGATCTCGCG ACCTCTTGGT ATGATGACAT ATATGATTTG CCTCAATCAC CTTACGTTGA ACCACGTTTT GACCATACAG GCAAATACCT CCATCGTCAC ATTTCCCTTT GCAACCATCG CGATGACGTT GTTGACCGCG TATTATATTG CCAACGGCAC CTCGTCACGA AAAATGTGCA AGATTATGAG GCCCTTCGTC CGTGTTTTGG ATGGGTCTCT GCTGAAACCG TTCGCAAGAC CATCATGGCG ACCACGCAGC ATGCACGCGA AGTATATAAC GCTCCGTTAC GCAAACATTT TAAGTCTCGC TTTCCCGCTC TAAATGTACA CCGTCGTAAT GACCCAGTTG CTACCGATAC CATTTGGTCC GACACCCCTG CTGTCGATAA TGGTGCTAAA TTTGCACAAC TTTTCGTTGG TCGACGCTCC CTTGTCACCG ACGCTTACCC CATGAAAACT GACAAAGAAT TCGTCAATAC CCTTGAGGAC CATATCCGTT ACCGGGGTGC CATGGACAAA TTGATTAGCG ATCGTGCCCA GGTTGAAATC AGCAAAAAGG TCACCGATAT TACACGCGCA TATAATATCG ACCAGTGGCA AAGTGAACCA AACCATCAAC ACCAAAACTT TGCCGAACGT CGTATTGCCA CTATCGAGGC TAATACCAAC AACATTCTCA ATCTTTCCGG TGCCCCTGAT TCCGCCTGGT TACTTTGCGT GACATATGTT TGTTATGTTT TCAACCATTT GGCACATGAA TCCCTAGATA ACCGCACTCC CCTTGAAGTC CTCACCGGCT CCACGCCTGA TATCAGTGTT CTCCTTCAGT TTCATTTTTG GGAACCGGTC TATTATAAGC TCGAAAATGC GACATTTCCT TCTGGTGGTA CCGAACAACA AGGACGTTTT GTTGGCATAG CCGACTCCGT CGGCGACGCT CTCACTTATA AGATACTTAC CCACACCACC AACCGCATTC TTCATCGCTC TAGTGTCCGT TCTGCGACCA TTCCCGGACA AACCAACCTA CGCCTTACGC CACAGGATGG GGAGAGTGGT CCTAAACCCA TCAACTTTAT CAAGTCGCGT AGAACCGAAA ACAAAAATTC CTATGCCATT AAGGAGTTGC CTGGTTTCAC ACCTGATGAC CTTATAGGTC GTACGTTCCT CACCGACACT CGGGATGATG GGGAGCGTTT GAAGGCACGA ATCACGCGGA AAATATTGGA CCCAGACAAG CCCTCGGATG TAAAGTTCCT TGTCGAAATC AATGA
|
Protein sequence | MVPATRQMTS AAVYAHLLDN VLLLPQGHPI RLSFEQQGYE SADDLLCIFE NELESLGYTP SVLPDGLENP PTIPLLMAHR QIIRHFLRWQ ASLERQKGTP LKNSELVALN NEDFVLYRRS ALGQVSTATA PVNASPTVQS PIGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV VTAVTHNVDK VLDPTYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA QEVFRKVVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP AGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGN PISYDGYLSL LLASASLYDN GNNLSNSRSG KNKRNIYANE LEYNPMDFES KPDVDYDIDV SPTAIYEANA HARNSSSRNR SPAANRERPY IPREMWNLLS DDAKAILQGL IAPGKQAPLN NTPHQSLQAN THDTIGAERI TTDTFHDCAP ETELLAHLTE RVSHMSDGDI RKVLAASRDG PAYDEPTPLQ SNVLQYQVSR HNVIETTAAL VDRGANGGLA GSDVMVLHKT GRSATITGIN DHTLSDLDIV TAAGYTESQN GPIILIMNQY AHLGQGKTIH SSAQLEHYRN HVEDRSRTVG GNQRIVTLDD YIIPLHIRQG LAYMDMRRPT DKELASLPHV VLTSDVDWDP SVLDHEIDLA TSWYDDIYDL PQSPYVEPRF DHTGKYLHRH ISLCNHRDDV VDRVLYCQRH LVTKNVQDYE ALRPCFGWVS AETVRKTIMA TTQHAREVYN APLRKHFKSR FPALNVHRRN DPVATDTIWS DTPAVDNGAK FAQLFVGRRS LVTDAYPMKT DKEFVNTLED HIRYRGAMDK LISDRAQVEI SKKVTDITRA YNIDQWQSEP NHQHQNFAER RIATIEANTN NILNLSGAPD SAWLLCVTYV CYVFNHLAHE SLDNRTPLEV LTGSTPDISV LLQFHFWEPV YYKLENATFP SGGTEQQGRF VGIADSVGDA LTYKILTHTT NRILHRSSVR SATIPGQTNL RLTPQDGESG PKPINFIKSR RTENKNSYAI KELPGFTPDD LIGRTNHAEN IGPRQALGCK VPCRNQ
|
| |