Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43446 |
Symbol | |
ID | 7197446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 471622 |
End bp | 473916 |
Gene Length | 2295 bp |
Protein Length | 747 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177626 |
Protein GI | 219111749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATCTGGCG ACAGCATGGT CTTCGCCAAG CTCTTCGAAG ATACCCAGAA GAGGAATAGG GATGAAGCTA AGGTGTCGAC GGCTTCGTCG AATACCGAAG CTGCTTCGCC TCCCAAGTCG CATTCCGAAA CCGTCACCAG AAACAGGAAA TCTGACACGC CTCCTTTGAA AGGAAAGGCT CGATCTACCG ACCACAATAA GTCTAGTTCT TTGTCGTACA AACGTCACCG TGGTGTCAAT CCGGTGGAAC GCCAGAATGG TTCGAGAAGG AAAGTAGTAT CGCAAGAAGC GCTCTCACTT TCAAGGCAAC TACAGGAACT CTCTCGGCAA AAGCGCCTCG ACCAAGCCCT CAAGGTCTAT TGGAGTAGTC AGAACGACAA AATTAGGGAC AGTCATCACG CATGTATTGT AGTGGACTGC TGCTCGCGAT GCGGAATGGT TGAGAAGGCC GAGAAAATAG TCGCGGACCT GCGCCGAAAA GGTTCAATCG TAAACGTGGA AACCGAAACA GCCCTTTTGA AAGGGTACGC GCATAGCGGA ATGCTCCATG CAGCGATGGA CATGTTTCGA CAGATGTGTC AGTCCAAGTT TAATAGACCC AACGTGCGTA CATTGAACAC ACTATTAAGA GGATGCATGT GGACTGCTTC GACGAAACGC GACGGTCATG TTGCTGGTGG TGTGATATCA AGTGAAGAGG CCTGGCAGCT CTATACGTCT ACATCGGGCG TCGATACTCT CGATTCATCT TCCTATGAAT ACTCTATTAC ACTTTTGACC CAAGCGCTAA GGATAAAACA AGCGACACGC CGTATTGAAG ACTTCCGAGC TAAGTATGGT ATCAACCTAA AAGGGAAGGC CAGTTTTAGC GGCGGGGACC AGTCTGGAAC AGAAACACTT GCGGTTGCCT ACCTTGGTTT GGCTCGAGCT TTTGCTTTAC GCGGAGATTC ACCAAATACG TGGCTAGCCT GCCAGAGAGT TTTGAATGCC GTCAAACTCT CGCAGACTTA CCTGATGGAA CTTGAGCAGG TGGCGTCCAT CGAATCCAGT GGCAAAAAGA GACGTCGAAT GGAAGCCCAG GGCGGCAAAC GTGCTTGGAA GAAGGGCGGT GACGACAAAA GCGAGGAAAA GGACAATCGT AGGGCATCAT CAAATTCCAC GTTTCGAACA CATCGTCTAT CGGAAATTGC AGCAGAGGCG AAAGCTTTGA TTAAAGTTAG AGGGAAGTAC AGTGCTGACT TAAAGCCCCA ATCAGATTTA GCATCGCGGC TCATGGTATC CTTGTTTTAT TTTTCGGGGG GTGGCACAAC CAACATGATA GCGAATAAAG AACTGAAGCA ACTTACAGCT TCGAAACGAG AAGATCCCTT TGCCTACATT ATTCCAACGT GGACTAGCTT TGGCTTGTCA GAACTAACAG AGCGGGCAGA ATCCATCGTT CATCTCGACG AAAATAGCAT TCGAGACGGG ATTGGAAGTG TTGATGTCAG GGCATTGCGA GACGATGGGA CTATTAATTT TGACACTGTC TTTGGCACCA CGGCAAAGGG TCGCCCATTG GACATTGAGC TTGGTGCCGG TTTCGGTGAC TGGATTGCTC GACAAGCGTT CCACAGGCCA AAACGAAACC ATATTGCCGT TGAACTCCGT GCCGATCGTG TACATCAGAT CTTTGCGAAA GGTACGTTGC AAGCTACGCA ACCGCTCGAT AATCTGTGCG TTGTGGGTGC GGAGAGTGGT GACTTTTTAA AAGATCGGCT TCGTTCCGGA TCTCTAGCCA CGGTCTTTGT GAATCATCCC GAGCCTCCCA CACAAACATT TGGCGGAGAC CGGAGTGAGC TCGAGGCAAT CCAAAAAGGA GGGGGAGAAC CCGCTCATAT GCTGACTAGC GGAACATTGG AAGCTGCCGC GAACAGTCTC CATTCCGGAG GCCGTCTGGT CATTGTCACG GACAATCGCT GGTACGCTCG TCTTCTGGCC TCGACCTTGC TCAAAGTGGT GCGACAAAAG CCGGACCTAT TCCGACCCCC TAGGCCTAAG GAGTTTCATG CGTCCAATTT GCACCAAATG GAATATTTTG GGGGCAGCAC TGGCCAAGCA GGTGTCCCAT TGTACGAAGG ACAGCCTAAC GAAGGCATTG GTCACGTCAA GTACGATGTC AGCACAGGCG CCAGCTACTT TGATCGGCTC TGGAAAAGCG GTGCAGGCTT GCATGCGGAA CGGCAAACGA GGTTCATCCT CATCATGTAC CGATGTTAGA GTCACATTAA CTGTAAGCAT ATGTTGATTC ATACT
|
Protein sequence | MVFAKLFEDT QKRNRDEAKV STASSNTEAA SPPKSHSETV TRNRKSDTPP LKGKARSTDH NKSSSLSYKR HRGVNPVERQ NGSRRKVVSQ EALSLSRQLQ ELSRQKRLDQ ALKVYWSSQN DKIRDSHHAC IVVDCCSRCG MVEKAEKIVA DLRRKGSIVN VETETALLKG YAHSGMLHAA MDMFRQMCQS KFNRPNVRTL NTLLRGCMWT ASTKRDGHVA GGVISSEEAW QLYTSTSGVD TLDSSSYEYS ITLLTQALRI KQATRRIEDF RAKYGINLKG KASFSGGDQS GTETLAVAYL GLARAFALRG DSPNTWLACQ RVLNAVKLSQ TYLMELEQVA SIESSGKKRR RMEAQGGKRA WKKGGDDKSE EKDNRRASSN STFRTHRLSE IAAEAKALIK VRGKYSADLK PQSDLASRLM VSLFYFSGGG TTNMIANKEL KQLTASKRED PFAYIIPTWT SFGLSELTER AESIVHLDEN SIRDGIGSVD VRALRDDGTI NFDTVFGTTA KGRPLDIELG AGFGDWIARQ AFHRPKRNHI AVELRADRVH QIFAKGTLQA TQPLDNLCVV GAESGDFLKD RLRSGSLATV FVNHPEPPTQ TFGGDRSELE AIQKGGGEPA HMLTSGTLEA AANSLHSGGR LVIVTDNRWY ARLLASTLLK VVRQKPDLFR PPRPKEFHAS NLHQMEYFGG STGQAGVPLY EGQPNEGIGH VKYDVSTGAS YFDRLWKSGA GLHAERQTRF ILIMYRC
|
| |