Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47999 |
Symbol | |
ID | 7202999 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 658524 |
End bp | 661904 |
Gene Length | 3381 bp |
Protein Length | 955 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182439 |
Protein GI | 219124287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGAGGACT CTGGGAAACG GTTGTCACAC ACCTTCTCTC TCTCTCTCTT GCAATACTTG TTTTAGCTCT TTACCCGTTG TGATCACAAA TTAAAACCCG TTCGTCATGT CTACTACGAA TGCGGAGGAC GATACACCCG TCTTGGAATG GCCGATGGAA AAGGTTCGCG ATACGTTCAA GAACTATTTC GTGGAACAGC ACGGACACGT CTTTTGGCCT TCGTCGCCGT GCGTCCCCGT CGATGATCCC ACTCTACTCT TTACCAACGC CGGAATGAAT CAGTACAAGC CTCTATTCTT AGGTACGTAC GTTCGCACGA TACGATACGA GATGATGCGA TCCCTACAAA ACATGGTTCC CGATGGAGCA GGCGTCGATG GAACCAAAAG CCTTTCCAGA GAACGTTCGA ATCGTCGTGT CTTTCTCCCG TGAGGCCAAT GCGACCTTTT GTATCCAATA TCTGCTAACG ACGTTCTCGT ATTTACACAG GAACCTGTGA TCCCAATCTG GAAATGTCCA AACTGACCCG CGCCGTCAAT TCGCAAAAAT GCATTCGTGC TGGAGGCAAA CACAACGATC TCGACGACGT CGGGAAGGAC GTCTACCATC ACACCTTTTT CGAAATGCTC GGCAACTGGT CCTTTGGTGA TTACTTCAAA GCTGGTGCCA TTGATATGGC CTGGCAGTGT TTGACCGTCA CCTTTGGACT GGACCCGGAA CGACTATACG CCACTTACTT TGCCGGAGAC GAACTCACCC CGGTCGACGA AGAAGCACGT CAACTGTGGT TGCGATACCT CCCCGACGAT CGCGTACTCC CCTTTGACGC CAGGGACAAT TTCTGGGAAA TGGGTGCCAC CGGACCCTGC GGACCTTGTA CGGTACGTGG CTAGCGGTGT GCACGGTATA CCGACGGACC TTTGTTTTTG TCGATACATT GGCACTCACG TGTCTTTTTT TGTTCCGTCG TTGGTTGTCT AGGAAATTCA TTACGATCGA ATCGGTGGAC GCGACGCCTC CAAACTCGTC AACGCCGATT TGCCGGATGT GATGGAAATA TGGAATGTTG TCTTTATTCA ATACAATCGT GAAGCCGATG GTTCCCTCCG TCCACTCCCG GCACAGCACG TGGACACCGG GATGGGATTC GAACGCCTGA CGTCAATTCT CCAGAATGTC GATTCCAATT ATGATACCGA TATCTTCATT CCACTCTTTA CCGCAATTCA GAACATTACG GGCGCCCGTC CGTACGCCAA GAAGGTCGGC AAGGAGGATC CGGAATACAT TGACATGGCC TACCGTGTGG TGGCCGATCA CATTCGAACC CTATGTTTCG CCATTACCGA CGGTGCCGTT CCCAGCAATG ATGGTCGTGG TTACGTGCTC CGACGCGTCT TGCGTCGTGC GGTGCGGTAC GGCCGTCAAA ATCTCGGAGC CGAACTCGGG TTTTTTGCCA AACTCGTTCC AGCCTTTGTT GACGTTATGG GATCGGCGTT CCCGGAAGTG GTGGAAAAGC AAGAATACGT CACGGGAATC ATCCAGGAGG AAGAAGAATC GTTTTCGCGA ACGCTCGATA AGGGCTTGCA AAAGTTTAAC GAATTGGCCG AAAAGGTGGG AGCGGACCAG ATCTTTTCCG GTGCCGACGC GCATTTCTTG TACACTTCCA TGGGATTTCC CGTCGACTTG ACTGAACTCA TGGCGGAAGA GAAGGGCATG ACACTTAATA AGGAAGAGTT CGAAGCCAAA ATGCAGGAAG AGCACGATAT TTCGCAAGCG GCACATTTGG CAAAAATGGC GGGTGGCTCC GGAAAGGATA TGCGTCTGGT CGCCGAGCAG ACATCTTATC TCGTCGGCCA GAATATTAGT GCCACGGACG ACGCGGCAAA GTATGTGTGG CACGAGGAGC TGGCTGACTG TGTGGTCAAG GCATGCTTTA TTGGTCGCAA CGAGACGGAA GACATGATTG GATTCGTCGG TAGTATTTCT CCAGAAAGCA GTGCCGTTGG TATCGTCCTG GACAAGTCGA GCTTCTACGC CGAAGCGGGT GGACAGGTTT ACGATGTTGG TACCTTGACT TCATCGACTG GTGCCGTTGT CAAAATTACG AACGTGCAGG CGTATGGACA GTTCGTCCTA CATCTTGGTG AGGTCGCATC CGGAACGCTG TCGGTTGGCG ATACTGTCAA ATGCAGCGTT GACTACGTCC GTCGTGCCCC AATCGCTTCC AATCATACAA TGACGCATGT CCTTAATCAC GCGTTGCGAG AAGTCCTTAT CAAACGACCA GAAAAGGAAT CGGGCAAGAC ATCCACTTTG ACCGTCGACC AAAAGGGATC GCTGGTGGAC GAAACAAAAC TGCGTTTTGA TTTCTCGTGG AGCGGCCAGT TGACGCCGGA ACAGTTGGCG GAAGTCGAAA AACTTTGCAT GGATCGCATC GTAAATGCAG TTCCCGTGGA TGCGTACGTT GCACCGTTGG GCGATGCGCA GCAGATTAGT TCGCTGCGTG CTGTTTTTGG TGAAAAATAC CCCGATCCCG TACGAGTGGT TGCCGTTTCG GATCATGCCG TTCCAGAGAT GCTTGCCAAC CCACAAGACA GCCAGTGGAA TGAATATAGT GTCGAATTTT GCGGAGGAAC GCACTTGACG AACACCAAAG AAGCGGAAGC CTTTGTTTTA TTGGCCGAAG AAGGAATTGC TAAGGGAATT CGGCGCATTA CGGCCATAAC TACCGGTGAG GCTAAGAAAG CAATCGCTCT TGCCAACGAA TTTGAGAGCA AACTAACGGC TGCAGAAGTA GTTCAAGGAG ACGATTTGGA AAGCACTGTA AAACAACTTT CTGCCGAATT GGATGGATTA GATATTGCCG CCGTGAAGAA GATGCAGTTC CGTGAACAAC TCGCCACCAT GACAAAACAG GTCTTGGCGT ACAAAAAGCA GAAGCTTGCG GGTATGGCGG ACGAGATTGT CGACAAGGCT GTGTCTGTCG CTGCCGAAAC AGGCGGCAGT AAAGTCGTCA TGCGTTTCGA CTTTGGTGTG GAGGGCAAGG TTGCCAAGTC GGTTATGACC GCCTTCGGCA AACAAGTCAA GGATAAGGCT TTGCTGTTGG TTACGGCAGA CCCCGAAGCT GATCGCTTCA TGGTTATTGC GGGCGCACCT AAGGCAATGA AAGACTTGAA TTGTAAAGCA TGGATTGAGG CGGCAACTGA CGGCCTAGAC GCCAAGGGCG GAGGCAAGCC CGACAGCGCT CAATATCAAG TATCCGGAGT AGAGGCAGTT GACACTGTTT TGGAGAAGGC CAGAAAATTT TAAAAACGGA TGATGATAGG TCGTAGTATT TCGTTAACAT TGTGATGCCA TATTAAGACG CCACTTTATT G
|
Protein sequence | MSTTNAEDDT PVLEWPMEKV RDTFKNYFVE QHGHVFWPSS PCVPVDDPTL LFTNAGMNQY KPLFLGTCDP NLEMSKLTRA VNSQKCIRAG GKHNDLDDVG KDVYHHTFFE MLGNWSFGDY FKAGAIDMAW QCLTVTFGLD PERLYATYFA GDELTPVDEE ARQLWLRYLP DDRVLPFDAR DNFWEMGATG PCGPCTEIHY DRIGGRDASK LVNADLPDVM EIWNVVFIQY NREADGSLRP LPAQHVDTGM GFERLTSILQ NVDSNYDTDI FIPLFTAIQN ITGARPYAKK VGKEDPEYID MAYRVVADHI RTLCFAITDG AVPSNDGRGY VLRRVLRRAV RYGRQNLGAE LGFFAKLVPA FVDVMGSAFP EVVEKQEYVT GIIQEEEESF SRTLDKGLQK FNELAEKVGA DQIFSGADAH FLYTSMGFPV DLTELMAEEK GMTLNKEEFE AKMQEEHDIS QAAHLAKMAG GSGKDMRLVA EQTSYLVGQN ISATDDAAKY VWHEELADCV VKACFIGRNE TEDMIGFVGS ISPESSAVGI VLDKSSFYAE AGGQVYDVGT LTSSTGAVVK ITNVQAYGQF VLHLGEVASG TLSVGDTVKC SVDYVRRAPI ASNHTMTHVL NHALREVLIK RPEKESGKTS TLTVDQKGSL VDETKLRFDF SWSGQLTPEQ LAEVEKLCMD RIVNAVPVDA YVAPLGDAQQ ISSLRAVFGE KYPDPVRVVA VSDHAVPEML ANPQDSQWNE YSVEFCGGTH LTNTKEAEAF VLLAEEGIAK GIRRITAITT GEAKKAIALA NEFESKLTAA EVVQGDDLES TVKQLSAELD GLDIAAVKKM QFREQLATMT KQVLAYKKQK LAGGSKVVMR FDFGVEGKVA KSVMTAFGKQ VKDKALLLVT ADPEADRFMV IAGAPKAMKD LNCKAWIEAA TDGLDAKGGG KPDSAQYQVS GVEAVDTVLE KARKF
|
| |