Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45953 |
Symbol | |
ID | 7201026 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 762588 |
End bp | 765872 |
Gene Length | 3285 bp |
Protein Length | 857 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180112 |
Protein GI | 219118689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.535873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGAGCGAAA GCGTAGGAAA GAAACGAGTG CATTGACTGT GAGTTGCTAT TCGATCTTCC CTACATTTGC GGGATCATCT GGTTCGAGAA ATCTCGCACT TCTTGTATCG ATCCGCCATG ACGACACTTT CCTCTACTAC ACCTAGACCA AGCGTGCCCA AACGAGCGGA TCAACGATCG ACAATCACAA TTCTTTTATT AGGCGATGGT ACGTGGAAAA ATCAGTGAGA ACGATGAAAA AATGTGACCC ATTGAAAATG TGGATAGCTT GATTGAGTCA CTGAAGGCAG CAATGTATCC AAATGTTTGC CTATTGAGAG ATTTGTAACA AAATCACGTT TTTACAGCGG CCTGAAGAAA ACCGCAAGCC GTTTTCTGCA GTAGAATTAG CTCGATTCCC CTTTCATTTT TATTTGCCTA CATCTCTATC GAACGGAATC TTTCTGATCT CGTATTTCTT ATATTGCATC TTTTCTCTAC AGAGGGAGTT GGGAAATCCT CGTTGATATC CACCTTTGTC TCTCGATACT TTTCTGAGGT CGTCCCTGGC ATCATGACTC GTGTTCGTTT GCCGCCGGAT CCAGAACTGT CCTGTGTCAC CACTATAGTA GACTCGCAGG GGGGCGACCT TGCCTTGCTA CAGGCAATGG CCACTCGCCG CTCTATGATG CAACACCATT CGTCGGTGCA CGGTAGCACG GACTCACTAG CCGCGCTCAT GGAGCGGGCG GAAACCAGTA TGATGACCCA GCAATCGTCC GCTCCAGAAC AAACAACCAC GCCGACCGTT AAATCTTCTG GTATCGAAAA CGTCGACTCA ATTGTTTTGG TGTACGATTT GGACCGAGTA GAAACTTTTT TTCGTTTGGA GAATCATTGG TTGCCTTTGA TTGAAAGATG TTACAACGGG AAGGTAAGCC GATTCTCACA CTGTTCGTTC CGGATCACTG TGTTGCTACG CTGCTCCAGC GAACCATATC TTACTCACAG TCACAATGCC GCTGCAGGTT CCAATCATTG TGGCGGAAAA CAAACTAGAT CTCTTTCGCC CTTCCAGTAC GGCGGGGATG ACGGACGAGC AAGCTGTAGC GCGACAACGA CAACAGATTG TCTCCCTCCT ACAACGATTC CCATTTGTCC GACAATGCAT CAAGTGTAGT GCCAAGAACT TGGTACGGGT TGATGATGTC TTTCTGAAGG CGCAACAGGC AGTCCTCTAC CCCTTCACTC CGCCCTTGTA CGATCTCGAA CATGGACGCT TGACAGAGGA GTGCAAAAGA GCCTTTACTA GGATCTTTCG AATGTACGAT TCGGATCGTG ATGGATTGTT GAGCGACGTT GAATTGAATC GCTTTCAGAT CGAAACCTAT CACGTAGCAG TCTTTGATCG GGATTTTTCG GCCTGGAAGA AGGTAGTGTC GCGCAACAAC CCCACCGACG AAGTTGTGAT TCAAGACGGC AAATTCACAA TCGCTGGTTT TTTTGCAATT TTCGATCTCT TCATCAGTCA AAATCGACTT GATGTTGTAT GGCAAGCTTT GCGCGAGTTC AATTATGATG ACGATTTGAA TTTGCATATA CCTGAAATTG TTACAGCCCC AACCGACGAC ACCAGTTGGA AGTTGTCATC GGGCGCGAAA AGATTCTTGT CAGGTGTCTT TCGTCAATTT GACCAGGACC AAGATGATGT TTTGACTGCA GATGATATAG GGAACATTTT TTCGATTCTG CATCCACCCG CTCTTCCTCC GTGGCATCCA GCTCGCGCTC CATTTTTGTT CGCGGGTTGC TTTTCACTGC CCAAGCAGAA ATATTCGCCA GGCACCGAAA GTCCTAACTT TGGAGGCAAT GTATCCTTGA TTCTCCCAGG TTCTACCCCA ATGGCCCAGT CTCTATCAAA CAGTGGAATT TCTATTTTAA GTGCTTCAGA TTCCCTACCG AGCGTTGCCT TGTCGGGAAT AAGTGTTTCG GAACCTCTCA CGTTCTTGGA ATGGATGGGA CACTGGCACA CAATTGCTGC TATTTCGCCG TCAGTGACTA GAGCGGAACT GTTTAGGTTG GGGCATAGCG AGGAGTCTCG CAAAACTGAT CCTCGGCCTC GTCGAAGTCG TAAGAAGAAA TCAGCTTCTA TCACCCCAAG TCAAGCGCCA TCCGATGCCA CTTTTCCCTC CAGTGCCATT AGGGTTTTGG TGCTAGGCAG CGGCTCTTGT GGCAAAACAG CTCTATTAAA TGCACTATGT GGCTCGATGG AAAGCACCGA AGTTTCAGCT ACCAACACAA CAAGCACTCT GCATCCCGAG ACAAGCAGCA CATACGTAAA GATCGGTAGG GGGCAATCGC TTGGCCATCA CGGTACCTGC AGCCCGTCCA AGTCGCATGA TGTAGTTGAG GAAATTGTAG CTCATCTCGT TTTCACAGAC GTCCCTGAGA CTGCTGCTGT CAGTCAGAGG GAACATTATC GAGAATTATC CGAGCTCTTT GGCTCGACCG CGTCTCCAAA AGATCGCGTC TGCGACCTTG CGATGCTAGT GTTTGACGCT TCGAGTCCTT CGAGCTTTGA ATTTGCCAGA GAACTAGAAG CAAAGCTATT GACACAGGAG ACTCCTCGTG TTTTTATCGC TACGAAATCA GACAAGATAT CTGCTCCCGA ACCAGAGGAT GGCGACGCAC AGGCTGCGAA TGTGTTGGAA ACTGCCACGA TTCATTGTCG AGAATCCGAC TTGGAACTGC CGCTCTTGAC GTCGGCCGCC GACGGCTCAC TGCTGAATTT TGAAAAGCGC AATGCTACTC TTGACCACTT GGCGCGTTGT GCCCTGGTCG AAGCTGGAGT GACACGCCTA AAGTCGAGGC CGCACGAAGA GAAGCAACGC CGCGAGACTA ACCGCCGTCG CAAGATGATG TGGCTCGGTG GTATCGTAAG CGTCGGTGTC GTTGTTGCTG CTGGTGTAGG TCTCCTTTGG GGCAGTCATG CGACAAAAAA GGAGCAGACG AGTGGCTTTG GATGGTTGCG TAACTGGTTT GGAGGTACAA CCCGGGGTAA TTCACCGCAG GCCATGTAGT TACAGTGATG TTGCTATTGG CAAGACTTTC CATCTCTCGC TTACCCTGCC ATACTAGGTA AAGACTGAGA AATTTATAAT TGCTTTCGAA AAAAAGCCTT GCACCCGATT TCGGTATGCA TTACGTCGTC ATAAGTGTTA TCCTTTGGTC ATGAAACTAG TCTAAACCTC GGGCACATTG CAATATGTGC TCACTTATTT TATAATAAAG CACTTTGCAA ACTGT
|
Protein sequence | MTTLSSTTPR PSVPKRADQR STITILLLGD EGVGKSSLIS TFVSRYFSEV VPGIMTRVRL PPDPELSCVT TIVDSQGGDL ALLQAMATRR SMMQHHSSVH GSTDSLAALM ERAETSMMTQ QSSAPEQTTT PTVKSSGIEN VDSIVLVYDL DRVETFFRLE NHWLPLIERC YNGKVPIIVA ENKLDLFRPS STAGMTDEQA VARQRQQIVS LLQRFPFVRQ CIKCSAKNLV RVDDVFLKAQ QAVLYPFTPP LYDLEHGRLT EECKRAFTRI FRMYDSDRDG LLSDVELNRF QIETYHVAVF DRDFSAWKKV VSRNNPTDEV VIQDGKFTIA GFFAIFDLFI SQNRLDVVWQ ALREFNYDDD LNLHIPEIVT APTDDTSWKL SSGAKRFLSG VFRQFDQDQD DVLTADDIGN IFSILHPPAL PPWHPARAPF LFAGCFSLPK QKYSPGTESP NFGGNVSLIL PGSTPMAQSL SNSGISILSA SDSLPSVALS GISVSEPLTF LEWMGHWHTI AAISPSVTRA ELFRLGHSEE SRKTDPRPRR SRKKKSASIT PSQAPSDATF PSSAIRVLVL GSGSCGKTAL LNALCGSMES TEVSATNTTS TLHPETSSTY VKIGRGQSLG HHGTCSPSKS HDVVEEIVAH LVFTDVPETA AVSQREHYRE LSELFGSTAS PKDRVCDLAM LVFDASSPSS FEFARELEAK LLTQETPRVF IATKSDKISA PEPEDGDAQA ANVLETATIH CRESDLELPL LTSAADGSLL NFEKRNATLD HLARCALVEA GVTRLKSRPH EEKQRRETNR RRKMMWLGGI VSVGVVVAAG VGLLWGSHAT KKEQTSGFGW LRNWFGGTTR GNSPQAM
|
| |