Gene Information |
Locus tag | PHATRDRAFT_46387 |
Symbol | |
ID | 7201644 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 182275 |
End bp | 185543 |
Gene Length | 3269 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180773 |
Protein GI | 219120052 |
COG category | |
COG ID | |
TIGRFAM ID | |
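
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, that the reported protein length excludes the stop codon, and that the gene span runs from start codon to stop codon with no untranslated regions. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table above.
START_BP = 182275
END_BP = 185543
GENE_LENGTH_BP = 3269
PROTEIN_LENGTH_AA = 623


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = span_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

# Under the assumptions stated above, the remainder of the span
# is what is left over for introns in this spliced gene.
non_coding = span - cds

print(span)        # 3269, matches the reported Gene Length
print(cds)         # 1872
print(non_coding)  # 1397
```

If the span had included untranslated regions, the remainder would cover those as well, so the last figure is best read as an upper bound on total intron length.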
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.7871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCACA TAACCAAATG GTGCTTTCCG GGAATTAAGG ACTGGTTGAC AACATTTGTG ATCGGATTTT TAGTAACAAG CTACTGGCGT GGGACATGGA CACTTCTGGA CATATGGCTG TGCGACCAAC CCGCAGATGC TGGGCTTACG TCAGCTGACT CGTTTTGCTT TGCTGGCCTT CCTGATGAAG CTGTGAAGCA TCGAAATTCT GGGTGGCTTT CCATGGGAAT AGGAATGTTT TTGACTGCAA TCGGTGTCAG CCTCATGTGG TTGGACTTTT GGAGGCCTCA GGTGTCGAAT GTAAAGCACC GAGTGCAAAT TCCAGCCCGT CGAATTGTCA TTCGTTTCTT ATTAGTGTAC ACTCTGGGCA TGGCATCCGT CAACATATGG CGTGGCGTTT GGTATTTGAC AGATTATTTT CTATTGCCCA ACAAGCTCAC TGCAACTGAA TGGGGAGATT TTCCATTAGC ATCTTATTGG GTCTCTTCGG TAGTTGGTTC GACAGTCTGC TTCCTTTTTT ATGCTGGCCC GTGTTTATTG GCCCCGCCAG CAATCTTTTT GATTGATGGT CCTGGAATCA ACCCACCGTA AGTTTACTAA AACATATTGG CATCTCACAT TGCCTTACAT GTGCTTACGC CGTGTCTCGC GGTTCCAGAC CTATTGCAGT AACGCTGATT TCATCCTACT ATTCCCTGAC GCTACCAGCG GATCACAGTA TTCCGGATTT ATCGCACACA GTGATTGCGC TAGATTTGCT TTTTAGCTTT CTCGGCGTAC CAATCATGGT AGTATGGTTC TGGAGAGGCA GTTGGTTGTT GCTTGACTAC TATTTATACG GATTTTCACC AAATTCGCAC GATGTTCACT TCTCCATACT TTGGTCATCC ATTGTTGGGG TTTCTTTCCT GATCGTATGT AGTGAAACCA TTTTTGCCTA CATAAGGGTT CGCAACACAG TTGTTCTACT ACTTCTTGGT CGGCTGAGAA CTTTTATCCT GGCTTGGGGT ACGGTCAATT TTTGGAGGGC TGTTTGGTAT ATTTGGGACG AGTTTTTGGG AGGTTCCACA CAATGGTCCT GTTGGCTGGC ACATGCTGCA TCTATTGCGT TGCTAACAAG CTTCGGTTGC ATGTCTTGTA TTTTGGGTAA GTCAAGGCGC CTACTCTCCG CCCGATGTAT TTAACCCAAT TACCTCCTTT GAACTAATAA ATGTCATTTT TCTTGAACCA GCCCCGGCTT CCACTCTCGG AGTGGACGCA GTACCTCATA AGGATTGTGC TGATGATCCT CTCTTCAGCA ATCTTCCGGT GCCCGCTGAT GACTTGTTTA TGTTTGGAAT TGGACGACAA CCACTAACTC TGGAGCAGTC TGCGTGACCA CGAAAATCTT TGGAAAATCA AAAGCACCAT TTGCAGACCT CTTCGAAAGG GCTGAATGTT GATGGCTCTC CTTTAACACG CACCTCCGTG GCAATATCGT TGGAGAGCAA GAATGATGAA GGTGAAGCCT GGCGTAGCGG GGGGACGGGT CGTCGGGGAT CGTACAGCAG CGTTCGCAGC GAAGAAATTT CTTTGTCGAA CAATTCTTAT CTCGGCCGTC AGCGTCCCGA TCTTGATCGA CGTGTCAGTC GAGCTGACCT TGTCTCTTCG GGGCAATCTG TGAGACGCAG CAGCCAGTTT TTTCGAAACA GATAGAAGTA AGACAGGGCA ATTGTGAGTA GAGGCGCTCA ATCGCAAGAA CTAACATTCA ATGATGTCCT TCTCTCCCTT TTTTGCGGAT GTTCACTTGT ACATTTCATT 
AGCGTTAGAG GAGATTCACT AATCCGGCCT CGATGAGGCC GCAAAATCAC TAATCCGGCC AAAATAAATT TATTAAGTAC CAAAGGAAGG ACCACATTGC TTTCCTTACT AGCGATGATT ATGAAAAAAT ATGGATCTTA AATAATTCGA GTATTTAAAG ATCACATTCT ATGTTAATTT TTGAGATGCT AGATTTTTTA ATTTTTAGGC CTGCGATGGA CGATGTTCTG CAAAAGACGC TTCGGTCGAA GGAGGCAACT TGACGGTGAG AGACGATCCA CTGAAAAATC TTTACGAACA GTGGCTTGAG AAGTAAGTTT CAAGTTTATT AATGTATGTG ATCCAATTAC TTAGGAAGAA ACTCTGGCCC TACCTTAGCT ACTTACAGTC AACGAAATCT AGGTTTTGGC CGGATTAGTG ATTTTGCGTG CTCATTGAGG CCGGATCAGT GATTTTCCTC TTAGCGTTAC TATTGCATGA ACGAATCGAT CAGTAGTTTT TTAAAGAGAA ATTTCGGTAT TTGGGATGAG CAGGGGTGAA ATTTTCGCTA TTTGGGAAAA ATCACACTGT TTCTAAGTGT TTTTATTCTC GCGGGAAATA CTTTATAAGT ACATTTTTTA ATTAAGAACA ACCCCATATA CTATGCATAG CCATTTAGAT TCAATTCACC AGTTCCTTCC ATGAAATTTT TGACTCTCCA CCCAGCAGCC AACTCTTTCT TGTTCACAAC TGTGAAGTAC AACACTGGCT GAGATATTGG ACAGCGGCGA CAAAAAGGAG AAAAAACAGA AAGCAACGCT AGGCTTTGAC AGTAAGGGGG ATCAGCACGA ATTCATCCTG TCACAAAGGG GGTTTGTTGG GATCAGCTTT TACTCGCAAG CCGCCGAAGA GATATCGGAC AACGAAGAAC CCCACGAGCC GTTGTTCTCG GGCAAAGCTG CGAATGAGCA AGCACTCGGT CCGGCACAGT GCATTCTAGA AAAAAAAGAT GTTCTCGCTG GCCGGGCATG TTGCCAAGCC CGTCAGCCAG AAGAAGATAT CGTAGCGTTT TACCAATCCT ATTGCACGGA GACGCCGCCT TTGCCGGGCA AGGGGTAGTC TACGAGACCA TGCAAATGGC CGAGGTGCCC GATTTTGATG TCGGTGGAAC CATTCACGTC ATCATCAACA ATCAGATTGG CTTTACCACC AACCCCCTAC ATTCGCTCTC AATGCCCTAC TCGTCGGAGT TGGGCAAGGC CTTCAATTGC CCCATCTTTC ACTGCAACGG CGACGATCCC CTGGCAGTAT CGACGGCACT CGAGACCGCC GTCGAATGGC GTCACGAATG GGGCATGGAT GTCATTATCG AGATGGTCTG CTACCGTTGT AATGGTCCCA ACAAATTGGA TCAGCCGGCC TTTACACAAC CCAAACTCTA TAAGGAAATC TCTCAACACC CACCAACCCT GGATATTTTC GAAAAGTGA
|
Protein sequence | MFHITKWCFP GIKDWLTTFV IGFLVTSYWR GTWTLLDIWL CDQPADAGLT SADSFCFAGL PDEAVKHRNS GWLSMGIGMF LTAIGVSLMW LDFWRPQVSN VKHRVQIPAR RIVIRFLLVY TLGMASVNIW RGVWYLTDYF LLPNKLTATE WGDFPLASYW VSSVVGSTVC FLFYAGPCLL APPAIFLIDG PGINPPETIF AYIRVRNTVV LLLLGRLRTF ILAWGTVNFW RAVWYIWDEF LGGSTQWSCW LAHAASIALL TSFGCMSCIL APASTLGVDA VPHKDCADDP LFSNLPTSSK GLNVDGSPLT RTSVAISLES KNDEGEAWRS GGTGRRGSYS SVRSEEISLS NNSYLGRQRP DLDRRVSRAD LVSSGQSACD GRCSAKDASV EGGNLTVRDD PLKNLYEQWL ENGDKKEKKQ KATLGFDSKG DQHEFILSQR GFVGISFYSQ AAEEISDNEE PHEPLFSGKA ANEQALARRR YRSVLPILLH GDAAFAGQGV VYETMQMAEV PDFDVGGTIH VIINNQIGFT TNPLHSLSMP YSSELGKAFN CPIFHCNGDD PLAVSTALET AVEWRHEWGM DVIIEMVCYR CNGPNKLDQP AFTQPKLYKE ISQHPPTLDI FEK
|