Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39281 |
Symbol | |
ID | 7195024 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 112755 |
End bp | 114852 |
Gene Length | 2098 bp |
Protein Length | 671 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183293 |
Protein GI | 219126080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0784232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTCCAT GGCATCGTAC TCACAGTCAA GCATCGATCG CTCTTCGGGG TAACGTGGGT AGTAGCAGCT GTAAGCAAGG AGTAGCTCAA GGGCTCAGGT GCATCGCCTG TTACTTGGTC AAACGCATTC TCGGAAATAG GGGTCCTCTC AGGCGAATTT TGTTTGGCAT CGGAGTTATC ATACTCCTAA CGACTGCGAG GCAGGGAAGG TCCTTCGCCA TAATTCTTAA AACGAACACC AAAGAGAGTT GGATAAAGCT TAAGCTGGCT CCTGCTTTTA ATCCTGCTGT GAAAGCCGTG GGTGGTAAAG GACCAGTGCG CGCATCAGAG CGTAATCGCG CAAATTTTTC TTCATGTAAC ACAGATCCTG GCCAAGCCGA GAATGCTGAG GGAAGTGCAA TGGGCTACAA TAATCTTCTC AGAATCCGCG AGATCAAGAT GAATGAAAGT TCAGCGAAAG GGCAACGACT TCTATGCGTA TTTCCTTCAT TTCCTTCGGT TCAGCTTCAG AACGCTGCGG TAGAAACGTA TGGAAAGAAT TGCGATGGTC GTCTCGTAGT AAGCCGCAAT ACGCCGCTGG TAAGCTACAG TTCCCAGACG TATTGGATAA ACGAGAGAGA CCCTACATAT AGAGATGGAG CGGGGGCAAT ACTTCAGCAA ATTCTGCACA AAGTCGGAAA TGAGTATGAT TGGTTTTATT TTGCTCAACC AGGGCGCTAT TTGATCGGAT CCAATGCCCA GATTCGCCTA TCCTTTACGC GCAACGGAAC CGATTTAGCA AGGAAGAATC CGTTAGCGCT GCGAACATAC ATACACAATA TTTGGGAGTT TCCCAAAAAG GTTCATGCAC GAGGGAGTTG CCCTGGAACT GAGTTTTTAA ACCGCGCTGC CGTCGATGAA CTCGTTCACG TCTATCTCAA GAACAAAGAG TCTACATCAG GAGATTCCAA TTTGATTACA TGGATACAAT CAGTGATCTC AGACACGGGC ACACGATGTG TGAAAGGCCT GCTAGCAAAA ATGCCAAATC CACAGAAGTA TGCCGAGCTG CTACTGTCGC CCAATGCGAC ATTCGAGCAA CAGGCCGTGG GACTGTATCG AATTCAATCA ATACTGAACG GTACATGCGA CCGCCAATGG CGAGAAAAAT TCGGATTTCT GAATGGAGAT GTTCGGGATG TAACTTTACT GCGCAAGCAT GGCCCAGCGT TTGATTTCTT TCCTGCAGGT TTTCACAAAT CCGTTTGTGA GACCCCCTTC GGAACAGGAA CCGAAGGACA CATGGGATAC AGGGGTTTGC GAAAGATCCA GATTGCGCAG CAGTCTCAAG AGAAACGAAT CTTGTGTATG ATTTACACCC ACGAGAGTCG CCATGAGCAG TTGCGCTCAA TAGTAGAGAC ATGGGGCAAA GGATGCGACG GGTTCTTTGC AGCTTCAACA AAAAACGACG AAAGTTTAGG AGCAATCAAC TTACTGCACG AAGGTCCGGA GCTATATGAT AATTTGTGGA TGAAAGTCAG GGCTATGTGG CAGTATGCTT TTGACCATTT CTTGAACGAC TACGATTTTT TCCATATTGG TGGGGATGAC CACTACGTCA TCGTCGAGAA CCTAAAATAC GCTGTTGCTA CGGGAAATTG GAAAGAACAC TGGAACCAGA GTGTTCCTCT CTTCTTAGGT GGATCAGTTG CGGACCACGC AGACCTCCAA AGACGATACT GCGGAGGTGG CAGTGGCTAC ACTTTGAATC GTATAGCTCT ACGAAGGCTG GTAGAAGAGC TTTTTCCCAA GTCACAGTGC TGGCCGCATT GGACATCAGC TCAGGAGGAC AGAATCATGG CAGGTTGCTT CCGGTCAGTT GGAATTCAGT GTATGGATAC AAACGATTAT AAAAATGAGA CCCGCTATCA TCCTTGGGGC GTGGACTATC ACGCTTCTTG GACAAAGAGA AAAAAAGGAA ACTGGCACCC AAAAGTTTTG GAAACAGTTC ATGGGATTGC GCAGCCCGAA GGCTTGGCAC AAATATCAGA TTCTAGCGTA TCCTTTCATC TGAAACCACT CCGCACAGAT CCTGATTTGC CGCCCGATCG AGGAATGA
|
Protein sequence | MCPWHRTHSQ ASIALRGNVG SSSCKQGVAQ GLRCIACYLV KRILGNRGPL RRILFGIGVI ILLTTARQGR SFAIILKTNT KESWIKLKLA PAFNPAVKAV GGKGPVRASE RNRANFSSCN TDPGQAENAE GSAMGYNNLL RIREIKMNES SAKGQRLLCV FPSFPSVQLQ NAAVETYGKN CDGRLVVSRN TPLVSYSSQT YWINERDPTY RDGAGAILQQ ILHKVGNEYD WFYFAQPGRY LIGSNAQIRL SFTRNGTDLA RKNPLALRTY IHNIWEFPKK VHARGSCPGT EFLNRAAVDE LVHVYLKNKE STSGDSNLIT WIQSVISDTG TRCVKGLLAK MPNPQKYAEL LLSPNATFEQ QAVGLYRIQS ILNGTCDRQW REKFGFLNGD VRDVTLLRKH GPAFDFFPAG FHKSVCETPF GTGTEGHMGY RGLRKIQIAQ QSQEKRILCM IYTHESRHEQ LRSIVETWGK GCDGFFAAST KNDESLGAIN LLHEGPELYD NLWMKVRAMW QYAFDHFLND YDFFHIGGDD HYVIVENLKY AVATGNWKEH WNQSVPLFLG GSVADHADLQ RRYCGGGSGY TLNRIALRRL VEELFPKSQC WPHWTSAQED RIMAGCFRSV GIQCMDTNDY KNETRYHPWG VDYHASWTKR KKGNWHPKVL ETILICRPIE E
|
| |