Gene Information |
Locus tag | PHATRDRAFT_49011 |
Symbol | |
ID | 7195273 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 299001 |
End bp | 301223 |
Gene Length | 2223 bp |
Protein Length | 719 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183720 |
Protein GI | 219126973 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAT GCGGAATGTG CGTTGCTACG TCCACAGGTG ACGAGTGCGG CAAACTGAGT GAAAGGGAAA TTCTTCAGAA CGTCGAAGAG CTGAATGCCG CGATAGCTCA AGAAGAGATT TTCTCCAAAG TTGATGTTAG CGATGATCCA CCAAAAACTA TAGAGGTCTC TTTGTTAGAT ACACTACCTG AAACATTCTC AAACGAAGAG CTTCTCCTAG GCACGGTGAA GCCGAAAGAT TCAGCGGAAT TTCTTTCTGA AATGATTACA AGTAGTCGTA TATCCGAACA GCAAAGACTT CTCGATCCAG TCGTCTCCAA AGATGTTCAA TTGAATGATA AAAGGCTTTC GGAATTTGTA GCAAACAACG ACACAATAAT TGAATCTGAA CACACTTCAC ATGCTCATTT TGAAACAAAG ACAGAGCTAG CTGGAGAATC ACTTCCGGCG GATGTCGTCA TTCCGAAAAG AGAAAAAGTT GGGATGGAAT TATCAAAAAC AGTACATCGT GACCCTTCTG AATCTCTTGT GGAACTTGAT TTTGATGTGA ATCCTGGCAA GCTTTATACT GCTCTGCAGC GCAAGGACTG GGATCTTGTT TTGATGCAGC TCCGGGAATT CCCCGAGGAA AGCAGCTTTT GGATATCTCG CCGCGAAATA GATGGGAGGC TTCGGTGGCG GCTGCTTCCC ATTCACGGTG CACTTGTCTT CAAGGCACCA GAGTTTATAA TTCAAGCTTT GTTAGACGCC TACCCAGATG GCGCTCGCGC AACGGATGAT CAGGGCATGT TACCTATCCA CTTGTCGTAC AAGGCGGGAA GCTCGGAGAT CGTTGTACGA ATGCTATATA GTGCTTTTCC TGATTCTTTA ACGGTAGCGG ACCGAAAAGG CCGCACCCCG GTCCAACTGG CGGAAACTAC ATACGGAACA AATAGGGAAG GATTTCTCTG CGCATTGGAG AGCACGAATC ACAATGACGT CGAATCCTCA ATTGAGGGAA CAACTGCTGA ACAAGGGAAC ACTACTAGGT CAGTCCCGAT ACTAGTAGAC GACTTACCGA CAACTCTCAC AATCGCTACT ACACCCGTCA GCATCACTAA ACTGGATCGA ATGATCTTCT TGGCAAAGTC GGACGACAGC ACAACACAAG CCACCAGGCC ACTGGGGACC GGAGTGCAAG TGGAAGCTCG GAAAATGAAT TCTTCGACCG CAGTCACCAG TACAAAAACA ACAATAGAGG TCCATGGAAA GAAACATGAG CTAGAGTTGG AAAGTCTTCG AAAAGAAATT CAAATAAAAG AGGGCACTTT CCAGCAGACC ATGAAGGAGC TCAAAGCAGA ACTGGAAAAT AAGGCCGAAA CATCCAAAGT TTTAATGGAG CATGTGACTT CCCTTGAGTC TCAAATAAAA AGTCGCGGGC ATATGGAGCG CTTTCTAGCT ACAAAGATTG CCACTCTGGA TAGAAGTCTA AAGGCTACAA AACAAGCCAA GCTAGAATCG GAAAATAGAC TGAACGAGGA AATCCATAAG CTTCAATCCA ATCTGGAAGA GGCTATGATT GAAGGAGCAA CACACACTCG GGGAAAAGAC GAGTACATCA AAGCACTTGA GGCTCAGCTT GCTGGGGCCA AAATCGCCGC TCTCGCAGGT AATTCGGTAA AGTCAAAGAA GGGACGCCCT ACCGAGGAGG AAACGTTGGA AAACCAAGTA GCGAGCTTGC TTTCGCTTCT CTCCGCACCG AACTCTGAAT CAAGTTTGAC AACAGCTTCC CTTACGATGC GCGTAAAAGG CTTGGAATCC 
GAAAGAAATG ATCTTAGGAA GACAATCGCA AAGCTGACCA AAAAACTATA CAAGGTGGCG AGCGTTATTG ATGGAATTGT TTCGCAGCAA CAGGATTTGC TTTTGGAGAA TGGTGCGAGC GACGACGACG ATTGCGACGA CCGTCAACTT GTACGCGGTA GCAAGAAAGT GGAAAAACTA CAGGTGCAAA TTACGGAAAC GCAAGATGCC GTCGAAGACG TGTTGCCCCA TAAAAGTGAC GATGGCGATG ACGAACGCTT AGTGGATCGT GCTATTCTGC ATGTATTGAC TTCTTCAAGT TTCGCCGAAG GAGAGTCCTG CGATACCGCC CCGTTTCATC CAAAATGCAA CCAGAAAGGA TCTTTGACAT TGAGTGACGA AGAGAAAAAG GCCGACGACA CGGATGCTGT GGCAATTTTC TAG
|
Protein sequence | MPSCGMCVAT STGDECGKLS EREILQNVEE LNAAIAQEEI FSKVDVSDDP PKTIEVSLLD TLPETFSNEE LLLGTVKPKD SAEFLSEMIT SSRISEQQRL LDPVVSKDVQ LNDKRLSEFV ANNDTIIESE HTSHAHFETK TELAGESLPA DVVIPKREKV GMELSKTVHR DPSESLVELD FDVNPGKLYT ALQRKDWDLV LMQLREFPEE SSFWISRREI DGRLRWRLLP IHGALVFKAP EFIIQALLDA YPDGARATDD QGMLPIHLSY KAGSSEIVVR MLYSAFPDSL TVADRKGRTP VQLAETTYGT NREGFLCALE STNHNDVESS IEGTTAEQGN TTSITKLDRM IFLAKSDDST TQATRPLGTG VQVEARKMNS STAVTSTKTT IEVHGKKHEL ELESLRKEIQ IKEGTFQQTM KELKAELENK AETSKVLMEH VTSLESQIKS RGHMERFLAT KIATLDRSLK ATKQAKLESE NRLNEEIHKL QSNLEEAMIE GATHTRGKDE YIKALEAQLA GAKIAALAGN SVKSKKGRPT EEETLENQVA SLLSLLSAPN SESSLTTASL TMRVKGLESE RNDLRKTIAK LTKKLYKVAS VIDGIVSQQQ DLLLENGASD DDDCDDRQLV RGSKKVEKLQ VQITETQDAV EDVLPHKSDD GDDERLVDRA ILHVLTSSSF AEGESCDTAP FHPKCNQKGS LTLSDEEKKA DDTDAVAIF
|
| |
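The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon. The record does not state this, but the assumption reproduces the listed Gene Length. The function and variable names are illustrative and not part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 299001, 301223
protein_length_aa = 719

gene_length_bp = feature_length(start_bp, end_bp)

# Residues an unspliced gene of this length would encode
# (one codon is the stop codon and encodes no residue).
unspliced_residues = gene_length_bp // 3 - 1

# Base pairs of the genomic span that the listed protein does not account for.
non_coding_bp = (unspliced_residues - protein_length_aa) * 3

print(gene_length_bp, unspliced_residues, non_coding_bp)
```

Under that assumption the span is 2223 bp, which would encode 740 residues if unspliced. The listed protein is 719 aa, leaving 63 bp unaccounted for, which is consistent with the record marking the gene as spliced.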