Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49091 |
Symbol | |
ID | 7195447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 557380 |
End bp | 558840 |
Gene Length | 1461 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183633 |
Protein GI | 219126792 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAGCGCTTC GGTAGCTTGG ATCATCGACG TTGAAAGCCA CTGCTTCTAC TGGATATAGA TTCTGTCCTA TTTGCCCAAC CATGAGTGAT GGAAGCATAC CGGACGTACA AGTTTGCATT TCTGATGCCT ACGATTCGGG CAACGTCGAG TTTGTTTCCT CGGAACGAGT GGACGGCGTT TGGACCGTCC ACGTCCGCAT CAAGCCCGAT GTGTACACTG CACTCGAAAA GATATCGCAT ATGCAGTATT TCTCCTTCCG TGCGACCGTC AACAACATCA GCAAACCGAC CAAAATTGTG TACGTGATTG ACAATGCGTC CAAGGCTTCC TACCCCGCCG CTTGGACCGG TACCACAGTG TGCTACAACA CGACCGATCC GGAAGATTCG GAGGCCTGGT TCCGGAACAG CACGACTCGT TACGTCGACG GAACGCTAAC GTGGACGCAT CTTCACGTCT ACAATTGTAG TTCCTACTTT AGTTACTTTC CGCCCTTCAC GTACGCGCGT CATTTGAAAC TCGTTGGCCA CCTTCAACTG GCGGCGGCCA AAATTCCTCA CGCCAACGTC GAAAGTCTAG GGCAAACCTT GGATGGTCGG GAAATTGAAT GTGTTACAAT TGGCACTGGT GATAAAATAT GCTGGATCAT TCACCGTCAA CATCCCGGTG AAACCATGGC GGAGCACTAT GCCGAAGGCT TGTTGTATCG CCTCTTCCGA TTGGATGACG TGGAAGATCC GATCGTGGAA CAAGCCTTGC AGCGTTTCCG TTTCTACGTT GTACCCTGCA TGTGTCCCGA TGGCGGAGTT CGCGGGCATT TGCGGACCAA TGGAGTTGGA GCGAACTTGA ACCGGGAATG GGCTACCAAG GGCGACTACC AAGCACCCAC CCTAGAACGA TCGCCGGAAG TTTACCACGT GCTTCACAAA ATGGACGAAA CTGGAGTGGA TCTGTTCTTG GATGTTCACG GGGACGAAGA ACTGCCTTAC AATTTTATTT CGGGGGCCGA ACAGACGCCG GCTTGGAGTA AGCGGCTGGA ATCCCTGCAC GGTGCGTTTG TGGCGTCCTA CCATCGTGCC AATTCTGACA TGCAACAAGA AATAGGATAT CCACCGCCGG AAAGCCGCGA GGAAGCTCAG AAATACATGA ATGTTGCGAC AAATCAAGTA TCGACGCGCT TTGACTGTCT CGGGATGACG CTGGAAATGC CGTTCAAGGA TTGCGAATCC AACGTTGACC CCGATCGTGG ATGGTCCGCG GCGCGATCTC GGGCATTGGG AGCCTCCGTG TTGGGGCCCT TGCTGTATGT TTATCCGTAT CTACGGGACG AAACGGAGTT CTGGACTACC TTACCACCGG AAGACGCCTA CGTGGAGCCC CATGATGACT ATCAGGGCTT TAAGCCTTTG ACGAAACGTC TCTATTCTGA TGTTCGCGCA ACCAGCAGTA TCACCAGTTG A
|
Protein sequence | MSDGSIPDVQ VCISDAYDSG NVEFVSSERV DGVWTVHVRI KPDVYTALEK ISHMQYFSFR ATVNNISKPT KIVYVIDNAS KASYPAAWTG TTVCYNTTDP EDSEAWFRNS TTRYVDGTLT WTHLHVYNCS SYFSYFPPFT YARHLKLVGH LQLAAAKIPH ANVESLGQTL DGREIECVTI GTGDKICWII HRQHPGETMA EHYAEGLLYR LFRLDDVEDP IVEQALQRFR FYVVPCMCPD GGVRGHLRTN GVGANLNREW ATKGDYQAPT LERSPEVYHV LHKMDETGVD LFLDVHGDEE LPYNFISGAE QTPAWSKRLE SLHGAFVASY HRANSDMQQE IGYPPPESRE EAQKYMNVAT NQVSTRFDCL GMTLEMPFKD CESNVDPDRG WSAARSRALG ASVLGPLLYV YPYLRDETEF WTTLPPEDAY VEPHDDYQGF KPLTKRLYSD VRATSSITS
|
| |