Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50607 |
Symbol | |
ID | 7199445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 70926 |
End bp | 72242 |
Gene Length | 1317 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185560 |
Protein GI | 219130835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGGTGCAT TCGAACTGTT GGCAAGGCAG TTTACTGTTA GTTTCTCGCC CTCCAGGCTT TCCTCGCGCA TTTCCAAACG TATTGCAGTA CTCACGTCCA GACACAAATA CAGTATCCAA ACGCTGACTT TCAATCGCAA ATCGCGTGGG CCATCATCAT ACCCCCCGTA TTTGGTGGTT CCTGCCTACA CACCTAACAG GGTATCGATC GGTAGTGTGT CCTGCCTGTG ATAGAAACGT CTCGACAGCA CCTTGACCAC AATGTCGGTA CCACCCACCG AATCACAACC TCGTCCACAC GAACAGCCAC TACGGAACCG CGCGGCCGCC CCGACCAACG CGGGATCCTT ACGGGATGCC CGCTTACTCC TCCTCGGTGC GGCCTTCTCC TACGTACTCT TCTTTGCCGT TAACGTCCAC AAGTCCACTA TTCTTCCAGA CGTTATAGAC GACGCCGTTC TCGCTACGAT GGGTTTGCCA CTGGCGTCCG TGCCCGTGTC CTCATTGCCT TCATTGCCCG GATGGAATCC CGTCCACGTA TACTACGGGG ATACTTCGCA CTTGGATCAC GTGCTTGATC CACGGCACAG TCGACGTAGT GTTCAGTGGC TCGGCAAGGG ACCGGCCAAC GGTCGCCAGT GGACCAGCCA GTTCGGACAG GACGTGGCCG TCATGAAGAT ACTTGACTTC CCCACAGGAG CTTTCTTTCT CGATCTTGCC GCCAATGGAC CTGTCTGGAT GAGCAACACT TACGTTCTAG AAACACACTT TGACTGGAAG GGTATTTGCG TCGAGCCCAA TCCGATCTAC TGGTACCCTC TGTCCTTTCG TAGTTGCCAC GTGGTGGGTG CCGTTGTCGG AGCCCGCAAC ATGGACCAAG TATCGGTCCT GTTGGATCCC GGACAAATGG CTCCGAGCAA CGGAATCGTT GGCGACACCT TTGACAACCG CAACGTCACC CAACCCGGAT TGGTCAAACC CCGCTTTACC GCCAGTCTCA CGGACATTTT GGAACACTTT CAGGCGCCCG CCGTCATTGA CTACTTTTCC CTCGATGTCG AAGGAGCCGA ACTCTTCATC ATGAAAAATT TCCCCTTTGA AAAATATCGG TTCCTGTGTT TGACAATTGA ACGAGCACCA CCCGAGCTGC AGGAGATTTT GTCGCGGAAC GGCTACAAAT TTGTGCACAC GATTCGACAA GGTGTGGACG ATCTATGGGT GCACGAGTCC ATTTACGACC AAGCCAAGGC CAACCTGGCC ATTCGCACCC ACGAAATTGC CACCAAAAAT CCCCAACTTC CGGTAGCAGC AGCGTAG
|
Protein sequence | MSVPPTESQP RPHEQPLRNR AAAPTNAGSL RDARLLLLGA AFSYVLFFAV NVHKSTILPD VIDDAVLATM GLPLASVPVS SLPSLPGWNP VHVYYGDTSH LDHVLDPRHS RRSVQWLGKG PANGRQWTSQ FGQDVAVMKI LDFPTGAFFL DLAANGPVWM SNTYVLETHF DWKGICVEPN PIYWYPLSFR SCHVVGAVVG ARNMDQVSVL LDPGQMAPSN GIVGDTFDNR NVTQPGLVKP RFTASLTDIL EHFQAPAVID YFSLDVEGAE LFIMKNFPFE KYRFLCLTIE RAPPELQEIL SRNGYKFVHT IRQGVDDLWV HESIYDQAKA NLAIRTHEIA TKNPQLPVAA A
|
| |