Gene Information |
Locus tag | PHATR_10257 |
Symbol | |
ID | 7204065 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1382085 |
End bp | 1383509 |
Gene Length | 1425 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186243 |
Protein GI | 219113319 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
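The coordinate and length fields above are related by simple arithmetic, which the minimal sketch below checks. It assumes the coordinates are 1-based and inclusive (an assumption, though one consistent with the reported 1425 bp) and that the protein length excludes the stop codon. The function names are hypothetical and used for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(1382085, 1383509)  # Start bp / End bp from the record
cds_len = coding_length(459)              # Protein Length from the record
non_coding = gene_len - cds_len           # bases in the span not explained by the CDS

print(gene_len, cds_len, non_coding)
```

Under these assumptions the genomic span is 45 bp longer than the coding sequence alone, which is consistent with the record marking the gene as spliced.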
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.144929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTTGT TGTTGTTGGC GTGTTTCGCA CGCCGTACTT CGTCACTGAC GAGACCCTGC TCCGCCCGAT TGCAACGGTC GGCCTTGTTG TCGTCGACAG CGACTGCCGA TGAGGCGGCA TCCTCGACCA ACTTTCGATT GGCGGCACGT CTACAAGGCC TCGACAAACC CACCGTGTGG CAGGAATTCT CCCCACTCGC TGTCGAACAT CAGGCGGTGA ATCTCGGTCA AGGCTTTCCG GACTGGGACC CCCCGCTCTT TGTACAACAA GCCATGCGCA ATGCCATTGA TCCGGCACAA GCCCGACACG CCAATCAGTA CGCCCGTCCC AACGCGCACT TGCCACTCGC AACCGTCCTG GCCGAAGACT ACAGTTCTCG GTGGCCACAA GTAGATCTAA ATCCCGCTAC GCAAGTCGCA ACGGCGGTAG GCTGTACCAA CGTCCTCTAT TGCACCTTGC AAGCATTGGT GGGGGTCGGA GATCAAGTCA TTCTCCTCGA ACCAGCCTTT GATATTTATG CATCGCAAGT GCGCATGGCG GGTGGGACAC CAGTCTACGT GCCACTACGA CCTACCGGAC ACGTTCACGA GGGTGCCAGC CAGGCCTTTT CCCTCGACTT GAACGAATTG GAAGCCGCCG TTACTCCCAA CACCAAGGTA CTCATTCTCA ATACTCCACA CAATCCTACC GGAAAAATAT TTTCCCGCGA CGAGCTCGAA GGCATTGCCG CTATTGTACA GAAACATCCG CAATTGACCA TTATATCTGA CGAAGTGTAC GAGCACATTT TGTTCGATCC GGCACACGAG CCGCACATTT CCATGGCCAC GATACTCTTT GATCAGACGC TCACTTTGAG TTCATCGGGA AAAACTTTTT CGTGCACGGG GTGGAAGGTG GGTTGGGCGG TGGGACCTCC GCATCTGGTC CAAGCCGTGG TGGCAGTACA GCAGTGGGTC AACTTTTCTG CACCAACCCC CAATCAAGAC GCCATTGCCC AAGCGCTAGT GGAAGCCCGT CAGCCCTTTC AGGGATACGA CTCATATTAC GCCTACTTGG CCGACGAGTA TTTGCGCAAA CGAGGTATTC TAGTGGAAGC GCTCGAGGCG GCGGGCATGA CACCCATTGT CCCACCCGGC GGCTTTTTCA TCATGGCCGA TACGAGTTCG ATCAGTGACT CGTTCGTCCC GGAATCCTAC CGCAAGGAAG TTACCGCCGC CATGCCGACC AATCCAATGC CGCGCGATTG GGCCCTTTCC CGATGGCTCA CCAAGGAGGT GGGGGTGACG GCCATTCCCC CTTCGGCTTT TTACAGTGAA GAAAACGTTC CCCTGGCTCA GAATTTGCTG CGGTTTGCCT TTTGCAAAGG CGACGATACT TTACGGGAGG CACAGACCCG ATTAGCCACA TATTTTCAAA GGTGA
|
Protein sequence | MLLLLLACFA RRTSSLTRPC SARLQRSALL SSTATADEAA SSTNFRLAAR LQGLDKPTVW QEFSPLAVEH QAVNLGQGFP DWDPPLFVQQ AMRNAIDPAQ ARHANQYARP NAHLPLATVL AEDYSSRWPQ VDLNPATQVA TAVGCTNVLY CTLQALVGVG DQVILLEPAF DIYASQVRMA GGTPVYVPLR PTGHVHEGAS QAFSLDLNEL EAAVTPNTKV LILNTPHNPT GKIFSRDELE GIAAIVQKHP QLTIISDEVY EHILFDPAHE PHISMATILF DQTLTLSSSG KTFSCTGWKV GWAVGPPHLV QAVVAVQQWV NFSAPTPNQD AIAQALVEAR QPFQGYDSYY AYLADEYLRK RGILVEALEA AGMTPIVPPG GFFIMADTIT AAMPTNPMPR DWALSRWLTK EVGVTAIPPS AFYSEENVPL AQNLLRFAFC KGDDTLREAQ TRLATYFQR
|
| |
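The GC content field can be compared against the Gene sequence field with a short calculation. The sketch below is one plausible way to do this, not necessarily how the database derived its 56% figure. The helper name `gc_content` is hypothetical, and the example input is only the first three 10-base blocks of the sequence, so its result is not expected to match the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First three 10-base blocks of the Gene sequence field, copied from the record.
prefix = "ATGTTGTTGT TGTTGTTGGC GTGTTTCGCA"
print(f"{gc_content(prefix):.2%}")
```

To check the full gene, pass the complete Gene sequence string. The space-separated blocks can be left in, since whitespace is stripped before counting.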