Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41316 |
Symbol | |
ID | 7199191 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 102190 |
End bp | 103449 |
Gene Length | 1260 bp |
Protein Length | 380 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185329 |
Protein GI | 219130348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTCT CCGCCTTCGT CGTTTTCTTT CTCCCCCTTG CTTTTGCAAC CGAACGCTCG GCCAACAACC TGGTATGTTG AATGAAGAGC TTGTCCTTGT CGGAAGAAGT GTTCCGAAGC CTGCGCCTGC GAAAACAACA CGAAGTTGCA CCGGAAAATC GATCCTCATA TTTTCTCGAC TCTTTCCAGC CCGAACGGTA CCTAAATAGT CCGACTGGAC CTAGTTTGAC GGGTCCTAGT TTGACTGGTC CTAGTTTGAC GGGTCCTAGC GCCACCGGGC CAAGCATGAC GGGTCCCAGC GCCACCGGGC CAAGCATGAC GGGTCCAAGT ATGACGGGAC CTAGCATGAC TGGACCCAGC GACAGCGATG ATCGTCGTCT CAAGAGCCCC AGCTCTACGG GTCCTAGCGC CACCGGGCCA AGCATGACGG GTCCCAGCGC CACCGGGCCA AGCATGACGG GTCCAAGTAT GACGGGACCT AGCATGACTG GACCCAGCGA CAGCGATGAC CGTCGTCTCA GGAGCCCCAG CTCTACGGGT CCTAGTTTGA CGGGTCCTAG CGCCACCGGG CCAAGCATGA CGGGTCCCAG CGCCACAGGT CCAAGTATGA CGGGACCTAG CATGACTGGA CCTAGCGACA GCGATGACCG TCGTCTCAGG AGCCCCAGTT CTACGGGTCC TAGTTTGACT GGTCCTAGTT TGACGGGTCC TAGCGCCACC GGGCCAAGCA TGACGGGTCC AAGTGTCACA GGCCCCAGCA TGACTGGACC CAGCGACAGC GATGACCGTC GTCTCAGGAG CCCCAGCTCT ACGGGTCCTA GTTTGACTGG CCCTAGTGGC ACAGGTCCTA GTATGACGGG CCCCAGCGCC ACAGGGCCAA GCGTGACGGG TCCCAGCGCC ACAGGTCCAA GTATGACGGG ACCTAGTATG ACTGGACCCA GCGACAGCGA TGACCGTCGT CTCAGGAGCC CCAGCTCTAC GGGTCCTAGT TTGACGGGTC CTAGCGCCAC CGGGCCAAGC ATGACGGGTC CCAGCGTCAC AGGTCCAAGT ATGACGGGAC CTAGCATGAC TGGACCTAGC GACAGCGATG ACCGTCGTCT CAGGAGCCCC AGTTCTACGG GTCCTAGTTT GACTGGTCCT AGTATGACGG GCCCCAGCGC CACAGGGCCA AGCGTGACTG GTCCAAGTGT CACGGGCCCA AGCATGACTG GACCCAGCGA CAGCGATGAC CGTTTCCTCA GGCGCCGGAA CAAGATGTAG
|
Protein sequence | MKFSAFVVFF LPLAFATERS ANNLPERYLN SPTGPSLTGP SLTGPSLTGP SATGPSMTGP SATGPSMTGP SMTGPSMTGP SDSDDRRLKS PSSTGPSATG PSMTGPSATG PSMTGPSMTG PSMTGPSDSD DRRLRSPSST GPSLTGPSAT GPSMTGPSAT GPSMTGPSMT GPSDSDDRRL RSPSSTGPSL TGPSLTGPSA TGPSMTGPSV TGPSMTGPSD SDDRRLRSPS STGPSLTGPS GTGPSMTGPS ATGPSVTGPS ATGPSMTGPS MTGPSDSDDR RLRSPSSTGP SLTGPSATGP SMTGPSVTGP SMTGPSMTGP SDSDDRRLRS PSSTGPSLTG PSMTGPSATG PSVTGPSVTG PSMTGPSDSD DRFLRRRNKM
|
| |