Gene Information |
Locus tag | PHATRDRAFT_50527 |
Symbol | |
ID | 7199371 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 4810 |
End bp | 6472 |
Gene Length | 1663 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185469 |
Protein GI | 219130642 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0147272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGA GGCGGCCGAG TGCTGGTGCT GTTGCAATCG AGCAACATCC AATATTCCCG TGTGACGGCC AGATGCAAAG TAAGAGCGGT AAAGAGGTAA ACTAAGATTT GCAATTTTCG TTGCGCGTAC TTATTTATAT TTTTATCTGG CCGCCACTCG GTTTTGCTTG CCATTGCCAT CCAAAGTAAG CCTACCGCTT GCGTCGTTTC TCCAGTTGCC TGCGTTGTGT CGGTTTCAGC TCTCACCATC TCACATCTTG CATTTGTTCT TTCTAAGCCT GCCAGCAAGG AGGGCGCATG AGCAATTTTC TTCTGGTTAG AGTATAACAC AATGATAGAG ACTAAGGCAA ACGATGAGGT GAACATGGCC ATGGCGGTTC TTGCCTTTTT GCAAAGTCAG AAGTACGCGA AAACCGCAGC TAGCTTTCGG AAAGAACTAT CAGCCAAGGG TATTGACGTC GGTGGCAAAG TAGTCTTGCG GAAACTCACA TGGGAGGCAG TCGAAAGTCA CGAAGCCTCT GGCAGTGAAG ATTCTGACGA GGCGAAACCG GAAGAGAAAA TAGAGATGCA CAGAGCGACG CAGAGGGCGG GAGATAGAAA TAGTTCGGAA TCATCAGACT CATCTTCTTC ATCGGAGTCG GAGGAAGTTG AAGCGAGCAC CAAGACCCCT GCTAGCAAAA TGCAAGCCTC TAGTAAAAAA TCTATCGGTA CATCTTCCTC ATCCTCCTCT GACAGCGAAT CGAGTACCGA TGATTCCAAA AGCACTGTGA AAAAACCAGA CTCCCTTGCA AAATCTCATC CTGAGAAATC TTCGGATAAC TGTTCATCGG AATCTGATTC AAGTTCTAGT AGTGATAGCG AATCAAGCGA TGATGATGAT GCTCCCCGTA TCTCTAAGAA AATTGCTGTC ACGACAAAGA AAGCAAAAGG CAAGGCAGGG TCGTCGCCGC AGGAGATGGC AGACTCGTCT ACAAAGAGCA AGCGCCGAAC AAAGACTGAA ATTGCTATTT CAAAAAGCAG TGACGTCGAC CCTTCGTCGA GTTCCGAGGA TGAAGCGCCG CCTCCAACGA AGAAGGTTCG TCTTGAGGCA AAAGCTGGAG TCGTTTCGCC GGAAGATAGC GACTCGAACG TCAGTGATGT TGAAGTTTCA GACGTTTCGT CGGTAGAAGT GTCATCCTCC GACGGTAGTG ACTCAGACTC ATCGAGCTCC GACGAAGAAT CCGAAGACGA AAACGATATA CAAGAACGTA TAAAACTTAA GCGTCGGGAT GCCGCGAAAA AGGCTCAAGA AGCCGCAAAG GCTGCGCACG AATGGAGACC ATCCGCTGAA AAGAAAAAGG TAGAGATTAA GGCTGCCGCC GGGACGGATG GCGCGCAAGC ATTATCGAAA GGAAAGCCCT TTCAGCGTGT TGACTCCGAG TTTTGGGGGC GTGTTGCAGT CAAAGACGGC GGTGCTATGG CTGATAATTC GTACGAAGGT TTGTTTGGTG ACAACGGCTA TGGTGCACAG TCTAGTGCAA AGCTATTGAC TGTAAGAGGG AAAAATTTTA CCAAGGAAAA GAACAAAAGA AAGAGAAGCT TCAATGGGCT GTCGCGCACA GGTGGACAAA TTGACACGGA GCGGAGCTAT TCAACCAAGT ACCAATATTC TGATGACGAA TAG
Protein sequence | MAKRRPSAGA VAIEQHPIFP CDGQMQKTKA NDEVNMAMAV LAFLQSQKYA KTAASFRKEL SAKGIDVGGK VVLRKLTWEA VESHEASGSE DSDEAKPEEK IEMHRATQRA GDRNSSESSD SSSSSESEEV EASTKTPASK MQASSKKSIG TSSSSSSDSE SSTDDSKSTV KKPDSLAKSH PEKSSDNCSS ESDSSSSSDS ESSDDDDAPR ISKKIAVTTK KAKGKAGSSP QEMADSSTKS KRRTKTEIAI SKSSDVDPSS SSEDEAPPPT KKVRLEAKAG VVSPEDSDSN VSDVEVSDVS SVEVSSSDGS DSDSSSSDEE SEDENDIQER IKLKRRDAAK KAQEAAKAAH EWRPSAEKKK VEIKAAAGTD GAQALSKGKP FQRVDSEFWG RVAVKDGGAM ADNSYEGLFG DNGYGAQSSA KLLTVRGKNF TKEKNKRKRS FNGLSRTGGQ IDTERSYSTK YQYSDDE
| |
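The coordinates and lengths in this record can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the source record. The function names are hypothetical. Coordinates are assumed to be 1-based and inclusive. The non-coding estimate assumes the listed gene sequence runs from the start codon to a single stop codon with no UTRs.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def non_coding_bp(gene_len: int, protein_len: int) -> int:
    """Bases not used by codons, assuming one stop codon and no UTRs."""
    return gene_len - 3 * (protein_len + 1)


# Values taken from the record: Start bp 4810, End bp 6472, Protein Length 467 aa.
length = gene_length(4810, 6472)
print(length)                      # 1663
print(non_coding_bp(length, 467))  # 259

# The sequence fields can be passed through clean_sequence() before analysis.
print(clean_sequence("ATGGCGAAGA GGCGGCCGAG"))  # ATGGCGAAGAGGCGGCCGAG
```

Under these assumptions the computed length matches the listed Gene Length of 1663 bp, and the 259 bp not accounted for by codons is consistent with the gene being flagged as spliced. Passing the full Gene sequence field through `clean_sequence` and `gc_percent` would give a value to compare against the listed GC content.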