Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37636 |
Symbol | |
ID | 7202455 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 589214 |
End bp | 591338 |
Gene Length | 2125 bp |
Protein Length | 657 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181590 |
Protein GI | 219122518 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCT TCACACCTTC TTTTGTGAGC TGTCGTCCTG TTTCACCCTA CGGCTTCCAG CCCATCTCCG GAGACCCGCA GCAAGACCAG CGCGTAATGA CCCCGCTTCA ATTCTCGCCA ACACCGATGT CGCCGTACGG AAAAATATCT CCCAGTCCTT CCGTAATGAC GGAGGAAACA TCTTCCCTCA CATTTGAAAG CAACAGCCGC TACAGCCCTG CAACTTTTCG CTCTAATACA CCATCGTCGT TTCGATCAGT TACACCCCAG TCGGACCAGT TCATAGACAG TAGAGGCCAA CGTGTCACCA CACCACGCCC CAAAACTCCA AACGGCCGCA AGTCACCCTT TGCCAAAAAA GAAGAAGACG ATGCGGTCCG AAAAACTCGT ATTAAAACTG AGCTTTGCAT ACATTACGCG AATGGTAGAC CGTGTCCTTT CGGAGCAAGT TGCACCTATG CACACGGTGA AGAGGAGCTT CAGTTGACTA AACTTCTGGA TTTACACGAA GCTGGTCTGA TTGATGTCGG GATCTTTCGT ACAAAACCAT GCTTGACTTG GGTTGCAACA GGATCGTGGT ACGTACAAGG GCCTGCAAAT TGATAGAATT CTACTTGACT GAAAAGAGCA AAAGACTGAC AATGATTTTT CTGGCTATCC TATTCGTAGC CCGTTTGGAA AACGATGCAC CGCCATACAC GATCCTCGAG TGGGCGGATC CCATTCATCT TGGTTGCCTC ATACCGAGAC ACAAGGCAAC ACAATGGCTA CAGACATCAA TGTGGAGGCT CTTCACCAGA AGCGTCAACA TTCGATTTTG TATGGAACCC CGTTTGGGAG TCATTTTTCG CTGGAAAATG ATTCTTGGAG TGACCTGTAC AAGCTCGTAT GTCATATCAA CTATGCCAAA AAGGGATGGA TTGACAAGCG TCGTCGTATG ACTGTTGATC CAGTAACCAA GTTAGAAGTT GCTCTTCTTA TGCGAGGCGA AGCCAACTGG AGTTTCAAGT TTCGACCACA ACACATCATT CACGACGAAC TTTGCATGGT TCTTCAGGAA CGTGCCTTTC GAATAGACAG TCAACTACTA CCTGTGGAAA TTCCGCAGCA CTCATATACC GCTAGCAATC AAAGCCACAT ATTTGTACGA GAGATCGCCT TCGGACCCGA TGAAGATCCG ACCGTACGAA CGGTTGGTCT TTGGTTCAAT ATTGATGAGC GAGATGTCCT AGTGTGTACG TCTCAGCAAG CTAAGCGATT CCGTTGGAAG CGGGGCGTCA ATATAAAGGA CGACACTCAA CAAACAGGGA AATCTTCCGC ATTCGAGACC CTGGATCACT TCCCTATGAT TCGCCCCCAT GACAGGGAAA CATTCGGCTT TACCACAAGT CTCTTAAAAC ACCGCCTTCG AGTCGTGCGC GCAGAACGTA TATGCAGTAT GAGAGGACGA TTTGACGCCT TGCAGAAACT TGAGGGTGAC AAGCAAGTTC TTTATAAGCG ATTCTTGAAT TTGGCACATT GCTGGAAGGT TTGGCTGTGG CCAATCAATG ATGGAAGGGC CAGCGTTGAC AAGCACACGC CAGTACCACC AGTAGATGGA AAGTACGAAT TCGGTAGGAC TGCATCGAAC CTTACTAGAC TGAACTCAAA GCTTGAAGAA ACCAGTGCTC CAATATGGCA TACTGTGAAC GAAATTTGGG AGTCCTTCGT ATCTGCAGAC TTTGAAAACC TTCACGTGAG TTTTTACATT TACAAAACTG GATTGTAAAA TGTGACCTCT TTACTAACAT TTCAGGGGAA ATCATCCCTT TTCATCATAC AGGTTGAAGA ACGTGTATTG CTCAACGTTC GACTTACATC AAGCAAGCGT CTACGACCGT TTCTGCAGCT TGCGCAAGGC AAACCCCTGT CCCTGGACAG GCGCTCGCCG CATATCCTGA AACACGATAG GACCACCGAA GAAAACCATC CTTCGCATTC TCAAGATCAG GATCGTTGCT GGAAGTCGCT GTTGCTGACT TCTGGAAAAT CCATAGAAAA CAGTGAATGG GAACTGGTTG AACAGCATTT CAAGAACTCT CGAAGCAATA AGGTTCTAAA CATCATCCAA GATAAAACTG CATGA
|
Protein sequence | MSPFTPSFVS CRPVSPYGFQ PISGDPQQDQ RVMTPLQFSP TPMSPYGKIS PSPSVMTEET SSLTFESNSR YSPATFRSNT PSSFRSVTPQ SDQFIDSRGQ RVTTPRPKTP NGRKSPFAKK EEDDAVRKTR IKTELCIHYA NGRPCPFGAS CTYAHGEEEL QLTKLLDLHE AGLIDVGIFR TKPCLTWVAT GSCPFGKRCT AIHDPRVGGS HSSWLPHTET QGNTMATDIN VEALHQKRQH SILYGTPFGS HFSLENDSWS DLYKLVCHIN YAKKGWIDKR RRMTVDPVTK LEVALLMRGE ANWSFKFRPQ HIIHDELCMV LQERAFRIDS QLLPVEIPQH SYTASNQSHI FVREIAFGPD EDPTVRTVGL WFNIDERDVL VCTSQQAKRF RWKRGVNIKD DTQQTGKSSA FETLDHFPMI RPHDRETFGF TTSLLKHRLR VVRAERICSM RGRFDALQKL EGDKQVLYKR FLNLAHCWKV WLWPINDGRA SVDKHTPVPP VDGKYEFGRT ASNLTRLNSK LEETSAPIWH TVNEIWESFV SADFENLHGK SSLFIIQVEE RVLLNVRLTS SKRLRPFLQL AQGKPLSLDR RSPHILKHDR TTEENHPSHS QDQDRCWKSL LLTSGKSIEN SEWELVEQHF KNSRSNKVLN IIQDKTA
|
| |