Gene Information |
Locus tag | PHATRDRAFT_39506 |
Symbol | |
ID | 7195182 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 679653 |
End bp | 683037 |
Gene Length | 3385 bp |
Protein Length | 1121 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183530 |
Protein GI | 219126575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT CCAGCGAGTC CTGACACCAA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA CAGATCCTTC GACATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA TCGACGCTCG GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACTATCAGTC CTTCTCTGAA CAACCAGTTA AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC AAGACACACT ATCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCAACG GACCCCTCAG AGAAGTCTCT CTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GTATCTACAA TGATATGGTC CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTT TGCTTGAAAA CGCTGTACAC GACATCCCTG AACTCCGTCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT AAAGGGAACA ACCTTTCCAA TTCTCGTAAT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC GACCTCTCGT ATGATCAACC TGACTTCACC GAAGACAATG GAATTGACTA TGATATCGAT CTCTCTCCTG CAGTGATCTA TGAGGCCAAT GCTCACAACC GCAAAGTCAG TCCATCTGGC CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAGATGTGG AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA TCGAACGATA ATGGCGAGTT CAACCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG GGAGACGGCG ATATCCGAAA AGTTCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT CGTCATTCCG TTGAGAACAA AACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC 
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT TGAGTCTCAC AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC CCGCAATGTT GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TATTCGACAA GGCCTCCCAT ATATGGATAT GCGGCAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA TTTGACAACA CAGGTCAATA CCTCCACCGC CATATTGCGT ACTACGATCT CGATCGCGAG GACGCTATTG ATTGCATCAT CCAGTGCCAT AAGCACAATG TCAAACGCAA TGAACGGGAT TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCGC CGCTACGAAA GCACTTCAAA TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTTGT CAATGCCCTT GAGGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT ACCAATAACG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCATGATCG TACACCGCTC GAGACTCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA CCGGTGTACT ACCGGATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT CGCTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAAT CCTCACAGAC GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CATTGAAATC CGGAGAAACC AACCTACGCC TTACGCCACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT GAGGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT GCACGTATCA CAAGGAAAAT CTTAG
|
Protein sequence | MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP PASPDTNEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV PMAEQLPKQL CLSLLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD KGNNLSNSRN VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN AHNRKVSPSG HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIRKVLA ASRRTPINCT QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV GGQQRIVTLD DYIIPLHIRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCH KHNVKRNERD YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNNVLNKTGA PDSTWLLCIA YICYVFNHLS HESLHDRTPL ETLLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG RFVGIAESVG DALTYKILTD DTNKILYRSS VRSALKSGET NLRLTPQDGE SNSKPINFVK SRRTENKNSY ALKDLPGFTP EDLIGRTMMG SVFVHVSQGK S
|
| |
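The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the source database: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding length is assumed to include one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein, assuming one trailing stop codon."""
    return 3 * (protein_aa + 1)


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the Gene Information table above.
gene_length = span_length(679653, 683037)
print(gene_length)  # 3385, matching the "Gene Length" field

# The 1121 aa protein needs fewer bases than the genomic span covers,
# which is what one would expect when "Is gene spliced" is "Yes".
print(coding_length(1121) < gene_length)  # True

# GC content of the first two groups of the gene sequence, as a small demo;
# pass the full "Gene sequence" field to compare with the "GC content" row.
head = clean_sequence("ATGGTACCGG CCACCAGGCA")
print(round(gc_percent(head), 1))
```

Under these assumptions, the 1121-residue protein accounts for 3366 bp, which leaves 19 bp of the 3385 bp span outside the coding sequence.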