Gene Information |
Locus tag | PHATRDRAFT_37404 |
Symbol | |
ID | 7202341 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 50225 |
End bp | 53609 |
Gene Length | 3385 bp |
Protein Length | 1121 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181651 |
Protein GI | 219122643 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
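The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `coding_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon falls inside the gene span. Neither assumption is stated in the record itself.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an
    optional stop codon). Assumes no frameshifts or partial codons."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the Gene Information table above.
gene_len = span_length(50225, 53609)
cds_len = coding_length(1121)

print(gene_len)            # 3385, matching the "Gene Length" field
print(gene_len - cds_len)  # 19 bp of the span not accounted for by coding sequence
```

Under these assumptions the span is 19 bp longer than the coding sequence implied by the protein length, which is consistent with the record marking the gene as spliced.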
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.249909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT TCAGCGAGTC CTGACACCCA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA CAGATCCTTC GTCATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA TCGACGCTCA GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC AAGACCCACT ATCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC GTGGTCACCG CGGTATCCCA TAACGTGGAA AAGGTACTTG ACCCATCCTA TGCACCGACG GACCCCTCAG AGAAGTCTCT TTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GCATCTACAA TGATATGGTT CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTA TGCTTGAAAA CGCTGTACAC GACATCCCTG AACTCCGCCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT AAAGGGAACA ACCTTTCCAA TTCTCGTAGT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC GACCTCTCGT ATGATCAACC GGACTTCACC GAAGACAATG GGATTGACTA TGATATTGAT CTCTCTCCTG CAGTGATCTA TGAGGCCAAT GCTCACAACC GCAAAGTCAG TCCATCTGGC CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAAATGTGG AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA TCGAACGATA ATGGCGAGTT CAATCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG GGAGATGGCG ATATCCGAAA AGTTCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT CGTCATTCCG TTGAGAACAA GACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC 
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT CGAGTCTCAC AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC TCGCAATGTT GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TGTTCGACAA GGCCTCCCGT ATATGGATAT GCGGCAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA TTTGACAACA CAGGCCAATA CCTCCACCGC CATATAGCGT ACTACGATCT CGATCGCGAG GACGCTATTG ATTGCATTAT CCAGTGTCGT AAGCACAATG TCAAACGCAA TGAACGGGAT TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCAC CGCTACGAAA GCACTTCAAA TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTTGT CAATGCCCTT GAAGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT ACCAATAACG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCATGATCG TACACCGCTC GAGACTCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA CCGGTGTACT ACCGTATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT CACTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAGT CCTCACAGAC GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CTTTGAAATC CGGAGAAACC AACCTACGCC TTACGCCACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT GACGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT GCACGTATCA CAAGGAAAAT CTTAG
|
Protein sequence | MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP SASPDTHEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV PMAEQLPKQL CLSMLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD KGNNLSNSRS VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN AHNRKVSPSG HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIRKVLA ASRRTPINCT QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV GGQQRIVTLD DYIIPLHVRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCR KHNVKRNERD YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNNVLNKTGA PDSTWLLCIA YICYVFNHLS HESLHDRTPL ETLLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG HFVGIAESVG DALTYKVLTD DTNKILYRSS VRSALKSGET NLRLTPQDGE SNSKPINFVK SRRTENKNSY ALKDLPGFTP DDLIGRTMMG SVFVHVSQGK S
|
| |
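The sequence fields above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized and its GC fraction computed follows. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 48% reported for the whole gene.

```python
def clean_sequence(field: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the "Gene sequence" field above (not the full gene).
sample = "ATGGTACCGG CCACCAGGCA"
seq = clean_sequence(sample)

print(len(seq))          # 20
print(gc_fraction(seq))  # 0.65 for this 20-base sample
```

Applying the same two functions to the full gene sequence field would give a value comparable to the GC content field in the table.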