Gene Information |
Locus tag | PHATRDRAFT_40611 |
Symbol | |
ID | 7198386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 448979 |
End bp | 452395 |
Gene Length | 3417 bp |
Protein Length | 1138 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184616 |
Protein GI | 219128850 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
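The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes 1-based inclusive coordinates and that the reported gene length includes the stop codon; both assumptions agree with the values listed here, but neither is stated in the record. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is
    counted in the gene length (an assumption consistent with this record)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between the
    10-base blocks used in the record is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above.
length = gene_length(448979, 452395)
print(length)                  # 3417
print(protein_length(length))  # 1138
```

Under these assumptions the computed values match the reported Gene Length (3417 bp) and Protein Length (1138 aa). `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content.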
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.101521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACAAACTAG ATGCCCTTGG ATATGTGCCT CCAGCGAGTC CTGACACCCA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA CAGATCCTTC GTCATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA CCGACGCTCA GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTCA AGCGAGGAGT CAAGCGTGAC AAGACCCACT ATCCTATCCT TAAGGACGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCGACG GACCCCTCAG AGAAGTCTCT CTTTGAGCAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT GCTCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA ACGGCGGAAG GGTTCATCCT TCACTGGAAG AACCATCTTC GCATCTACAA TGATATGGTT CCTATGGCAG AGCAGTTGCC TAAACAGCTT TGCCTCAGTT TGCTTGAAAA CGCTGTACAC GACATCCCTG AACTCCGCCA GGTCAAGATC ACCGCTACTT TAGACTTAGC TAAAGGAGGC ACTCCCCTCA ACTACGAAGG TTACCTGAGT CTATTGCTTG CATCTGCTTC TCTATACGAT AAAGGGAACA ACCTTTCCAA TTCTCGTAGT GTCAAGAGCA AGCGTAGCGC CTTTCTGACC GACCTCTCGT ATGATCAACC GGACTTCACC GAAGACAATG GAATTGACTA TGATATTGAT CTCTCTCCTG CAGTGATCTA TGAGGCCAAT ACTCACAACC GCAAAGTCAG TCCATCTGGC CACCGTAATC GCGATCCGGC AACCAATCGA GAGCGTCCGT ATATCCCTCG CGAGATGTGG AATCAGCTTT CAGATGATGC CAAAGCCATT CTCCAAGGCC TGTCCGCACC CGACAAAGGC CCTACTCGAT CCGGCGATGT CTCGCAACGT GCGTTGGAAG CGAATACCCA CGCCAAGATA TCGAACGATA ATGGCGAGTT CAACCGTAGC GAACCAGACA ACCAGCAAGC TGAAGCATTC CATGACTGTG ATCAAACGAC GGAGCTCCTT GCACACTTGA CTGACCGTGT GAGTCACATG GGAGACGGCG ATATCCAAAA AGTCCTTGCT GCATCCCGCC GTACACCAAT CAATTGTACC CAGTCATCGG ACAATCGACA ACAGTCTGTT CAACTCAACG TTCTGGAATA TCAAGTCTCT CGTCATTCCG TTGAGAACAA AACTGCTGCT CTAGTCGATC GAGGTGCCAA CGGTGGACTT GCTGGCTGTG ATGTCAAAGT TGTGAACAAG ACAGGACGGT CTGCTAGTAT AACGGGTATC 
AACGAGCATA CCCTGTCAGA TTTGGATATT GTCACTGCCG CTGGGTTTGT CGAGTCTCAC AAAGGCCCTA TCATTGTGAT TATGCACCAA TACGCCTATC TTGGCAAGGG AAAGACCATC CACTCCAGTG CCCAACTTGA GCATTACCGA AACACAGTCG AAGACCGGTC TCGCAATGTT GGAGGACAAC AGCGGATTGT TACCTTGGAT GATTATATCA TTCCTCTTCA TGTTCGACAA GGCCTCCCGT ATATGGATAT GCGACAGCCT ACCGATAGCG AGTTCGAATC TCTTCCGCAT GTTGTGTTGA CTTCCGATAT TGACTGGGAC CCTTCTATTC TAGACAATGA AGTTGACATG GTGAACGACT GGTACGATGC AATGCAAGAT CTTCCGGGCA ATGCCTATGT TGAACCACGA TTTGACAACA CAGGCCAATA CCTCCACCGC CATATAGCGT ACTATGATCT CGATCGTGAG GACGCTATTG ATTGCATTAT CCAGTGTCGT AAGCACAATG TCAAACGCAA TGAACGGGAT TATGAAGCAT TACGTCCCTG CTTGGGATGG GTATCCGGTG ACACTGTCCG AAAAACCATC ATGGCTACGA CACAGTACGC TCGCGAAGTC TACAATGCAC CGCTACGAAA GCACTTCAAA TCGCGATTCC CGGCTCTAAA TGTGCATCGG CGCAACGAGG CTGTTGCAAC GGATACTATC TGGTCAGACA CACCTGCTGT TGACAACGGA GCCAAGTTTG CACAACTGTT TGTGGGGAGA CGTTCCTTAG TCACCGATAT TTATCCCATG AAAACAGACA AGGAGTTCGT CAATGCCCTT GAAGACAATA TTCGCCATCG TGGAGCTATG GATAAACTTC TGAGTGATCG AGCCCAAGTC GAAATCAGTA AGAAGGTTGC TGATATTACA CGAGCCTACA ACATTGACCA ATGGCAAAGT GAACCTCATC ATCAACATCA AAATTTTGCC GAACGCCGTA TTGCTACTAT TGAAGCTAAT ACCAATAGCG TTCTTAACAA AACCGGTGCT CCTGATTCCA CTTGGCTCTT GTGCATTGCC TACATCTGCT ATGTCTTCAA CCATTTGTCC CATGAATCTT TGCACGATCG TACACCGCTC GAAATCCTTC TTGGTAGCAC CCCTGATATC AGCGTACTTC TCCAGTTTCA TTTTTGGGAA CCGGTGTACT ACCGTATCGA AGATCCATCT TTCCCTTCCG ATGGTACCGA AAAGAGCGGT CGCTTTGTTG GCATTGCTGA ATCTGTTGGG GATGCTCTCA CTTACAAAAT CCTCACAGAC GACACCAACA AGATCTTATA CCGCTCTAGT GTGCGTTCCG CATTGAAATC CGGAGAAATC AACCTACGCC TTACGACACA GGATGGGGAG AGTAATTCTA AGCCTATCAA CTTTGTCAAG TCGCGTAGAA CTGAAAACAA AAATTCCTAT GCCTTAAAGG ATCTACCCGG TTTCACCCCT GAGGACCTTA TTGGACGCAC GTTCCTAACC GATACTCAGG ATGATGGGGA GCGTTTTCGT GCACGTATCA CAAGGAAAAT CTTAGATCCC GACAAGCCCT CTGATGTGCG TTTTTAG
|
Protein sequence | MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NKLDALGYVP PASPDTHEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF VVTAVSHNVE KVLDPSYAPT DPSEKSLFEQ QKKFVYSALE HTLQTDMGKN LVREHSFDFN AQEVFRKVVK HYTESASAKI GSSNTLAYLT TAKYGTSWTG TAEGFILHWK NHLRIYNDMV PMAEQLPKQL CLSLLENAVH DIPELRQVKI TATLDLAKGG TPLNYEGYLS LLLASASLYD KGNNLSNSRS VKSKRSAFLT DLSYDQPDFT EDNGIDYDID LSPAVIYEAN THNRKVSPSG HRNRDPATNR ERPYIPREMW NQLSDDAKAI LQGLSAPDKG PTRSGDVSQR ALEANTHAKI SNDNGEFNRS EPDNQQAEAF HDCDQTTELL AHLTDRVSHM GDGDIQKVLA ASRRTPINCT QSSDNRQQSV QLNVLEYQVS RHSVENKTAA LVDRGANGGL AGCDVKVVNK TGRSASITGI NEHTLSDLDI VTAAGFVESH KGPIIVIMHQ YAYLGKGKTI HSSAQLEHYR NTVEDRSRNV GGQQRIVTLD DYIIPLHVRQ GLPYMDMRQP TDSEFESLPH VVLTSDIDWD PSILDNEVDM VNDWYDAMQD LPGNAYVEPR FDNTGQYLHR HIAYYDLDRE DAIDCIIQCR KHNVKRNERD YEALRPCLGW VSGDTVRKTI MATTQYAREV YNAPLRKHFK SRFPALNVHR RNEAVATDTI WSDTPAVDNG AKFAQLFVGR RSLVTDIYPM KTDKEFVNAL EDNIRHRGAM DKLLSDRAQV EISKKVADIT RAYNIDQWQS EPHHQHQNFA ERRIATIEAN TNSVLNKTGA PDSTWLLCIA YICYVFNHLS HESLHDRTPL EILLGSTPDI SVLLQFHFWE PVYYRIEDPS FPSDGTEKSG RFVGIAESVG DALTYKILTD DTNKILYRSS VRSALKSGEI NLRLTTQDGE SNSKPINFVK SRRTENKNSY ALKDLPGFTP EDLIGRTFLT DTQDDGERFR ARITRKILDP DKPSDVRF
|