Gene Information |
Locus tag | PHATRDRAFT_42841 |
Symbol | |
ID | 7196438 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1286304 |
End bp | 1289597 |
Gene Length | 3294 bp |
Protein Length | 915 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177259 |
Protein GI | 219111015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGAT TCAGTTTAAC AAATGTAAAC TCCGATGAAG AAGAAAGTAA GTCTAGGGAT TTCGAGTCTA CGGTACTAGC TTTAAATGGT GTGGACGTCG AACTCGAGTA CAGCAGCGAC GACGACGATG AAGAAGACGA GCCTCCGCAT CCACTGGGGT TTCGGTCGAG CACTCGGATT TCTCTCCATA GTCCGACAGA AACAATGGAG CAATACAAAC GTAGGTGGGG TTACTTCTAC CCGATAGACG ATGCCCATGA CGAGAACTTC TTTGACACCC CAATAGATTC CAAAATCCAA TCCTTTCCAC CGTTATCGAC CTATATGAAC CCCAACACAG TATCCACGAG AGCCCTTATT GGCTACGCGA ATGAAAGGAA ACCAACAGAT GACTGTGTGG ATCAAGTAAT TCAGCTGCTG CAGGCCGCCG CTTTGGTTGA CGAACGAGAC TTGACGACAC CGCTTTTGCT GGCTACCAAT ACACATCTCG ATGCCACCAA TCGGTTAACG CAACAGAAAT TGATCGAAAT TCAGCATGAA ATTGACGTAG AACGCAGGCG GATGGAGCGT GACCATTTGG AAGCAGCCGA AGCCCTTCAA TTGATTCTGC ATCGAAACCA AGAAACGGCT GAATTGATAA GTGCGGAACA ACGTCGGCTC GACAGCGTCG CACAAGAAGG AGAGGATATC CGAGCTCGAG CGGACAAAGA AAAACAAACA GCGTTGAAAG ATGAAGAGCG ACAGAAAGAA AAGGACGCTC AGGAAAATGC CAAGCGAGAG AATCTAGATT CTCAGAGGGC TGCGGAAAAA GAAGAGGCTG TAAGATCCTC GAAGTATGAA TTTATCGCCA AAGCCAAGAA GCTTGTTGCG CAGCTTGTGC TAATTCGGGC TTCAGTAGAA TCATTCGAAA AATCAAAAGC TGTTGGAAAG CGTCGATTAC AAATGAAGAA AATAGTCAAC GGCAAGGTCA ACACTCTTTC TGAAAACACG CAAAAGATAA GAGAGGTCGC AAATAATGTA TCACAAGCTA TTGAAAAAGC GCGCGACGAA GACAAACAAG CCAAGGAGCA GGGCGAGGAG GGGAACAAAG GCTTCCTCCC AGAAATGGCT AGAGGAAAAC GATACTTTGT TGATTTGCTC TCAAGCAAAG TCATTGTCCG GGTCCAAGCC GAAGGCTTCA ACGGGTGAGT TTAGCATCTA TCAATGGACA CTTATTTGTG TGCATGCATA CACTCACATA TTTTTACTAG TCAACGCGGT GATGGATTCC CTCTTGCAAA TATGTTGGCT CAGGTTTCCA CTGATCACAA AGAACTCGGT CCAAATCTTG CTGCCCATAT ATACACCGTC TGCCCCACAG CTATACCGAG CTTGCCGGAT CCTGCACCAG ATGCAAGCGA GGATGATCTG ATGCGAAGCC TTGGCATGTT GCAACACGCG GATGGCAACT TTGAAAGCTT TGAGCGCTTT TTGGGCAGAA CTGAGGTAGG TGCTGCTGCG TCAACAACAA TTTCTTCAAT TTTTGTCGAA GAAGTGATCG AGATGCTGTA TGCACATGCC TAACATATTT CTCATAACAG GGCATAATTT CAATGGTGGC AAACATCATG TCTTCAAGTC CTGCAAATCA TACGCTGCTT GGCGGCCATG AAGGGGCAGT CAAGTGGATG ACGCGATTTC TTTCCTTGCT ACCAAGCAGT ACCGACACAG CCCTCCCATT GATCGTTGCA CCTGTTCTGG ATGCATTTCT TACAGGTGCA GGTCACATGC TCGCAAATAT CCACGCTGAA GAATTCAAGC 
TCCTATTGAA AGCTATCGAC GAGAACGTCT TGCCTAGATT GGATGACGGG CCCACTGGGA AGCCTTCCGC TATGCGGTTA GAGAAGACTA TGAGTGGAGG ATACGAAAAG TTTCAAAGTA CACTTCCATC TCGTGCTTTG GCGGAGTTTT ACAATGGATC AAGCTTCTTG CGAAGTCACG GTAGCACATC AACACCCTCA CCTTTCGGTC ATTCGGTGTT TGCCGGAAAT GCGGGGACTA CGGGCGTCCC CCCTTTCGGC CAAAGCTCAA CGAATGTAAG TTCTGGCCCT AGTTTTGGGG GACCTAGCAT TACTGCGAAG AAACAATCAC CTTTTGGTCA GAGCTTGGTA GAAGCGGGAA CAAGCTCGGA TGCATTTGGA AAGCCACCTT TGAATGCATC CCCTTTTACT GGTAGTGGGA CAACATCTCA TACTACATCA GGTATCTCAA AAATGGACCA GTCTGATTCA TTCCAGCCAA AACAGACGCA AGCATTGCCG TTTGGCGTAG CTTCAAACAC TTCAACTTTT GGACAGGTTG CTTCTTCGTC GCCAGCTCTA CTTCAAAAGT CATCCCCCTT TGGAAACACT TTTCCAAGTC CGTCCCCGTT TGGAACTGCT GCCTCCGTCC CCTTTGGAAA GCCGAACGCA GTCCCTTTAT CCGTGAGTGC ACCCAATCCA AGTACTTCTC CGTTTGGAAA TTCTTTACAA ACCGTTGTGC CCTTTGGTGG CCAAAACATG AGCCTTCCTT CGTCTGGTTC TTCTATTCCA AACACTTTTC CATTCGGAAA TCCCGCACAA GCCGCTTCGC CGTTTGGAAA GCCAACCCCA ATGCTTTCGT CATATGGTAT TTCCCATCCA AATCCTTCGC CATTTAAAAA TCCATCAAGT GGCTCGTCTC CATTTGGCTC GGTTGTTGCT AGCTCATCTC CGTTTGGAGG GACCAGCAAG GTAATTTCCA CATTCAATAG TGGCTCGTCT TCTCCGTTTG GCAGCCAAAC GGGTTCTTCG TCTCCTTTTC CATCAGTTGG AGATTTTCCA CCAAATTCTA ATGGAAGACC CAGCCACACA ACCAAAACGC AGCCTTGTAA ATTTTTTGCC GAAGGTCGCT GCCGCTTCGG AGACAATTGC AGGTTCTCGC ACGAACCTCC TAACAGTACC AGAATTCCAT CATCTAGTCC CTTTATGAAC ACAACATTTC GTCAATGACC AACATCCGCA CACAAATCTG GGTTGATAGC AATGCTTTAT CCTCGGTCTA TCGATAAGAT TTGGATCGCA AGTGCCGTGG GGGAGCCCCA GTTCGCTTTG GGCAGTCTCT GATGAAGGGC TTGGAGACAG TAAGTTTTCA TTAGATGGTA TGAGACATGA GAAGCTGTTC GAATTGTGAA CACGTTTAAA ATCGTCTAGC ACTTCGAATT AAAAGACAAG GCTGTTCTTA CTAATGGTAG CATCTTCGAG CTAAAAGATA CAATTAAGAT TATCGACTAT CTAG
|
Protein sequence | MARFSLTNVN SDEEETLNGV DVELEYSSDD DDEEDEPPHP LGFRSSTRIS LHSPTETMEQ YKRRWGYFYP IDDAHDENFF DTPIDSKIQS FPPLSTYMNP NTVSTRALIG YANERKPTDD CVDQVIQLLQ AAALVDERDL TTPLLLATNT HLDATNRLTQ QKLIEIQHEI DVERRRMERD HLEAAEALQL ILHRNQETAE LISAEQRRLD SVAQEGEDIR ARADKEKQTA LKDEERQKEK DAQENAKREN LDSQRAAEKE EAVRSSKYEF IAKAKKLVAQ LVLIRASVES FEKSKAVGKR RLQMKKIVNG KVNTLSENTQ KIREVANNVS QAIEKARDED KQAKEQGEEG NKGFLPEMAR GKRYFVDLLS SKVIVRVQAE GFNGQRGDGF PLANMLAQVS TDHKELGPNL AAHIYTVCPT AIPSLPDPAP DASEDDLMRS LGMLQHADGN FESFERFLGR TEGIISMVAN IMSSSPANHT LLGGHEGAVK WMTRFLSLLP SSTDTALPLI VAPVLDAFLT GAGHMLANIH AEEFKLLLKA IDENVLPRLD DGPTGKPSAM RLEKTMSGGY EKFQSTLPSR ALAEFYNGSS FLRSHGSTST PSPFGHSVFA GNAGTTGVPP FGQSSTNSLV EAGTSSDAFG KPPLNASPFT GSGTTSHTTS GISKMDQSDS FQPKQTQALP FGVASNTSTF GQVASSSPAL LQKSSPFGNT FPSPSPFGTA ASVPFGKPNA VPLSVSAPNP STSPFGNSLQ TVVPFGGQNM SLPSSGSSIP NTFPFGNPAQ AASPFGKPTP MLSSYGISHP NPSPFKNPSS GSSPFGSVVA SSSPFGGTSK VISTFNSGSS SPFGSQTGSS SPFPSVGDFP PNSNGRPSHT TKTQPCKFFA EGRCRFGDNC RFSHEPPNST RIPSSSPFMN TTFRQ
|
| |
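The fields above can be cross-checked against each other. The sketch below is illustrative and not part of the source database: the helper names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed Gene Length of 3294 bp, and that the spaces in the sequence fields only group residues into blocks of ten.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Start bp and End bp taken from the Gene Information fields.
print(span_length(1286304, 1289597))  # 3294, matching the Gene Length field

# First two blocks of the gene sequence, used here only as a short demo input.
toy = clean_sequence("ATGGCTCGAT TCAGTTTAAC")
print(len(toy), gc_fraction(toy))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the listed GC content. The record does not say how that percentage was rounded, so small differences are possible.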