Gene Information |
Locus tag | PHATRDRAFT_38931 |
Symbol | |
ID | 7203696 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 579206 |
End bp | 582580 |
Gene Length | 3375 bp |
Protein Length | 1031 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182864 |
Protein GI | 219125179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
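The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TAG, which is consistent with that). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 579206
END_BP = 582580
PROTEIN_LENGTH_AA = 1031


def gene_span_bp(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length_bp(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting the stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = gene_span_bp(START_BP, END_BP)
cds_length = coding_length_bp(PROTEIN_LENGTH_AA)
# Under the stated assumptions, the remainder is non-coding (intronic) sequence.
non_coding = gene_length - cds_length

print(f"gene span:  {gene_length} bp")
print(f"coding:     {cds_length} bp")
print(f"non-coding: {non_coding} bp")
```

The computed span matches the reported Gene Length of 3375 bp. The coding length is shorter than the span, which is what one would expect for a gene marked as spliced.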
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGTT GGGAAATCGA GTCCGTGCAG GATTGTCTGT GCCGGACAAA TCGTTCCTTC GGTGGTCGTA GGGACCATAA AGAGCGACCT TCTTTTCTCA CGATTGGCTC ATACTGCTTT AGCCCAATAT TCGTCGAACC TTTCCGGGGG ACGACAAAGA CAGTAGCGAA GAAGACGACT CGTCTCCACA AGTGTTCCAC CGTAATGGTG GGGGCAAGCA GTTAGTCCGA CCCGGCGGCA AGTATCTCCG ACCGCGGCAC GACGACGAAG ACGATGATGA AGAATCTTCC TCAGAGGAGG AAGAAGACCC CCCGCAACAG CAGATTTTTA GTCGAGCCGG TGGTGGCAAG CAGCTGCGCC CGTCGGGAGG AGGCGGAAAA TATTTGCGTC CATCGTAAGT AAGCCCCCGG TCCCGCGTTT TGATCAAATA ACTGTATCTA CCTAGTTTTC TGTCTGTCTA ATTCTGTGAT TGTTTACAAC AGTGGAGGTC AGGAAGACGA CGACGATGAG AGTGAAGATG ACGATGAAGA GCTGGACGAC GACGAAGACG ACGAGGAAGA CCAGACTGCT CCACAATCCC AGACCGTACG CGTAATGCCG CGTGGAGGGG GCAAAAACTT TATTCGAATG CACGCCCAGC AACAGCAAGC GGAAGACAGC GAAAGCGAGA GCGAAGACGA CGAGGATGTG GTGGTGGTGG AAGAAGAAAC GCCAGCTTCT GCGCCGGCCC CGGTTGTAGT GCCGGCGGTT CCGCAAGCCC GAGGCAAAAT GCTGTCCAGC ATGAGACCTC CGCCACAAGA AGACGACGAT GACTCGGAAG AAGACGATGA CGAAGTAGAA GTTCTGGAAA ATGTCCAACC AATTAAGCCG GTGCAGCGTG GCGGAGGGAA AGCAATTTAT GGTGGAGCGG GTGGGGGAAA GCAGCTTCGC CAACCAGAAC CCGAACACGA GGAAGAAAAG CCCAAGCCAA CAACCATTTC GCGATCTCCC GCCGGTGGCA AGCAATTACG TCGCCCTGAA GCTGAGAAAG AAGAGGAAAC GCCGAAGCCA ACAAATATTT CTCGATCCCC GGCAGGTGGA AAACAACTAC GTCGGGCGGA ATCAGAAGAG GATATTCCAA AGGTTGCTGC AATCATACGT TCACCCGAAG GGGGAAAGCA ACTTCGAAGG CCTGAACCCG AACAGGAGAA AAAGGAGCCG TCTGAGGCAT CGAAAATTTC CCGCTCGCCG GCTGGGGGTA AACAGCTCCG ACGTCCTATT CCGAAGGAAG ACACTGAAGA CGACGACGAT GAAGATGATG AAATCTTTGG CGACGATGAC GATGATGAGG AGTCAGACTA CGAAGAAGCA GTCGCGGCGA AAAGATCGAC TGCAAACGGT CGACCCAAAC GCAGAACTGC GGCAAAGAAG GTCATTGTTG AGGAAGACGA GGAAATCTTT GGTGACGACG ATGTCAGTAG CGACGAAGAA CGGGAACTCG ACATTAATAA TACCGAAGCT TTGATCCGCG ACGAAGACGA TCGCAAATAT CTAGATACCC TGCCAGAGCT CGAACGTGAA GCAATATTGG GCGAGCGTTT TGAAAAACTC AAGAACGAAC AGGATTTGAA AAAAGCCATT AAAGAGGCCA AGTAAGTAAA TTGTGGTGCT TTCAACGGCA CGGTGCATTG ACCTCCGCTC ATTTTTGGGT TCTTTATTAA CAGACGCCAG GCAGACGAGA AATCGGGAAA TGTTCAATCT ACAGCACAGC GTAAGGCAAC CCCGGGCAAG AAATCTGCCA AAAAAGGTAC GGCAGATGAC 
GACCAAGCCC TCGCGAGAAA ACTCGCTGGA GCCAGTCGGC GAGAATCTAC TCGTGACAAG GATGCGAAAG GAGCGAAGAG CAAAAAGGCC GCGGCTCTAG CGGCCTTGAA GAAAGAGCGC AAAATACAGA AGCAGCAAGA TTCCGATGAC AGTGAAATGG ACTTCGGCGA CGATTCCGAT GATGATTCCG ATGAAGATTA TGATGATGGG GGCTTTATGC CGTGGCAAAA GAAGGCCAAA ACACCGAAAT CGACAGTTTC ACGTCTCGAC AAAGATGACG AGAAAATGGA TTCCGAAGAT GACCGCGACG GCGCCGACGT ATCCAGGAGT AAGACGACTT CGGATCGGAG TGGAGGTTCG GCGGTTGAAG CCACTTTGGA GGACTTCAAA AAAGTAACAG TTCCTCGCCG GCGCCTTGCA CGGTGGTGCA ATGAACCTTT CTTCGAAGCA GCTATTTTGA ATTGCTTCGT ACGGGTGCTT ATCGGAGAAG ACGAGAATGG CGACAAGGTC TATCGCTTGT GTGAAATCAC GGATGTCAAA ACAGGGATGA AAGTGTATAA GTTTCCGATC GCTAAGAAAG GTGACAAGCC CATCATGACT ACGAAGACTC TGACTCTGAA ATTTGGGAAA AACGAGAAAG AGTTTCCTAT GTCGTTGGTC TCTGATGCGC CGCCGGACGA AGTGGACATG AAGAAGTACG TGACTGTGAT GAGAAACAAT CGCCAAGAGC CTTTAACGAA GCGACAAGGA AACAAACTTC ATCGTCTCCA GCACGACTTG GTTCACAACT ACGTATATAC TACTGAAGAC ATTGAGCGCA ATCTTCAACA ACGAAAGAAA CAAGGAAAAA AGCTGGGAAA TTTTGGAGCG GAGCTGACCA AGGCTGCGAT CGCAGTTCAA GCTGGCAAAG ATTTTGTCAA TGAAGCAGAG AAAAAATTGA ACGATGCCAA GCGAAGCTTG ATGGAATCTG ACAGTAATGA TGCGTCTTTT GAGAAGAGCG TGAAGGATGC TGAACAGACT CTTGAGCGTG CAAAGGCGAA CCTGGAGGAG ATTATACAAG ATGAAAGAAA GATGCTTGAT GTCGTGGATA ATCGTAAGCG ATTGCTAAAC CAACGAGCGA AAGATCGAAA TTGGGCCAAA GTTAACCTGC GTGCCGTCCA AGCAAATCAA AAAGCAGACC GGGAGGCTAA CAAGCCACTT GACAGTGCAC TATCGGGTTC CGCCAAAAAA GATACGTTCA ACCCGTATGC TCGCCGTCGA GTGAAACCGA AGATTCTCTG GGAGGTAGGG CAAGATGACG ACACGGAAGA AGCGAAAGTA GGGGAAGTCG GGGGAAGCGA AGACGCTCCG AAAGAATACT CGAACATTCC GCCACCTAAT CTCGTTCAAG AAACCGACGA CAACACCGCT GCCCTTAGCG AGAGCCATCA GTTTGCTATA GATGAAGAAG GGCTTGCTCA GGCATCGGCT ACATCTATTC TGTTCGGATC AAATAGTTCA ATGAAGCGAA AACGCAACCG AAGAGGTCTA AGCCTTTCGG ATTACATGGA GCAAAAAGCA AGCGGGCGTT TATAG
|
Protein sequence | MPNIRRTFPG DDKDSSEEDD SSPQVFHRNG GGKQLVRPGG KYLRPRHDDE DDDEESSSEE EEDPPQQQIF SRAGGGKQLR PSGGGGKYLR PSGGQEDDDD ESEDDDEELD DDEDDEEDQT APQSQTVRVM PRGGGKNFIR MHAQQQQAED SESESEDDED VVVVEEETPA SAPAPVVVPA VPQARGKMLS SMRPPPQEDD DDSEEDDDEV EVLENVQPIK PVQRGGGKAI YGGAGGGKQL RQPEPEHEEE KPKPTTISRS PAGGKQLRRP EAEKEEETPK PTNISRSPAG GKQLRRAESE EDIPKVAAII RSPEGGKQLR RPEPEQEKKE PSEASKISRS PAGGKQLRRP IPKEDTEDDD DEDDEIFGDD DDDEESDYEE AVAAKRSTAN GRPKRRTAAK KVIVEEDEEI FGDDDVSSDE ERELDINNTE ALIRDEDDRK YLDTLPELER EAILGERFEK LKNEQDLKKA IKEAKRQADE KSGNVQSTAQ RKATPGKKSA KKGTADDDQA LARKLAGASR RESTRDKDAK GAKSKKAAAL AALKKERKIQ KQQDSDDSEM DFGDDSDDDS DEDYDDGGFM PWQKKAKTPK STVSRLDKDD EKMDSEDDRD GADVSRSKTT SDRSGGSAVE ATLEDFKKVT VPRRRLARWC NEPFFEAAIL NCFVRVLIGE DENGDKVYRL CEITDVKTGM KVYKFPIAKK GDKPIMTTKT LTLKFGKNEK EFPMSLVSDA PPDEVDMKKY VTVMRNNRQE PLTKRQGNKL HRLQHDLVHN YVYTTEDIER NLQQRKKQGK KLGNFGAELT KAAIAVQAGK DFVNEAEKKL NDAKRSLMES DSNDASFEKS VKDAEQTLER AKANLEEIIQ DERKMLDVVD NRKRLLNQRA KDRNWAKVNL RAVQANQKAD REANKPLDSA LSGSAKKDTF NPYARRRVKP KILWEVGQDD DTEEAKVGEV GGSEDAPKEY SNIPPPNLVQ ETDDNTAALS ESHQFAIDEE GLAQASATSI LFGSNSSMKR KRNRRGLSLS DYMEQKASGR L
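Both sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be joined before further analysis follows. The helper names are hypothetical, and the short inputs are only the first blocks of each sequence, copied from the record for illustration.

```python
def join_blocks(spaced: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(spaced.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the protein sequence.
protein_prefix = join_blocks("MPNIRRTFPG DDKDSSEEDD")

# First three blocks of the gene sequence.
gene_prefix = join_blocks("ATGGTAAGTT GGGAAATCGA GTCCGTGCAG")

print(len(protein_prefix), protein_prefix)
print(len(gene_prefix), gc_fraction(gene_prefix))
```

Applying `join_blocks` to the complete protein sequence and taking its length is one way to compare it against the reported Protein Length. Applying `gc_fraction` to the complete gene sequence gives a value to compare against the reported GC content.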