Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39605 |
Symbol | |
ID | 7195261 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 245918 |
End bp | 249422 |
Gene Length | 3505 bp |
Protein Length | 1062 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183575 |
Protein GI | 219126671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.608294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAA CCAATACGGA AGAGGAGCGA CAGCGGCCTC CCACGCATCC CAAAAGCACG ACGGTAGCGC CCGAGTCGCG GGATGCGGGG ACGAATTACA CGAGCGGCGT CCCGAACGCA ACCTCGGCGG CGGACCCCTC TTCCAACGCC TCGACGTACC CTCCACCACC GCCTCCACCG TATCCGTTCG CGGTGGAAGA ACGGGCCGGA CGGGGCGTCC TGTCGTCCGT GGAAAAACGA CGACAGCACG TTCGGACCGC CTCGGGAGAG CAATGGCACG CGGCTGTGCC CGGCGAACGA CCGCCTCTCG GAGGCAACGT ACCCTTTCAA CGAATGTACA ACAGTATGGC GACACCGCCG GCTCCCATGA GTAGGACGTC CAGTACGGCG TCGAGTACGG ACGAAGCCAA ATCGGCTAGG CAAAAGTTCC AGGACATTGC TCGGAAGGTC CGAATGTTGA ATCTCGTCGC TCCGGGCAAC GCCCACACCA ACCACGGATT GACCAGTCCG AACGGCAGTA CGTCGGGGGG ATCACGGGGA CACCGGAAAA CGGCCAGCCG TGCGCACGCG TTACTGGACA GTATCCAAGA AGCAACGGAA GATGACGAGT CGAGGTCCAA CTGCGGGAAC GTCTTCTTTG AAGGCGTCGG AGGGGACGCG CTCACGCAGG GCGAATCTCA CTTCTCGGTG GAAGCCACGG ATGCCGATCG ACTCGTCGCC GGCGCGATAC AGGTGGAAAA ACTCTTCGCC ACGAATGATA CGGGGTCGAC GAGCTCGACG GAAGAACTCC AAGATGCGGA CCACGACGGC GGCGCGTTGC CCGATTACGA AGAACAAGTG CCCTTGCGGG ATCATAATGA AAGATACGGA TCCCTGGATG AGTTTAGCAA CGGCGGTTCG AAGTATCCAC GCCGGGTCGT TAAAAAACGT ACGGACCGGT ATTCTTCAAA ACGCCTGCTT CGTCGCATTG CGAAAATGTG TCACCCATTC ACTTTGCTGC AAGCCTTGTG GCATACTATC CTGCATTCCT ATTTCGTCAC ACTAAGCCTA CCGGCGTTTG TTGCAGCGTG GGTGGTCTAC TATCATCTCG GCAACCCCAA TTTGGAGTTT ATGCCCGGGC ACGCATCGGT GGCGTGGTGG CTAATCTTTG TTGGGCGTCA GTGTATCACG TTGGAATTGG CGCGTCTTAC CCAATGGTTA CTCGTGGACA AAATCATCCT CGGTACCCGG GTGGCCGTCA AGTGTTTGGG ACCGTTGCCG ACATTGTACG CCATTCAGGC CAAGGGTTGG CCAATTATGT TGGCCTTATG GGCCATTGCC GACCTTTTCT TGCTGCACGG AGACAATCGC TTCCAACAGC ACTGGTTCTA CTTTACCGGT ATTTCCATTT TCAAAGACGC CAATTCGGGG GTGTACATTT TAACTTCCCC AACCTACTTG CGCATGTTGC TTAGTATGCT GGTGGCAGGT CTCGCGACCG CGGGCAAGCG GACTGCCGTG GCCATGTACT TTGGACGTCG TACGTTTTCC GAATTCAAAC CTCGACTGGA AAAGATTTTG CGAGAAGTAG TGTTGCTTTC CGAAACTGCC GAGCTTGCTC AAGAAGCGGA ACGGGTATCC TACGGGGTTG GACAGACCGG GGAAACTTTG GAAATCGATT TGAATCGTCA AGATAGTCGA GTGCAATTCA TGGACGACGT TTCGTGGACC ACTGATCGAA ATTTGGTGAA GAATTCAGGA CGGGGAAACG CTATGGATGA ATCGAGCGAC GATGAGAGCG AAGATAGACA CAGTCCAGCA AATTTGAAAA GAAGAAGGAG CGAATCACTG AACGACGCAA TGATGGAAAA GACCGAGAGT GGCAGCTTTC GAGTGAGCGA CTTGCTCGAA AATTGGGAAG AGCCAGTGAA CAAACTCGAC AAGGTACGGT TATTTTTTGT TTGAGTATCA TTCTTTGTTT TAATCCTTAT ATACTTACAT CTTTCGAATT CGTATATAGT CTTTGAATGC GTCCATTAAC GATATTTTGA AATTTCGACG TGCTTTGACA TTTATGGACG AACAACATCC CTTTGGGGAT GCCTTCGGTC CTGCTGCATC ACGAAACGAT GTCATCAGTT CGGCGCAGCA AGTATATCAA CGCCTTTTGA AAATGACGCC CGAGAGTATC ATGCTGAACT GTGACGTCTT CACTATGTTG GCGGACGAAG ACGAGGGGGC TACCACCAAC TTGGCCAAGA GGAAGGCCCT ACGCAAGCTC TTTCGTCCCG ACGCAAACAA CGAGCTTTCT CAGTTGGCCT TTATTCAATC TTGCGATTCT CTGTACAAAA AGCTGCGCTT CTTTCGTGCT TCTGTAGGAA ACGCCTCGGT TATTGACCAT GCCCTCGAAA CTATCATCGA CTTTCTCTTC AACTTCATAC TGGCGCTTGC TTTGCTTTCG CTTATGCGCT TTAATCCTTG GCCTCTGTTG GTATCGGTTT CAACATTACT CGTGTCTGTG TCCTTTGCTG TCGGATCTAG TGCCAGCAAA TACATAGAAG TAAGTCACCC GTTTCAATGC CTCCGTCATT GCCCATCTCG TTTCTCAGGT TCATTTTCTC CATAACGTTA CAGGGCATAT TGCTGATTGC GGCAAGAAGG TGAGTTGTTC CCACTATTGT AGTAGCTATT TCTATCGCTT CGAAACGAAC CGTTCCTTAC ATTACATATG TTCATAGACC TTACGATCTT GGTGACCGCA TATACATGCT GGATCCGTCT GTTTTAAACA GCAACGACGG CCTTTTCTGG TCCTGGTTTA TTGAAGGTGC GATCGTGTTG CATTGAAAAC ACCAAATGAT CAGTTTTTTC AGTTCTTCAG CTTTCTTACT CGTCCCACTT CATTTGTACT TTAGATATTA ATCTTTTCCA AACCACGGTA CGCTACGCCG GTACCAACGA AGTGGCGACC ATCAACAACG GTTCAATTGC AAATTTACGT ATCGTAAACG CTAACCGGTC CCCCAACGCT GTTGTTTGGT TCCAGTTGCC TTTTCACATT TCTGTCTTGG AGGAGAAGCG AATGGACCGC ACCCGTGTGG CGCTCGAAAA GTACGCTCAC GCGCGTCCCC GCAGCTGGCA CAGTTTTTCC TATTGTCGTG TTGACGAGGT CCATGTCGAG TTGGAGAAAC TAATGGTCAC CATAGGCTTT CAGCACCGGA CTTCTTGGCA AGACTTGGGT CGAATTTTGA TGGACAAGGC CGATCTGATG TGTTATGTGT ACCAGCTGAC AAAAGATTTG GGCGTTGACT ATGAAGAGCT TCCACAACGT GATCTAGTAT ACTACTCGGG TTTGCTCAAG AGTGGTGGCG TGCGTAACTA CCGCAAGGGT CTCGTCAACC CCTTGAACAT TCAAAACTCT GTTGAAGGAG AAGTACCCAT CAGACAATCA TCATCGGCTG CATCTACTCG TACCAACGAT ACCGATTCGG TGAATCGCGC CTTTCTCGCT AATCTACGTC TTTAG
|
Protein sequence | MKPTNTEEER QRPPTHPKST TVAPESRDAG TNYTSGVPNA TSAADPSSNA STYPPPPPPP YPFAVEERAG RGVLSSVEKR RQHVRTASGE QWHAAVPGER PPLGGNVPFQ RMYNSMATPP APMSRTSSTA SSTDEAKSAR QKFQDIARKV RMLNLVAPGN AHTNHGLTSP NGSTSGGSRG HRKTASRAHA LLDSIQEATE DDESRSNCGN VFFEGVGGDA LTQGESHFSV EATDADRLVA GAIQVEKLFA TNDTGSTSST EELQDADHDG GALPDYEEQV PLRDHNERYG SLDEFSNGGS KYPRRVVKKR TDRYSSKRLL RRIAKMCHPF TLLQALWHTI LHSYFVTLSL PAFVAAWVVY YHLGNPNLEF MPGHASVAWW LIFVGRQCIT LELARLTQWL LVDKIILGTR VAVKCLGPLP TLYAIQAKGW PIMLALWAIA DLFLLHGDNR FQQHWFYFTG ISIFKDANSG VYILTSPTYL RMLLSMLVAG LATAGKRTAV AMYFGRRTFS EFKPRLEKIL REVVLLSETA ELAQEAERVS YGVGQTGETL EIDLNRQDSR VQFMDDVSWT TDRNLVKNSG RGNAMDESSD DESEDRHSPA NLKRRRSESL NDAMMEKTES GSFRVSDLLE NWEEPVNKLD KSLNASINDI LKFRRALTFM DEQHPFGDAF GPAASRNDVI SSAQQVYQRL LKMTPESIML NCDVFTMLAD EDEGATTNLA KRKALRKLFR PDANNELSQL AFIQSCDSLY KKLRFFRASV GNASVIDHAL ETIIDFLFNF ILALALLSLM RFNPWPLLVS VSTLLVSVSF AVGSSASKYI EGILLIAARR PYDLGDRIYM LDPSVLNSND GLFWSWFIED INLFQTTVRY AGTNEVATIN NGSIANLRIV NANRSPNAVV WFQLPFHISV LEEKRMDRTR VALEKYAHAR PRSWHSFSYC RVDEVHVELE KLMVTIGFQH RTSWQDLGRI LMDKADLMCY VYQLTKDLGV DYEELPQRDL VYYSGLLKSG GVRNYRKGLV NPLNIQNSVE GEVPIRQSSS AASTRTNDTD SVNRAFLANL RL
|
| |