Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36106 |
Symbol | |
ID | 7201173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 549445 |
End bp | 551928 |
Gene Length | 2484 bp |
Protein Length | 827 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180459 |
Protein GI | 219119395 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAG CCGAAAGCGT TGCTTCAGCA GCTAAACTGA TTGCTTTGGT TGTATTGATT CTTACAACTG CCAATATGCT GGCATTGCGG TCATTTTTCT GCTTGCCTAT GCCTGGCGAT GGTTATGATG AGTACCCAGG GAGGCCTGTT GAAGCTTCCG GCAAAGAAGG CGCTAAAAAA CAAAATATTG ATGTATCCAT CGAGGGCATA GATCCCTTTT ACACGAGTGA TGAAACACTG GAATCGATAA CAAGAAGCAA GCAAGCCAGC GGACAAACAA CTTCCTCTTT ATCAAACGGA TCGTACACTC GGTCTATGGT CGTTGTACGG GCCATCGGGA ATCCTCTTTC CTCGACCAAC TCATCGAACA AACCACTACG GATTCTCGAA TATATTCTTG AAAATGAGCC TGCCTTTCCC AATACAACCC GTCATTGGTT TTTGAATCGA ATTACTGATG CACAGGTCGA GAAGGCAATC GTGCAACGCT TAAAGTCTTC TAATGAAACG TACACCATCA TCCCCTTTTC GCTTCGCGAA TACGACAACG TTTTGTATTC TTTCGACAGT AAGGATACAA TTCATTCGAA GCACTATATC AAACGCTCGG CGGAAGAAAA GGCTGCGACG CTTGTTGAAG AAACTGTGCA ACAAAGCAAA ATACGATATG TTGCCAACAT CAACGGCGCG AGAAATGCCA TGCTACGCTA CGGCAAAAAG AACAGCGCTG CGGAATACAT TCTTCCATGG GACGGACACT GTTTTCTCAC GCGGGAAGCT TGGGATGCAA TCCAGTTTTC TATGCAAAAG TATCCGGAAG CAAAATACTT TACTTCACCC ATTCGTTTAC ACGAAGGATC TATCGATGCC AATGTCACAG AAGAGCCACA AATTGCATTT CATCGTACAG CCCTGGCACA ATACAACGAA TATCTTCGGT TCGGCCGCCG CGACAAAGCA GAGTTATTGA CCCGGATTGG TGTCAAAGGA CAATGGGACG AGGGCATTCC CTGGGAAGAC TGGGAACTTG ACTTCGTAGA ACGTGAGCAA GCAGCCGATT CCGTTAGCGG TGTTCCAGAT GCGGGTTGGG CTGCTTGTGT GCATTCTGGA ATTGAAAATG GCGGCAAACT AGGAACAAAA ATACTAAAAC TGAACCGACG CGGTCATAAC CTATCTGCTC AGTTGGTTAG GTTGGATATC CGAGCGTCAC AGGAGCTTCA TGGGCTCTCA TCTTCAACAC TGTTGTACTA CAAGGAGAGC CAACTGGCAG AGGAGAGGGA GCTTTGGAAA GCAGGCAAGC GATTGCCCCT TGTGAAAGAG CTTTTGGAAT TGGCAAATAA AGCATTGTCG TTCGGACCAT GGTCAGTAAT GGACAAGCGC GGTTTCGGTT GCGGCGTTTC CGGTGACTGT CACGATTACT TCCATGTGGC GCCATATCAA TGGCCTACGT TGAACAGTAC AGGATATACG GACTACTCGA AACCATTTGT TCTACGAGAT GGTGAGCGTG CGCCTGGAAC CGTCGCATTT AGTGAAGGGA GCGAAAAATA TGATCGGACA AAATTGCTGG CAATGAAGTT TAACACTACT GTCTTGGCCT TGGCGTACTC GGTAACTGGA AACATTACGT ATGCCCGCCG GGCAGCGGAG AACCTCAGAC ATTGGTTCAT CTACAATGAG ACAAGAATGA ATGCAAACAT CAACTTTGCA CAGATTAAAT GGGACGTAGA AAAACGAGAA ATGTTTGGGT CTCCGTGTGG CTTAATCGAA ATGAAGGATC TATATTTCTT TTTGGATGGG GTGAAACTCA TTGAGAAGTC TGGCGCGCTA TCTGAATCTG AGATTGATCA GCTACGCAAT TGGTTTGCAA ACTATCTTCA GTGGTTGCTT TCCAGTGAAC AGGGTAGGTG GCAGATTTCT GCCAACAACA ATCACGGTCT GTTTTACGAT GTTCAAGTTG CGCCACTTGC TCTGTATGCT GGTAATCTAC CTCTGGCAAT TTCGCGAATG CAACGGTCGA TTTCGCGCAT TCGACGACAG ATGAATGCCA CCACGGGAGC CCTACCACAC GAATTGAGAC GGCCAATATG CGAACACTAC CAGGCATTTA CGCTCCAGGG ATGGATCACA ATGGCAAGAA TGGCGGAAAA GATCGGTTTG AACTACTGGA AACGATTTGC AGATTCAGAT GCCCCAAACA AAGAGACGGC CCTTTGTCGA GCAGTGCGCT ATGCAAACCC GTATCTAAGT CGTCGCGCCG TATGTCCCGG CAATATTGAC GGTATCGACG CGCGGCGCTG GCAACCAATA CTTTTGGACG CACTGCACCA CTGTCCTATG CTAGACTACA AATCGACGTC CGGACAAAAC AACGTCCTGA TCCCCCCTGA ACTGATAGAT CCGCCTTTGA ATCACTACGA GATTACTGGA TTGTTCAATA TGTCGGACGG GATAGGACCT TTCTGGAACT TGGGACTGTA CTAG
|
Protein sequence | MRRAESVASA AKLIALVVLI LTTANMLALR SFFCLPMPGD GYDEYPGRPV EASGKEGAKK QNIDVSIEGI DPFYTSDETL ESITRSKQAS GQTTSSLSNG SYTRSMVVVR AIGNPLSSTN SSNKPLRILE YILENEPAFP NTTRHWFLNR ITDAQVEKAI VQRLKSSNET YTIIPFSLRE YDNVLYSFDS KDTIHSKHYI KRSAEEKAAT LVEETVQQSK IRYVANINGA RNAMLRYGKK NSAAEYILPW DGHCFLTREA WDAIQFSMQK YPEAKYFTSP IRLHEGSIDA NVTEEPQIAF HRTALAQYNE YLRFGRRDKA ELLTRIGVKG QWDEGIPWED WELDFVEREQ AADSVSGVPD AGWAACVHSG IENGGKLGTK ILKLNRRGHN LSAQLVRLDI RASQELHGLS SSTLLYYKES QLAEERELWK AGKRLPLVKE LLELANKALS FGPWSVMDKR GFGCGVSGDC HDYFHVAPYQ WPTLNSTGYT DYSKPFVLRD GERAPGTVAF SEGSEKYDRT KLLAMKFNTT VLALAYSVTG NITYARRAAE NLRHWFIYNE TRMNANINFA QIKWDVEKRE MFGSPCGLIE MKDLYFFLDG VKLIEKSGAL SESEIDQLRN WFANYLQWLL SSEQGRWQIS ANNNHGLFYD VQVAPLALYA GNLPLAISRM QRSISRIRRQ MNATTGALPH ELRRPICEHY QAFTLQGWIT MARMAEKIGL NYWKRFADSD APNKETALCR AVRYANPYLS RRAVCPGNID GIDARRWQPI LLDALHHCPM LDYKSTSGQN NVLIPPELID PPLNHYEITG LFNMSDGIGP FWNLGLY
|
| |