Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35525 |
Symbol | |
ID | 7200894 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 117869 |
End bp | 119681 |
Gene Length | 1813 bp |
Protein Length | 550 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180178 |
Protein GI | 219118823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGT TTTATGAGGG CATTACTTTC GAATGTATTT CGACTTGGAG AGAACGCACT TCATCGTCAG TTAGCATATA CCTCTAGAGG CGGTCTCTTA TGTTATCGCT GTGTTGACTG AATCTGTACA ACAGACCCAT CCAGCCCCAC TATTGTTATA ATATGGAGCA GCCGGGCAGT ACTCTCCAAA CAAGAAGCGT GAAGAATGTT CGCTATGGGA AAAGCATTGC CATACTTTCT TGTGTTTTGT ACATTTTGGC GGTGGCCAAG CTTTTTATTC CACTTCTGAC CGACGCTCGA AAACTAGAAG ACGCCATTGC CACAGAAAAA AAGCGTCTAT TGGTGGACAA GATAAAGGAG GAGAAGATGC TTCATACTTT TACGGTTCAG CAGAACTTAG CTAACAACGT CTCGGTTCAT TTTAGCAACC ACTTGACGGA GAAGGAGAGA ACCTCTTCCG AACGTCCCCA AATCGTGGAG AGGGAAACGC TAAGAACAAA TTCGCTTGTC GCTATCCGCA GCGACGAAGT GTCAAGCACA AATCTCGGGA ATGCAACAAC AATTGAACCG GGTCCACCGC CCTCGTCACA ACATTACCCC AAAGGGACAA ACTCGACAAA AGACAATGCT CCACTCACAA GGGGCAAACC TAGACCATAC GACGTGGACG TGGGCGACAA AGCATTCCTT AGTTCTCTTT CTTGGGAAGT CTTGCTGAAA CTACCACAAT GTCGAGGTAA GAGAAGATTG CTCGAGATTC TTTCGGCTGC TGGTCTGAGC GCGGGTGACA TTAAAGCTCG CTGTAAGTAC CTTCCGCTTT GGCATGAAGT CGCATCGCTT TACGGAGAAG AACCGATTAT ATTAGGCTTG GAGACGTGTG CGGAGTATCG GCGGTCCGTC AACACCAATG ATCCCAGAAC GAAACGCCTG AACGGACTAC GAATTGCTGG CTTGTACAAT TCAGGGACCA ATGCTTTATG GAAGACAATC GTAATGAATG TGGAGGGTCG GAAAAGTATC GAAAACGACT GGGGAGATCC GGGACCATCA GTTCCATGGG GTAAACACAT GCCACCCCGG TATCGCTTTT CCAATCGCTT TCTTCCGGAT GATCCTCTCA ACGTGATGCC TGTTGTCATT GTTCGAGATC CGTATCGATG GTTGGCCGCA ATGGTAAGCT AGCTAGGCGA AGGAATGGAA ATTGTTTCTC GTCCTTTGGC TCTGATTTGA GAGAGTTCTT ATTTGCGCTT CTCTCTTTCG ATAGTGCAAA GCACCGTATG ATGCCAGATG GCAGAGATCT CCGTACCATT GTCCAAATTT GGCTATCGCG GACAAGGAAA AAGAGATATT CGGAAATTTC ACGAAGCCAT TTCAAGTAGC ACTCTCGGCT CGTCAAACCG GTTTTGCGTT TACGGACTAT TACGATACAT TGGCAGATCT CTGGTCGACC TTTCACGAAG AATATATCAA CGCCACTTTT CCGAGAGTTT TTGTTCGTTT TGAGGATACG ATATACCACG CCGAGAAGGT ATTGAAGGCT TTGACGGAAT GCGTTGGGAT TCCAATCGCT CGCAAGTTTC GCTATTTGCT CGAAAAGCCC AAGAAGCACG GAAATCCTTC CGATTTTGTC ACGGCACTTG TCAAGTATGG GTCTAGTCAG GGTCGCTTTC GAGGAATGCT CGTTGAAGAT CACGAGTATG CCCAAAAGCG ATTCCCGGCC GATTTCTTGG CTGCGCTACA TTATTCGCAT GCAACCCTAA ATCCATTGTC CAAACGTGGT GGACCAAACG GCACAATGGA CATCCTTCGG TGA
|
Protein sequence | MPMFYEGITF ECISTWRERT SSPIQPHYCY NMEQPGSTLQ TRSVKNVRYG KSIAILSCVL YILAVAKLFI PLLTDARKLE DAIATEKKRL LVDKIKEEKM LHTFTVQQNL ANNVSVHFSN HLTEKERTSS ERPQIVERET LRTNSLVAIR SDEVSSTNLG NATTIEPGPP PSSQHYPKGT NSTKDNAPLT RGKPRPYDVD VGDKAFLSSL SWEVLLKLPQ CRGKRRLLEI LSAAGLSAGD IKARCKYLPL WHEVASLYGE EPIILGLETC AEYRRSVNTN DPRTKRLNGL RIAGLYNSGT NALWKTIVMN VEGRKSIEND WGDPGPSVPW GKHMPPRYRF SNRFLPDDPL NVMPVVIVRD PYRWLAAMCK APYDARWQRS PYHCPNLAIA DKEKEIFGNF TKPFQVALSA RQTGFAFTDY YDTLADLWST FHEEYINATF PRVFVRFEDT IYHAEKVLKA LTECVGIPIA RKFRYLLEKP KKHGNPSDFV TALVKYGSSQ GRFRGMLVED HEYAQKRFPA DFLAALHYSH ATLNPLSKRG GPNGTMDILR
|
| |