Gene Information |
Locus tag | PHATRDRAFT_48519 |
Symbol | |
ID | 7194701 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 87505 |
End bp | 90661 |
Gene Length | 3157 bp |
Protein Length | 953 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183029 |
Protein GI | 219125527 |
COG category | |
COG ID | |
TIGRFAM ID | |
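The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs exactly from the start codon through the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
span = gene_span_length(87505, 90661)   # reported Gene Length: 3157 bp
cds = coding_length(953)                # 953 aa plus one stop codon
non_coding = span - cds                 # bases in the span that are not translated

print(span, cds, non_coding)
```

Under these assumptions the span is longer than the coding length, which is consistent with the "Is gene spliced | Yes" entry: the difference would correspond to intronic sequence.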
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.598613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGACGG TGAGACCCAC CACCACCCCT ACCTACCGCG ACAACGGGAG CTTGGGGACC ATTCTCGAAG TACCCGATTC TCCACCGCAA CCCAGTAAAC CAGAGTTGCG CGTCGGCGTG GACAGTGTGA CTGTGTGAGA AAGGTCCAAG ACCGAGAGAT ACGGTACACA TAGGTAGGTA GCTAAGTAGG TAGCTGACTA GCTAGGTATT CCCCCTGCCC CCTTTTTCAG ACACAGCCAT CCTCTTTAAT TCGCTGCAAT CCCACTCACG GTCCATATTC GGTTCCGTTT AGCTCCATTG CGTGGAAGGC AATTTTGAAT GCAATCGATT CGAAATCGAA TAGACTGTGA ATTAGTGGAC TCTGTTCGAT TCAACTGACT GTCTGACTGA CTGATTGTTG GAAAAGACTG TTCCACAACA GGTCGTACGC GTCATTGCCC GCATGTCCGG TTTCGTCGCC AAGCTTCCCC TGCGTGTGCC GTTGGAGACA GATTTGGGGG GACAATTGGG GCGTTGGTTG GATCAAGCGC ACGTGCAAAA GTGGGAAGGC CGACCCGGGA TGACCTCTGC CGACTGTCGT GAGGATCTGG ATCGGCTCGA TCAGATGCGA CGCAACGTCT ACACTGCCTG TCGACACGGC GTCGCCGACG CCCTCCCCCA CTTGCACGTC TTGCAAGAAT ACGCGGCGGC CCTCGAACTC TGTGAAGAGC AAGGCTTTCC GTACAACAGC GGTGCCGTTG ATGATTCTGG AACAACTCGT GACGACGAAC ACGATCGGCA ACATCGACAC AAATTGAACA AACGAGGACA GCAACATCAT TCACCGCTAT CGTCTGCATC GCTCGGAAAC AGCAGCCTTG CCAACATGCT TGAATTCCCC TGGAAGTCTT CCGATAATCA GGAAGAAGTC GACGGTACCT TGGCGTGGGA ACGTGCCAAC GTCCTTTGGA ATCTCGCCAT CGTGCAAGCC CACCAGGCCT ACGCCGTCGA GAAGACACCC AACAATCCAC AGTCTCGCAC CGCCTGGAAA CAAGCTGGTT TGCATTTGCA AACTGCCGCC TCACTTTTGC GGTATTTGCA AACGGATCTT TTACCGGCCG CTACGGAACG CTCTTTTCCG TCACACGATT TGTCGGCCTC CTTCTTGACA CTCTGGGAGC GCTTTTGTTT GGCCGATGCG CAGTACGCCT TTTACCAAGC GGTTGCGGCG GCGCCCCGTC CTTTGCACGC TCTGCTGGCC AAGGTATCGG CCGCGGCAAT TCCACTCTAC GGCGTCTGTG AAGAATTGCT TCTCGACGAC GACGATTACG GTCTCGATAC GAGTAGCTCA GCGAGTATCA CTGCAAACGC CAGTGCTACC GGCCACGCGG CGGCTAACCA GTTCCGCAGT AAGCGATTAC AAATTTGGGG CGACGCGGTG CGGGCCTGGG GAATGTGGAT GAGTGCCTTG TGTGAATATC ATCAGGCGCA AACCCACGCG GACAAGGGTG AACGGGGTCC CGCCCATGCA CGCCTCGAAG CGGCACAGAA ATTTGGATCT CTCTGTCTCG ATTTCTGTAA CAGCGAGGAA GAATCGCTCT TGGACGATTT GGCGGAGCTA GTCTACGTAA CCTTGCAAGA TATGGAGACG CAATTGGAAC AAGCGGAGCA AGCCAACAGA CTAGACCCGG TCGACATTCC CGATCGGAAC GATTTGCCCG AAGTTCCACC CCAAACCATG GTCGAAGTCG AAAAGGACGT ATCGAGTAGT TTGCCAAAGC TGGCACCACC GCTCTTTACC AGTGGTCCCG GTTCTGTGCT 
GCGTCGGTAC GAGCAAACCT TTCGATACGA CATGCAGCGG CTCCTCACTA ACACCACGCT CGCCGCCGAA GATAAAACGG ATCAAGGACG ACGAGCTTTG GCGACAGTTA ATTTGCCGCA TTCTGTTACA GCCTACCAAC AGGAAAGTCA GGGTGGGGGC ATTCCGGACG CTCTGTGGGA AAGGGTCCGA GTAGTGCAAG ACCAAGACAT GCTTCGAGAA TTAAAACAAT CGGTTTGGGA ATTATGTGAT ATTGCCGAAC GGGCGCGTTC GTTGTACCAG ACTGTTCAAG AAAATTTGAA AGAAGATCTG CGGGTGGATT CTCTATTTCG CAGTCAAAAT AGTACGTTTG AAGGACACAA TGTATCGCAA GTTCAAAAGA GTTTTCACAC AACACTCGAG AACTACGATT CGTTGCTGAC GTCAGCTCGG GAAGGGGACC AGCTTGTTAT GCAACGCGTC GAATTGCTCG ATACAGATCC AAAGTATAAG TTGCTACAGT TTCGGAAATC GCAACTGGAT AGACTCTTGC CTGCGGGAGA TCAGAATGTG GACGTGTCCA CGCTCAGTCG AATGCTAGTG GAATTGTCGG CCTTGTTTCA GCGCCGCGAC GTTTCGCTAG AGGAGTTGCG CAACAAAATG GAGGCGTATG ATTTTACGGG TGAATTGGTG CAGGTGGATG AGCTTGGTCT GGAGGCAGAA GCTGAATACA AAGCAGTTTT TCAGCGGGCG AAGGATTCCT TTCAAGGAGC GTTGAACGGA ATTGAACGAA GTATGGAGGA GCAGTCGAGG TTGGTACGTG AAATTTTGAC GGAAAACGAT ATTTTCATGC ACGAACGCGA AAACAGTCGT GCGAAAGGGA GCACTGACCG AAGCATCACG ATGATTGAAG ATGCAGTAGA CGAAGTGGAG CAATTGTCCA CTCATTTGAA GGAGGGGCGG GATTTTTACG ATTCGGTCCT GCCCAAATTG GAAAAGCTTC GCAAACAAGT TGGCGATGTC AGTGCCCGTC TCACAATGGA GCGGTGTGAA TATGAAGACA ACACCCAGCG GAACCGACAA GAAGCCGATG ACGCACGTAT GGCCGCCAAT TTGTCTGATC ACGGTCAAGG TCAACAAACG CAAACCTCTA TACGGTATAT CGACAATGGA AGTGGCTCGT CCCCTAGGCG TCCTATGGAC CGTGTGGCGA CCCCTGGCAT GCATCCAGTA TCCCACGAGC TTCCTCAAGT ACGCGTAGAC GACGAAAAGG TCGCAAGTTT GGTAGCCATG GATTTCGATC CTAACCGAGT CTTTGCAGCT TTGTTACGAT ACGACAACAA CTTTGAGCAA GCTTTGAATG ATCTGTTGTC GGGATAG
|
Protein sequence | MVTVRPTTTP TYRDNGSLGT ILEVPDSPPQ PSKPELRVGV DSVVRVIARM SGFVAKLPLR VPLETDLGGQ LGRWLDQAHV QKWEGRPGMT SADCREDLDR LDQMRRNVYT ACRHGVADAL PHLHVLQEYA AALELCEEQG FPYNSGAVDD SGTTRDDEHD RQHRHKLNKR GQQHHSPLSS ASLGNSSLAN MLEFPWKSSD NQEEVDGTLA WERANVLWNL AIVQAHQAYA VEKTPNNPQS RTAWKQAGLH LQTAASLLRY LQTDLLPAAT ERSFPSHDLS ASFLTLWERF CLADAQYAFY QAVAAAPRPL HALLAKVSAA AIPLYGVCEE LLLDDDDYGL DTSSSASITA NASATGHAAA NQFRSKRLQI WGDAVRAWGM WMSALCEYHQ AQTHADKGER GPAHARLEAA QKFGSLCLDF CNSEEESLLD DLAELVYVTL QDMETQLEQA EQANRLDPVD IPDRNDLPEV PPQTMVEVEK DVSSSLPKLA PPLFTSGPGS VLRRYEQTFR YDMQRLLTNT TLAAEDKTDQ GRRALATVNL PHSVTAYQQE SQGGGIPDAL WERVRVVQDQ DMLRELKQSV WELCDIAERA RSLYQTVQEN LKEDLRVDSL FRSQNSTFEG HNVSQVQKSF HTTLENYDSL LTSAREGDQL VMQRVELLDT DPKYKLLQFR KSQLDRLLPA GDQNVDVSTL SRMLVELSAL FQRRDVSLEE LRNKMEAYDF TGELVQVDEL GLEAEAEYKA VFQRAKDSFQ GALNGIERSM EEQSRLVREI LTENDIFMHE RENSRAKGST DRSITMIEDA VDEVEQLSTH LKEGRDFYDS VLPKLEKLRK QVGDVSARLT MERCEYEDNT QRNRQEADDA RMAANLSDHG QGQQTQTSIR YIDNGSGSSP RRPMDRVATP GMHPVSHELP QVRVDDEKVA SLVAMDFDPN RVFAALLRYD NNFEQALNDL LSG
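The sequences above are shown in blocks of ten characters separated by spaces. A minimal sketch of how such a string might be normalized and its GC content computed follows; it assumes the reported GC content is a simple percentage of G and C over the full gene sequence, which the record itself does not state, and the helper names are hypothetical.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove whitespace from a space-blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = normalize_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage with the first two blocks of the gene sequence shown above.
example = "ATGGTGACGG TGAGACCCAC"
print(len(normalize_sequence(example)), round(gc_percent(example), 1))
```

Passing the complete gene sequence (both lines joined) to `gc_percent` would give a value to compare against the GC content field.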