Gene Information |
Locus tag | PHATRDRAFT_46239 |
Symbol | |
ID | 7201197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 689896 |
End bp | 692319 |
Gene Length | 2424 bp |
Protein Length | 778 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180487 |
Protein GI | 219119454 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.514117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGGT TGGCCTCCCG TAATCGAGGC TCATCGTTCC AAGGTCAGCT TGAGGCCGTT ATGGAGGTGG ACCTTCCTTG CACAAACATG AATACGGGAA GCAAATCTCC ACCTACCTCT TCTTTCCGAA TCAATCCCGT GACCCCTTCG TCGAGAACGG TGGTCGCTGA GAAGAGTGCC ATTTTGAGAA GACTACAAGG TCTTCCCGAA TCTGTTCAGG ACGATAACGA CACGGTAATG TCTTCCGCAA CAACTTCTTC CAGGAAGTCG TGGTGGGGAC GACGCTCCGG AGCAGGAGTT GTACGTGGCG GTAACGCGAG TTCTGCTGCT AGCGTTAACG CCAGTATGCG AAACACGAAG AAATCCAAGC CTGAAAAGCA AAAAGTGGAA GCAGGCAAAG GGTTTCGAAG TCGGTTCAGG CTGCAAGGAG CTCATAGTTC CCGCGACCCT GCATCGAGTA CCAAGACCAA CAAAAAGAAG GCGCCTCAAG TAGACAGCGA TCAACTTGTC ATGGACTTCT TCGATGTACA AAGTATTGTA TCGGAACCGC ACTTTGCACC GTTCGTAAAG TCGGCACGGA CGCGGGATCC AACCGAAACC CATTCGGTCA CATCGCAACG ATCAATTGGG AGCTTTTTCA TGCGCTTCCG CGAACGAAAT AATCGTGAAG AAGCTATATT GGCCACAGAA ATTACGGCTC ATCACGATGA AGAGGACGAA GTCGATGATG ATATTACCCT CGCTTCCGGG CTGCTGTCAA GTAGCGGCTC ATATGGTTTT ATACGCGATG AGGACTATTC GGATGGAGAC TTTTCGGCAT TTGTCCAACC CGAACCAAAA CCCGGACAGC ACGACTCGTC TTTGCCAAAA GACCCGAAAG CTTCCCGTCT AGGGCATTTA TTTGGTCGTA AGAAGAGATT CCGTAGGCCA CGTCGAGGCT CGGGAGATAG CAGCGCTTCC TCTGTGACTG ATGGCAACGG TAGTCTTGCT AATGGAGCAG TTGCCACAAC TTCCACCAAC GCTCTGCATT CTACAAAGGG TCGTAACAGT ATGCACAGCG GCGGTACCGA TTCTCCTACT GAATCAGAGG ATCTTGAAAT TGAGGAAGAA GAGTTGGAAA AGCTATTGGA GATCTCGAAT CACATCGCGA GCAATTCTAA CCGTGAAAAT GTGGCTCGTC CTATTCCCGT TTCCGAAAAG AGACTTACTG CATTCCTGGC AGCGGCAGAA GCTGCTAGCA CCCGACCGTC CGTCTCCAAT ACCATTGCAA ACTACGAAAT CGCTCCTCCG CCGGTCCTCA GCAACAAGCA AGAAAGCAAG CAGGCTAAGG ATGGTATGAA ATTTGCCAAA AGTGCACGAA AAGCAGCAAA GGAAGCTCTT AGCGCAGCAA AAGAAGCAAA GGCGGCCAAA GAAGCGGTGA ATTTCGCAAA CGATGATGTC TTCATGCCAA TCATTGAAGA AATCGAGGAA GACGCGGAGG ATATCTCCTC CCAGATTCTT TGCAAGCAAA CACCAGATCG GTGGTCTCCC TCGCCAACAC CTGATGTCAA TTCTGGATCT TATGCACCAA AGCGACCACA TCGACGAATA CCCGAGGGGG ATGTTGGTGC AAACAAGAAT AAGGATTGGC GCGCTCAAGT GTTCAGCCAT GTCACTTGTT CCAACGACGT TCACTGTGAT GGAGTAGAAG AAGAGAAAGC AACTGAGTCT TGCTCCTTCA TGCCGGACGA GGACGACGAC AGTGAGTCGG TATTTTCGCT GCTTAGTGGT CTCATGGCAA CCTGGGTTGC CGAGCAAGCT 
CAGGAAGAGT TCAAGCAAGA TGGGATCGTT CCATTCCTGC AATTGCGAAG TTGCCTCAAG CAAGGCGGAC CTATTGACAA CGCCCGCCTG TGCTACCGAG TCAGTTTTGC CAAGATTGAG ATTCGGGAGT ACGAACGCAC GGTTGGCGAT AATCCTGCCT GTGGCTCCGG TCCTCCTATC ACCATTGGAT GGGGTTACGT TCCCGGGGTA GAAGCCAACA TTGAAGAATA CGAAGCCACG AGAGTGCCAA GGACCAAGAA GCAATACTAT TTGCCACCCG CCAAACGTAT ACACTTGCTT ACTCAAGAAT GGCAATGTAC CGAAGAGCAA ATTCGAAAAG CCCGACGAGA GGCAACGTAC ATCCAATATT GCCGTGAGAA GACAGCCTTT TCGAAAGCTG ACAAGGAAGC TGCCTTTTTG CGCAAGGCAC AGCGACGGCA ACCAATTACC AACAACGCTA GCTGGCCTAC ATCAGATACC AAGCGAGCAG TGTCGGCACC ACAGTCACCA GTGCTGCCGG GCATGAGCCT GGTTTAGATG AAGAAAGTTT AGTTGGTAGT AGACGGGTCT AATTGCTGGT TGCACAACCA TTTTTGACGC TGCCCCAAAC ATTAGTTGTA CTAC
|
Protein sequence | MNRLASRNRG SSFQGQLEAV MEVDLPCTNM NTGSKSPPTS SFRINPVTPS SRTVVAEKSA ILRRLQGLPE SVQDDNDTVM SSATTSSRKS WWGRRSGAGV VRGGNASSAA SVNASMRNTK KSKPEKQKVE AGKGFRSRFR LQGAHSSRDP ASSTKTNKKK APQVDSDQLV MDFFDVQSIV SEPHFAPFVK SARTRDPTET HSVTSQRSIG SFFMRFRERN NREEAILATE ITAHHDEEDE VDDDITLASG LLSSSGSYGF IRDEDYSDGD FSAFVQPEPK PGQHDSSLPK DPKASRLGHL FGRKKRFRRP RRGSGDSSAS SVTDGNGSLA NGAVATTSTN ALHSTKGRNS MHSGGTDSPT ESEDLEIEEE ELEKLLEISN HIASNSNREN VARPIPVSEK RLTAFLAAAE AASTRPSVSN TIANYEIAPP PVLSNKQESK QAKDGMKFAK SARKAAKEAL SAAKEAKAAK EAVNFANDDV FMPIIEEIEE DAEDISSQIL CKQTPDRWSP SPTPDVNSGS YAPKRPHRRI PEGDVGANKN KDWRAQVFSH VTCSNDVHCD GVEEEKATES CSFMPDEDDD SESVFSLLSG LMATWVAEQA QEEFKQDGIV PFLQLRSCLK QGGPIDNARL CYRVSFAKIE IREYERTVGD NPACGSGPPI TIGWGYVPGV EANIEEYEAT RVPRTKKQYY LPPAKRIHLL TQEWQCTEEQ IRKARREATY IQYCREKTAF SKADKEAAFL RKAQRRQPIT NNASWPTSDT KRAVSAPQSP VLPGMSLV