Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39465 |
Symbol | |
ID | 7195166 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 586180 |
End bp | 588033 |
Gene Length | 1854 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183384 |
Protein GI | 219126270 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAAG GAGGTAAACC ACAACAGGGT AACTCAAATA CGGTATATTT GTGCAATAAG GCCGACACAA ACTCCATATG GGACGCGTAA CCGCTGCGTC CATCGGATTG GCTCACACTT ACACTTTCCT TACGAAGTGG GAGCTGTCCG AGGGACATGC CGCAAGGTAG CGTACCGTAA CACCTAGCTA CCACACTGAG ACAGCAGCAA CGCAAACAAA ATGCTTCCTG TAAAGGATCG ACGGCACACA TTTGGGGAAC GGACGTGTAC GTTCCTACGG CGGCAGCGGC CACTACCAGC GTTCGTCGTA GCTTTCAATC ATTACAGATC TCCCATCACG TGAAGGGGAA AGCTGCGAGG GTAGAGGCCG AAGACAATAT TGTCGGGATA AAGTGGACGT CGACGGGGAC GATCCGCCAC CGTTGTAGCG TGCGCGACTC GAGTCGCTCC TCACTCGCGG TGTCACCCAG GGATCAACAA CTACCAGTCT CAATATTGAT GCTATGCATC TGCACTATCA CACCGCAGGC TGTCAGCGTA TCTACATCAG ATTTTGGCCT GTCCAATGGA GCTGGTCCCC CTCATGGACT TGTGCGTCCA GTGCGAAAAG AAATGGAAAG ACAGGGACTA CTGACTGTGA GTACTTTTTC GTCTCCCAAG TCACGGTCGT TTCAAAATCT AAGTGTAAAT CCTTCGCTAA ATCCTTCACT AAATCCTTCA CTAATTTAGT GAAGGATTCA GAAACCTTCA CTAATTTAGT GAAGGATTTG GAAACCTTCA CTAATTTAGT GAAGGATTTG GAAACCTTCA CTAATTTAGT GAAGGATTTA GAAACCTTCA CTAATTTAGT GAAGGATTCA GAATCCTTCA CTAAATTTAA CAAGAATTGA AAACCCTATT CAAAAGTTCA GCCAGTCCAA GCTTTCAAAC GGCACTTAGA AAGAGCTGTC TGGCGCGCAA GAAAGAACAC TCTTTATATA CGAAATGTCT TCTCTGTTAT TTCGATTAAA TTAGTCAAAT GAGGCACGCC CCAGGAGTTG GATCAACAGG ATTACCTACA GAGATTTTAC GGCCACATCG AGCTATTGGA TCGGAAACGA TTGTCCAGTT TGCTTGCCAA TCCCGGGCCG AAAACCGAGC ACCTCCACTC TTCCAATATG CGCCGCAAAA GAGGATGGGA ATGGGCCGAC ACTAAACTCG TTGCCGACCT ACCTTATCTC AAGGCTATTT CTGGTTCTAT CGATCCCACT AACTATCATC AGGTCATATA GCTGGCTTAC AATTCAGAAA GCCTCAAAAA CCGAAGTTGT GTTGCGAAAT GGTTTGATTC GTTTACGGCG GCTTTGCTGA CGTTGGGATA TCGGCAATTG GTCTAGAACG GTACCGTTGT AAGGGTTCTA GATGAGGTGC AGGGCTCCGA GCTGATCGAA GTCGTACAAG CGCTACTGGC ACAAGTGTGC ATCCAATTCA AGCAACAAGA CGCGACAGAA TGCAGTGCCG GAGTGCAATG GGCCGATTGT AATACTCAGG CATGGGGATG GTTTCTTTGC CGACCGTCAA CAATTCTCTG GAAGTCAATG GACCAGTCGG CCTGTAGATG GTAGAGGCGC TCTGGGCAAC ACAAGGGGCA TAGAATCCTT TGGCAGTCTT TCGCAATTTC CTGATTACCA TAGCTTGGGC CGTGGCTTGT ATATTTCCGT TGTGATACTT ATGCCACCTT TTCGTACGGC TCGGGCAGCG ATCTCACAAA GTATCGGACC GTCAAGTTTA CGCTCGGTGG CTGCGACCAC CGCCCTGGAC GGTAGCAATG CGGTCGTAGT GGAATCGCAT AAGGCGATAA GCACAAAACT ATGA
|
Protein sequence | MSQGGKPQQG NSNTQQRKQN ASCKGSTAHI WGTDVYVPTA AAATTSVRRS FQSLQISHHV KGKAARVEAE DNIVGIKWTS TGTIRHRCSV RDSSRSSLAV SPRDQQLPVS ILMLCICTIT PQAVSVSTSD FGLSNGAGPP HGLVRPVRKE MERQGLLTEL DQQDYLQRFY GHIELLDRKR LSSLLANPGP KTEHLHSSNM RRKRGWEWAD TKLVADLPYL KAISGSIDPT NYHQNGTVVR VLDEVQGSEL IEVVQALLAQ VCIQFKQQDA TECSAGVQWA DYGRGALGNT RGIESFGSLS QFPDYHSLGR GLYISVVILM PPFRTARAAI SQSIGPSSLR SVAATTALDG SNAVVVESHK AISTKL
|
| |