Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47152 |
Symbol | |
ID | 7201941 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 629230 |
End bp | 630793 |
Gene Length | 1564 bp |
Protein Length | 437 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181238 |
Protein GI | 219121781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCCG TTTGGGGTGT CGATCTTCCT GGCGGTGCGC GGCTCCGTCC GAGTCAGCCC CACTACGCGC AGGAAACGGA GAAGGCGCGG CGGCGTGACC GGTACGGTAC ACGTGCCGGT GTTCGTTTGC GCCAACGGCT ATAAAAAAAA GACATTCACT CGGCACCACA CCGCTGGCCG TCCACAATCC CGCAGACCAA CGTGTTTCCA CACATCCCTT GCGCAAGTTT TGTACCATGG TCGAAACGAA CGGAACAAAG CCCACGGCGC CGGTGGAAGA TTACACGCCG GAGAATATCC TTGTAACGGG AGGAGCAGGT GAGTAACGGG CGGGTATCAA TCAGCTTACT GCAATGCAAA ACGAACGAAC CCAGCAACGC CCCCGTATGA CCAGTAAGGC GACGACACGG GTACGGAGTG TATCCCTCGA CTGTATTCCT GGCACCACGG TTTCGCACAC ATTTCTCCCT TTTCACGCGC CTCACGTCAC TGTACGAACC TCTTCTTCTC TTGTTTAGGT TTCATTGCCT CGCACGTGGC GATTCTCCTT TGCAAAAAGT ACCCGCAATA CAAGATTGTC GTCTATGATT GCCTGGACTA CTGCGCCTGT CTCGCCAACT TGCAAGAGCT CTTCGACTTG CCCAACTTCA AATTCGTCAA GGGAGACATT GCCTCGCCTG ATCTCGTCAG TTACGTCCTC CGCGAAGAAA AGATCGACAC CATTCTGCAC TTTGCGGCGC AGACGCACGT CGACAACTCC TTCGGAAATT CCTTCGCCTT TACGCAGACC AACATTTACG GAACGCACGT CCTGCTCGAG TCCGCCAAGT GCTGTGACAC CCTCCGTCGC TTCGTGCACG TCTCCACCGA CGAGGTCTAC GGAGAAGGAG AAGACTTTGA AACGGACCCC ATGTCGGAAG AGCACGTCCT CGAACCGACC AATCCCTACG CCGCCACCAA GGCCGGCGCC GAATTCCTCG TCAAGAGCTA CTTTCGTTCC TTCCAATTGC CCTGCTTGAT CACCCGCGGT AACAACGTTT ACGGACCTCA CCAGTTCCCC GAAAAACTCA TTCCCAAGTT CACCAACCAG TTGCTCAAGA ATCTGCCCCT CACCATTCAC GGTGACGGGT CCAACACACG CAACTTTTTG TACGTGACGG ATGTCGCCAA CGCGTTCGAC ATCATCATGC ACAAGGGAAC ACCGGGGCAC GTATACAACA TTGGGGGGAA GAATGAAGTG CCCAACCTGG AAGTGGCCCG TGCCTTGCTC AAGCTCTTTG ACAAAGAAAA GGAGGAAGAT ACGCTCATTA AGTTCGTCCC GGACCGACGA TTCAACGATC TACGGTACAC CATTAATTCC AACAAGTTGC ACGAGCTCGG GTGGACGGAG CTCATGAGTT GGGAAGAAGG CCTCGCCACT ACGGTCGATT GGTACAAAAA GTATACCTCC CGTTACGGCA ACATTGACGC GGCCCTCGTG GCGCATCCGC GCATGCTCAA CACCAACAAG GAGGACTTGG ACGAATCTAC CCAAAAGGTC ATTATGAAGC AGCACAAGAA CTAA
|
Protein sequence | MSSVWGVDLP GGARLRPSQP HYAQETEKAR RRDRHSLGTT PLAVHNPADQ RVSTHPLRKF CTMVETNGTK PTAPVEDYTP ENILVTGGAG FIASHVAILL CKKYPQYKIV VYDCLDYCAC LANLQELFDL PNFKFVKGDI ASPDLVSYVL REEKIDTILH FAAQTHVDNS FGNSFAFTQT NIYGTHVLLE SAKCCDTLRR FVHVSTDEVY GEGEDFETDP MSEEHVLEPT NPYAATKAGA EFLVKSYFRS FQLPCLITRG NNVYGPHQFP EKLIPKFTNQ LLKNLPLTIH GDGSNTRNFL YVTDVANAFD IIMHKGTPGH VYNIGGKNEV PNLEVARALL KLFDKEKEED TLIKFVPDRR FNDLRYTINS NKLHELGWTE LMSWEEGLAT TVDWYKKYTS RYGNIDAALV AHPRMLNTNK EDLDESTQKV IMKQHKN
|
| |