Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44582 |
Symbol | |
ID | 7198088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 966859 |
End bp | 968680 |
Gene Length | 1822 bp |
Protein Length | 501 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178612 |
Protein GI | 219115633 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.370022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGACGTTT CTATACTTGT GGGTAGGGCC AACTGTACAT GCTGGCCTCT CCATATATCT CAACTTCTTG AGGCTACTTT CCAAAGCCGG CCCACCGTGG CTAGTGAAAC AATTCAGAAT TTTTCGAACC GAAAGGACAG ATCTCATACT TTCGACCTTC AGACTTTTCA ATACGGTACC CTGCCGTCGA CACCCACTTT TATTGTTCTG ATCCATCTGG CATCATGAAT TTGACGGAAT GTGATGAAAG GGGGGCAGAC TGTAAGGATA GACGGGCCTC GGATGCGTCA AAATCTTCTC TTGGAGCAGT ATCAATGGAC GATGAAAATC CGTTAATGAC TCTTGTAGAT GCTGCCGCTT CAATACTAGG AACCAAAGTA GCCCGACGAG AAGTATCTGT TCCAGGCTCT CCTTTGGGGC CGCAAATTAA TGTTTGTAGT GATGCAAAGG CAAGCGGTTT AACTGTAAAA GAAAGCCCCA ACGTCATTCC CGGAACAAAG GAGGATTTGT TTGGCAGTTC GTCAATAAAG CTGACATTCG CAGAACAGCT CTTTGATATT TTGCAGAACG AAGAAAACCA TGATGTTCTC CAATGGATGC CTGATGGATG CTCTTTTATA ATCGTGAACC ATAAGAAGTT TATTCTCGAC AAAATGCCTA AGTTGTTCAA CATTCGCAAC ATGTCATCGT TTGTGAGAAA ATTGGGACGG TGGGGGTTCA GCCGCGTTCA TGAGAAAGCG ACGAGAAATT CCGATATTTT TAGACACCCC TTTTTTGTCA GGGAAATGCG CGAGGAGTGT CGGAAGAAGG TTAAATGTAT CGGCCGGATT CCGTCCTCTT CAAATTCAAA GCCTTCGGTT GGTCAGGTGA ACGGTGTTCC CTACAAGCAA CATTTATATT CTGTTCTACA CGATAGGCAT TTAGATGATG TCTGCCCACG GTATGGAGAC AGATCCGATG TGCCTCGTTC CACGTCTCAA CCTATGTCGA ATCTAAGCGA AGGGTCACGT TCGTCCTTGT ACCGTGACGA TCTGTTGCCA TCACAAGGAG TCCATCTTAC CCAAAGTTCC CCAAATAGAG TGACTTTTCT TGACGAGCAT CTGCATCGTG TACTTCCAGA GCTACCCTTT TCCAACAAAT CTTTGTTGCC TTCCGGATTA CCGGCAAACC TTCCTTTGTT CAACAAATCC AAGAACTCTG ATCTTCAGTA CCGAGGCAAG GAAGGTGTGT TCGTAAAGGC ATCAGCTACC TCTGAGACCT CTGCAGCGGC CCTGTTCTCA CAGTATGAGA AACAGCTTCA AGAACACCAA ATCAAACGAT CCTCCTTAGC GAGCCAGCTC TCGCAGCAAT CAGCTTACGA AGAAGCGAGG TGGTTGTCTG AGCTTGATCA TCAGCTTGCT GAACAGCAAG TTGCTCTAGA GCAGCGGAGA GTGGCTTTGG AACAACAAAG AATAATAAAG CAACGGCAGG TCCTAATGGA GCAAAGACAA GCAATGGAAA AGCGCTTTGG TGGCCAGGTT TTCGATCCGA GATACTCTCA AGCCACGAAA GGTACAGGGT CACTACAGGA GGAGTCTAGA GGGAACGACA ACCTGACATC ATCAATAGCT GACAATCTTC AACGTGGCAG AGATGGGGTT TGTTTTACAC CGAACATGAG TAGAAAGGAA GCCATCCGCG CTCTTCTTTG GGAAGAGCGC GAACTAGGGT TCACTGGAGG TCGAAGATGA AAGGCCTTTA GTCAGACCCG AATTCATAGA GTTTGTTTAT ACTTGAATGT AATACAGCTA CCAGTAAAAG GTAATTTAAC TAAGAGGGGC GC
|
Protein sequence | MNLTECDERG ADCKDRRASD ASKSSLGAVS MDDENPLMTL VDAAASILGT KVARREVSVP GSPLGPQINV CSDAKASGLT VKESPNVIPG TKEDLFGSSS IKLTFAEQLF DILQNEENHD VLQWMPDGCS FIIVNHKKFI LDKMPKLFNI RNMSSFVRKL GRWGFSRVHE KATRNSDIFR HPFFVREMRE ECRKKVKCIG RIPSSSNSKP SVGQVNGVPY KQHLYSVLHD RHLDDVCPRY GDRSDVPRST SQPMSNLSEG SRSSLYRDDL LPSQGVHLTQ SSPNRVTFLD EHLHRVLPEL PFSNKSLLPS GLPANLPLFN KSKNSDLQYR GKEGVFVKAS ATSETSAAAL FSQYEKQLQE HQIKRSSLAS QLSQQSAYEE ARWLSELDHQ LAEQQVALEQ RRVALEQQRI IKQRQVLMEQ RQAMEKRFGG QVFDPRYSQA TKGTGSLQEE SRGNDNLTSS IADNLQRGRD GVCFTPNMSR KEAIRALLWE ERELGFTGGR R
|
| |