Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_11956 |
Symbol | UNC18 |
ID | 7200411 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 822280 |
End bp | 824204 |
Gene Length | 1925 bp |
Protein Length | 588 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179924 |
Protein GI | 219118292 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTTGGAAGG TCCTTATCCT GGACAATCAC GCTATGCGCG TGATATCTGC CGCGGTAGGC ATGTACGATA TCATGGAAAA ACGCATCACG TTGGTGGAGT CCCTCGACAA AAAGAGGGCA CCATTTCCGG ACATGGGCGC CATTTACTTT CTGGATCCGA ACGCGGACAG CGTCGCCAAG TTGGTAGCCG ACTGGAGCGA CCCTAGCAAC AAGAGATTGT ACGGCAGCGC CGTCTTTCTG TACTTTTTGG GGCGCCTTCC CGATAATCTT TTAGCACAGA TCAAGATGTG CCGTCCGCTT TTGAAGCGTG TCAAGGGACT GATGGAGATT AACGTGGACT TTTTGGCGGT TGAAGAGCGC GCCTTTACGT TCGATATGCG GCATGCCTTT CCCTCATTTT ACCTACGTCG TGGCAATACC CCGATCGAAC TTGACATCGC CGAGAAGCTG GTTACGGTAT GTGCGACGCT CAACGAGTAT CCACACATTC GGTACAAGCA GTCGTCGGGT ATCTGCACCA GTCTTGCGTC CGTCTTTCAT CTTAAAATGG ATGAGTATGT TTCACAGAAT CCGTCCTGGT GGTATCACGG CGGACCAGTC AAGAACCAGG CTGCCAATCG CGAGCGTGGC ACGCTCCTGT TGTTGGACCG AGCTGACGAT TGTTTGACAC CTTTGATGCA CGACTTTATT TATCAATCAA TGGTGCAAGA CTTGCTCAAA ATGGATGGTG ATCGCATCAC GTTCCAGGCA GAAACTAAGA ACGATCCTTC TCGGACGGAG GCCAAGGATG TTTTGTTGGA CGATCGTGAT TCTCTCTGGG TCGAGCTTCG CGGAAAGCAT ATTGCCTCAG TTATTGAGAC GCTTTCTGGA CGCATTCGTG AAATCATGAA TTCTTCCACG GGTAGCGCGT TTGGAGGCAA AAAACAACAG CAGGGTAACT TGAGTATTTC TCAAATGGCG GCTGCCCTAA AGGCCTTGCC AGAGTATCGA GAAGTTATGT CAAAACTGTC TCAACACATG CATATTTCCC ATGAGTGTAT GGAGGTTTTC AAACACAATG GCCTTTACAA CCTTAGCGAG CTGGAGCAAA CCTTGGCAAC CGGGAAGGAT GAGGATGGAC GCACTCCCAA ATTGTCGGAC ATCATGGAGC GGGTGGAACA GGAGCTGCTG AAAATGCGCG ACCCTAAAGC TCGGTTGCGA TTAATTCTCA TTGCTACTGT TAGTCAAGGA GGTCTTCGCC AGCAGGATCG TCGACGCCTC ATGGGCGCTG CAGAGCTTTC CCGCAAACAG ATTCGGACTT TGAATAGTTT GGAGATCCTG GGTCTGTCTA TTTTTGCTTC GACTGAAAAG AACAGGTTGG TGTCCATGCT AGCAGGGTAC GTCAAGAATA TATTTAAAGT TCGTGGCCGT GCATTTTTCT ACGTACTCTT ACAACTGTTT GTCTACATAG AGGGCGTTTG TCATCTGGCA GTACCTCCGA TGACGAATCC GAATATGCTG CCAGTCGCTA CGTCCCACCA CTGAAGCACA TTCTTTCCGA GTTAGTAAAC AATCGACTAA GCTTCGAAGA TTATCCGAGT ATTCTTCCCA TGCCAGAGAG CAACCCCTCG CTGTCCACTC CGAGCGCCCG TGGCAGCGTG AGATCTTCTC GTAGTAGTGG TGGCAGTAGC GCCCGAAAAT CGGCCGGAGC CTCGTCACGA TGGGGGAAAA CCAGTACCAC AGAATCTCGT CCTTCTGGGG CCATCAATCT TTCCGGAGGT CGGTCGATTG CGTTCATGAT GGGAGGAATG TCCTTTTCGG AATTGAGAAT TGCTCGAGAC GTTATGGAAA AGGAAATGCG TGAAGTGATT GTTGGATCGA CGGCTTTTAT CTCTGCAAAA GATTTTGTTG ATGATTTGGC ACTTTTGGGT AGAGATGAGG AGTAA
|
Protein sequence | GWKVLILDNH AMRVISAAVG MYDIMEKRIT LVESLDKKRA PFPDMGAIYF LDPNADSVAK LVADWSDPSN KRLYGSAVFL YFLGRLPDNL LAQIKMCRPL LKRVKGLMEI NVDFLAVEER AFTFDMRHAF PSFYLRRGNT PIELDIAEKL VTVCATLNEY PHIRYKQSSG ICTSLASVFH LKMDEYVSQN PSWWYHGGPV KNQAANRERG TLLLLDRADD CLTPLMHDFI YQSMVQDLLK MDGDRITFQA ETKNDPSRTE AKDVLLDDRD SLWVELRGKH IASVIETLSG RIREIMNSST GSAFGGKKQQ QGNLSISQMA AALKALPEYR EVMSKLSQHM HISHECMEVF KHNGLYNLSE LEQTLATGKD EDGRTPKLSD IMERVEQELL KMRDPKARLR LILIATVSQG GLRQQDRRRL MGAAELSRKQ IRTLNSLEIL GLSIFASTEK NRLVGRLSSG STSDDESEYA ASRYVPPLKH ILSELVNNRL SFEDYPSILP MPESNPSLST PSARGSVRSS RKSRPSGAIN LSGGRSIAFM MGGMSFSELR IARDVMEKEM REVIVGSTAF ISAKDFVDDL ALLGRDEE
|
| |