Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_10354 |
Symbol | |
ID | 7204086 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1151227 |
End bp | 1153245 |
Gene Length | 2019 bp |
Protein Length | 653 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186193 |
Protein GI | 219113219 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGACATGG GCACTTGGTC CACGTACGCT CCCACCAACA CTGAACAGCA GCGAAGATTA CACCGGACTG TGTTGTTACT TCTCGTCCGG AGCCGCCTTT TGTCTGTACA CGATCTTGAT TCTTTTCTCG CCGCAAGGGC TGACAATGGT AGCAATCATA TATGGCTTGA ATTCTCCTTG CTCTTTATTC GTACTGCTTT CATGGAAAAA ATAGCTACAA CGGCTGACTT TCCGAAGCTA CTTGATTTGA TGAGTCAAGT CGCTGAGGGC AGGAGCGATG CAAGTGCTCA AATTCCGCAA TCTTTTCGAA AGCCAATCTT GCTGATGCTA GAGGAAGCTC GTGTTCCAGC TCTCCAGATC CACGCACCAG TTCCTGCGAG TGCGAGCAAG TCGGTAGAAA CGCAACGACT TGAAAGTAGC AGCAGTCTTT CGATTGTTTC TCTTGACACT TATTCGAAGG CATCGAAAAG AGTTGCTGAA GCAGTTGCGT CTTTCTTCCG AAACGATCCA ACAACTGCAA AGCAGCAAGT TACTGCCTTG CTAGAAGGGT GGATCCGTCT GCAGTCTGAG CCCTCTCTGA ACGAGAAAGC TCTCGCTCAA TACATGATTA TTTTGCAGCG CTTCGGGATG GGGAAAAACG AGGAGCAGAC GGAAAGGTTC CTCCGAAATT CTGTCATCAT TGTTGTCGAT GCTGCTCTCA AATCCAGTAC TCAGCGTGGA GACGGGAAGA AGCATATCAA TTACGTATTC ATAGACCACT TCGCCAAGCT TCTCGGTATT CTTGTTCGTC ACATGAACGC AGGAGGCTCA GCCGACCAAG TAAATGCTCA GCGCCTGGGA GTTCTCAACA AGATTCTTGG GACAATCGTC AGGTCGATGA TGTGGCACTA TGAAAGTAGT ATGGAAGGGT CTGCAAACCC GTGGGACCAG CGTCCGTGGT TTCGTCTCTT GCTCAATCTT GTAATTGATC TAAACAAGCC GGATCCTGTA TTTGAGGCTG TTCGTCTCGG CATTTTGAGT GTTTTCGGAG CCGCCTTTCA CGTATGCCAA CCAATGATAT TTCCAGCTTT CGCATTCTCA TGGCTCGAGC TTGTGTCGCA TAGGCACTTC CTTCCTAACC TGATTTTGTT CTCCGACGAA AAGGGATGGA ATGTCGCTCA TCAATTACTG ATTGACCAGC TTCTTTTCTT GGAACCTTCG CTAAGGCGAG TTGAACTCAC TGTACCCGTC AAAAAGCTAT ATGAAGGGAC CTTGCGTGTT CTGCTTGTTT TACTTCACGA TTTCCCAACT TTTCTCGCCG GTTTTCATCT CAGCTTCTGT AATGTTATTC CCGAAAGCTG CGTACAGCTT CGTAACATTA TTCTGTCTGC CACTCCGAAG GCCATGAATC CTCCGGACCC GTTCACTCCA AATCTCAAAA TCGATTTGCT TCCCGAGATC TCTCAAAGTC CTACAATTTT GTCGAATATT CTTAGTCCAA TCGCTTCCTT TCGTGGGCAT CTCGATGCGT TCCTAAAGGA TGGACAGCGT CGGAACTTTC TCTTGGAGCT TCTTCCTTTG CTTCATCGTG ATGGTGGAGC TGAAATTGAC GTGCCGAAAG TCAATTCTTT GGTTGTTTAT GTTGGAGCGC ATGCACTGGC GAGGCTTCAA AACTCACAGA TTTCTCTGAC GCGTACTCCG GAGATGGAGG TAATCCAGAA ACTAATGGAA CTAGAAGATC GGGGGCGCTA CGTTTGCCTG AATGCAATCG TGAACCAGCT AAGATATCCG TCGAGTCACA CGCACTACTT TTCCTGCGTC GTGCTCTATC TCTTCAGCGA GTTCAAGAGC GTCGCTGTGA AGGAGCAAGT AACAAGGGTA CTGCTCGAAC GCCTTATTGT AAATCGCCCT CACCCATGGG GTTTGTTGAT TACGTTCATT GAACTCGTCA AAAATCAACG CTATGGCTTT TGGAATTATC CTTTTACACG CTGTGCCACT GAGATTGAGA AAGTCTTTGA ATCAGTTGCT CGTTCCTGC
|
Protein sequence | KDMGTWSTYA PTNTEQQRRL HRTVLLLLVR SRLLSVHDLD SFLAARADNG SNHIWLEFSL LFIRTAFMEK IATTADFPKL LDLMSQVAEG RSDASAQIPQ SFRKPILLML EEARVPALQI HAPVPASASK SVETQRLESS SIASFFRNDP TTAKQQVTAL LEGWIRLQSE PSLNEKALAQ YMIILQRFGM GKNEEQTERF LRNSVIIVVD AALKSSTQRG DGKKHINYVF IDHFAKLLGI LVRHMNAGGS ADQVNAQRLG VLNKILGTIV RSMMWHYESS MEGSANPWDQ RPWFRLLLNL VIDLNKPDPV FEAVRLGILS VFGAAFHVCQ PMIFPAFAFS WLELVSHRHF LPNLILFSDE KGWNVAHQLL IDQLLFLEPS LRRVELTVPV KKLYEGTLRV LLVLLHDFPT FLAGFHLSFC NVIPESCVQL RNIILSATPK AMNPPDPFTP NLKIDLLPEI SQSPTILSNI LSPIASFRGH LDAFLKDGQR RNFLLELLPL LHRDGGAEID VPKVNSLVVY VGAHALARLQ NSQISLTRTP EMEVIQKLME LEDRGRYVCL NAIVNQLRYP SSHTHYFSCV VLYLFSEFKS VAVKEQVTRV LLERLIVNRP HPWGLLITFI ELVKNQRYGF WNYPFTRCAT EIEKVFESVA RSC
|
| |