Gene Information |
Locus tag | PHATRDRAFT_35101 |
Symbol | |
ID | 7200525 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 60881 |
End bp | 63004 |
Gene Length | 2124 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179569 |
Protein GI | 219117552 |
COG category | |
COG ID | |
TIGRFAM ID | |
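The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed Gene Length) and that the Protein Length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein of this length, plus one stop codon (assumption)."""
    return 3 * protein_aa + 3


span = gene_length(60881, 63004)   # genomic span from Start bp / End bp
coding = coding_length(570)        # bases implied by Protein Length
print(span)           # 2124
print(coding)         # 1713
print(span - coding)  # 411
```

Under these assumptions the genomic span exceeds the coding length by 411 bp, which is consistent with the record marking the gene as spliced; the record itself does not give intron coordinates.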
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCCG CTCTACAGAA TCTGACCATC TTTTACCGTC ACTCCCTACC GAAATTATCC CTATCCAGTC CTCCCGTTTG GGTACTATGT GCTTGCGGAT GACAATCCAG TGACTGGGGC AATTCGAAGC GAATCGAAAG AGTATATTGG TTGTCCCTGG TGGGTTTTTG ATTTCCTCGT TGAATCGGTG GTGTGCCTTC TATTGGCGGT GACTGCGACG GCAGTCTTCA GAAGCGCTGC TATGAATCGC GTCGCAACCC ATCTCTCACC TCGCGTACAT TCCGTGGGAC GTGCAGTAGC TCCAAAAGCG TGGTGTCCCC CCACAGTTTT ATGGAGTGGA CGAGTGGTTA CTGCTACGTC ACCACCATTC GGAAACGGTC AGGAATCAAT CATCATGGAA CAGCCTGCCG CCGTTGTCGT CAATTCAGAA GGCTTGATTG CAGACATTCT CACAAACGTT ACAGAGGCGG AAGCTTGCCG ACTTTCTCAG TCTGAGAACT GCGAATTTGT CAATCTGGGT CCATCCATGG TTTTGTCACC CGGTATGATC GACGTCCATT GTCATATTAG CGAACTCGGT CGGGATTGGG AAGGATATCA CACGGCCACC CGAGCAGCTG CTGCCGGAGG AATCACCACT CTAGTGGGAA TGCCTTTGAA TTCTATACCA TCAACTACGA CTGTGGATGC TTTGGAGCAG GAAATTGAAG CTGCACTCGA TACGACTCTA CTGGCGGACG TTGCGTACTG GGGTGGCGTG TACCCAGCAA TCTGAAAAAT GATGCGTTCG AGCTCAAGGC CCTCTTGAGC GCAGGAGTCT TCGGGCTCAA GGCCTTTTTG TCTCCCCTTC CACCGAATGC TGGCTACGAG TCGGTATCGC CTGCGCAGCT TGCTGAAGCG GCCAAGATAT GCGGCACATA CGATCGACCT ATTTTGGTAC ACGCGGAGCT CATGTCCCTG GACGTTTGCC AAGAACGTTT AGAGGATGCG TACCGTGGGC AATCTCTAAA ATCCTATAAA GCACATGTTC AGTCGCGTCC ACCCCAGTGG GAACAAAATG CCGTGCAGGT TGTTTGTGAT CAGACCATGC ATTGCAAAAT GCATATTGTC CATTTGAGCG AAGCATCTTG TCTGGAAATC ATTGCTGCAA CGAAGAAGCA CCTTGAGAAA TTCGCAAAGG AGCAAAATAT TTCTGTGGAA ACTTGCTCGC ATTACCTACT GTTTGACTGT GACAGCATTC CTGACGGTGA AACGCGTCTC AAGTGTTTTC CACCAATTCG AAACAAAGCC AATCGCGAGT TGTTGTGGAA CACTGGCATT CGTGGCGGCT TGATATCCAT GGTGACATCC GACCATAGTC CATGTCCGTC GAAGATGCGC AACCTGGACA CATTGAACGT AAAAGATGCC TGGGCTGGGC TGACTTCTCT ACAATATCAG TTGCCGGCTA CTTGGACTGC CTCTCAAATG CGAGGGCTCG ACTATTCCTT GGTCGACATG GCACGGTGGT GGTCGCTTAG CCCTTCCACA TTGCCGACGG GAATGAGCGA TATAAAGGGA AAAATTGCGA TCGGTCATCA AGCGGACTTC GTGGCTTGGG ACCCCCAGTA CACTGGGCCG CCCAACGGCA ACAGCACAGA ATACCATCGA TGGAAGGATA GCTACTTGGG CACAATGTCT TTGCGGGGTA GAGTCATTGG TACGTGGCTT AAAGGACAAA AAGTCTACGA TGGCTTTGCA GACCGGTTCA TTTCCTGTTC TACGGATAAT TCACTGGGCA GAGTGTTGCT AGCCTCGATG 
AATCCCAGTC CCGACTGAGT CGATATTCGT AAGAACCATG CGTACTCCGT AAAATGGAGG CTTTCTGTAT TGCTTAGAAT ATGAAATGAA TGTTTTACAA AAACTTTTGT TTCCCAATTG GTTGATGTCA TTTTTTGTGG ACAGAGAGGA CGCAAGCAAC CAAGGTTCTC TTGCGTATGG AACGTGTTCT TTTGCAGACG GAACTTGCCT CTTCGTAGTA ATTATCATTG TATGTTCGTT CTCTATGAAT TTATAGTCAA GCTTCCTTTT GCTACTACCC TGCTCCTCAA TCAGCGTCAG GGAAGCAAGA TCCTCCTCGG CTAG
|
Protein sequence | MTSALQNLTI FYLLPFGYYV LADDNPVTGA IRSESKEYIG CPWWVFDFLV ESVVCLLLAV TATAVFRSAA MNRVATHLSP RVHSVGRAVA PKAWCPPTVL WSGRVVTATS PPFGNGQESI IMEQPAAVVV NSEGLIADIL TNVTEAEACR LSQSENCEFV NLGPSMVLSP GMIDVHCHIS ELGRDWEGYH TATRAAAAGG ITTLVGMPLN SIPSTTTVDA LEQEIEAALD TTLLADVANL KNDAFELKAL LSAGVFGLKA FLSPLPPNAG YESVSPAQLA EAAKICGTYD RPILVHAELM SLDVCQERLE DAYRGQSLKS YKAHVQSRPP QWEQNAVQVV CDQTMHCKMH IVHLSEASCL EIIAATKKHL EKFAKEQNIS VETCSHYLLF DCDSIPDGET RLKCFPPIRN KANRELLWNT GIRGGLISMV TSDHSPCPSK MRNLDTLNVK DAWAGLTSLQ YQLPATWTAS QMRGLDYSLV DMARWWSLSP STLPTGMSDI KGKIAIGHQA DFVAWDPQYT GPPNGNSTEY HRWKDSYLGT MSLRGRVIVK LPFATTLLLN QRQGSKILLG