Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50342 |
Symbol | |
ID | 7199078 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 366120 |
End bp | 367595 |
Gene Length | 1476 bp |
Protein Length | 395 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185182 |
Protein GI | 219130039 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.204798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGACCAGCAT CATGAAATCC TTGTGCTCCT GACAGCGACG AGTACATTTG TTTGTGCCTT AGTCAATCTA AGCCACCAGT CCTCATATGC TTCTCAACCC TCACATCGTA CCAGTGTTTG TCAATTGGAA TCTGGATACG ATTTCCGCAA CTTACCACGC GTGAGGCGAG GACCACTAGT GCAAATGCCT GGAATGTACA GCTCAAGGAG TGAAAGTACC GCAACTGGAC TTACGGGACC GTTGAAATCC AACGAAACCG AAAATCCTGT CCTCTACAGC GAGACATTTC AAATCCTTTC GGTGGATCTG CCCGAGATAA GCCCATACTA TCGCCACGAC CGACGAGATA AACTAGGGGC ACCAATCAAT GATACCGCTT CCAAAGATCT GAGTCGTCTG CTCGAACAAC GTTTTCAAGC ACGGAAAGCC CGTAACTTTG AGCAGCTAAA CCAAGTCGAA GCCGGGACAA CTACGACGTG CGAGTATACG ATCATCCCCC AATCTGGACA AGGCTAAAGG AACCTCCTAG AGCCCATTTA CGGAGACAGG CGCGCAAATG GCTCTGGAAG GCCGAAACCT TGTACGGACC CCGGGGGCAT CCCTTGGTTC AGGTAGGTAA TTTGATGGAT ACAGACTCGT TCGTATGCCC TTTGACGATT TCGCAAATAC ATTCGTCATT GATGCAACGG GAGCTTGGGC GAATGCAGGG GCAATTCGAT CAAGTGAATG CAATTCGGTT GGAGTTATTA GTCTACGGTG TTCGTGTTCA CGACGACTTT CGCCAGTGGA CGACTGACCC AAATCACGTC TTTCTTGCGA ATACTGCTTC GAAACATTCC CCTGCATTTC CACATGCATA TCAATGCGAT CCGTCTTCGC AATCGTCGAC ATCTCTAACT ATAGATGGGA GCGAAGCAGA CCGTCTGACG CAACGAATCG AGTTCCTGGT ACGCACGAGG GCCGCGGCAC TCTTCCGTGG CGACGACAAA AAAGCTCTGT TCATAGCTTG CAAGCTCTAC ATGACTTACG GAGTCGGAGT CAACGATACA ACGAGGACAT GGTCAATTGG GTCTCGGTTC CTGAAAAGTT ACGAAAATGA ATGGAAGGCT CCGACCATTT CAAAAATCTC TGAAATGAAG GAAAAAGTGT CGTTTACGCA TGAGCTCTTT CAAATGCGTC GCCAGTTTGA GAGCCCAAAC TTCAGACGAA GCCAAAATTC GCATTTTTTT CCAAATGCGA TTGTAGAAAA GCGAGTGGCT TCGATGGTAC AGGAGCGTAT TCACAAACGA GAAGAGGGTA TGTTTTTGGA AGCTGATGCC ATCCGTCGGG AACTTTGGTC CACTTACGTA AGTGCATTTT GATTTCGGTG CTTTTTGACA ACGCCAATAC TGATTTTCCT TGTTATTCGC CCACAGAATG TTGGAGTCAA TGATCGGCTA CAGCAGTACA GCCTGGGAGG AGTATTTGAA ATTTAA
|
Protein sequence | MPGMYSSRSE STATGLTGPL KSNETENPVL YSETFQILSV DLPEISPYYR HDRRDKLGAP INDTASKDLS RLLEQPKPSR SRDNYDVRVY DHPPIWTRLK EPPRAHLRRQ ARKWLWKAET LYGPRGHPLV QVGNLMDTDS FVCPLTISQI HSSLMQRELG RMQGQFDQVN AIRLELLVYG VRVHDDFRQW TTDPNHVFLA NTASKHSPAF PHAYQCDPSS QSSTSLTIDG SEADRLTQRI EFLVRTRAAA LFRGDDKKAL FIACKLYMTY GVGVNDTTRT WSIGSRFLKS YENEWKAPTI SKISEMKEKV SFTHELFQMR RQFESPNFRR SQNSHFFPNA IVEKRVASMV QERIHKREEG MFLEADAIRR ELWSTYNVGV NDRLQQYSLG GVFEI
|
| |