Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50253 |
Symbol | |
ID | 7199024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 110146 |
End bp | 111819 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185127 |
Protein GI | 219129924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG TTCGGAATTC CACCGCGGAT TCCTCGGTTG TGTTGGTAAC TGCGCCTGGA TGGTCCGAGA GGTCTCGTTC GGATGGGTTG GTCCAGGTGG AAGCGGTGCG GTCTGAAAAC GTCGATATTC CTAAAACAAA CTTGATTAAA CACTTTGTCG AGTCTAGGTT CGGCGGTGAA ACAAAGGATG GTCATTTGAC AGAAATGAAG GAAATGCCGC TGAAGGTCAC GCCGTCTCCG AATCAACAAA CCGAGGCGGA TGCAGACTTT GACGCCTGTC AACAATACAA CTGCGAAACG AGTGGAGAAA ACGAAAAGGA GGAAGCTAGT GATCTTAATG GTCCTTTCGG TACTATGAGT GCGTTTGCTT CCGCGGTTAT GTCCTCTATC CTGCTGGATG GCAATAATTC GGAAAGTCCG CAATCAAACC CTTCTGTTGA AAGCGTTTCA ATGTCTTTTG TGCTTGGAAA GACATACCAT CCGCTACACG ACTATTCCAT TCGTCGAGAT GATGAAAGGT CGCTCTTCTG GTTTACGTAC CGCTGCGACT TTCCGGAGAT TGCCCCCTAC AACATTACAA GTGATGCTGG ATGGGGTTGC ATGCTAAGGT CGGCACAGAT GATGCTGGGT CAAGCCCTTC GCTTGCATTT CAAGTCGCGG GATTGGCGAC CTCCGCAACT TTTGGCACGC AGACGGCAGG ATTCATTTAT TAGAAGCGTT TTGACCTGGT TTGCGGATTA TCCTTCTTCA AGTGAGAGCG TATACTCACT GCATAACATG GTAGCAGCTG GACTTTCCAA GTACGATAAG CTTCCAGGGG AATGGTATGG ACCAGGTACA GCTTGCTATG TGATGCGCGA CTTGGTACAT ATTCATGAGA AGCAACAAGC TTTGGGAAAA ACTCGTCTTG ATCGGCGCAT ATTTCGGGTC TATGTTGCTC CACAAGGTAC CGTATATCGA GATACTATTC ATGCCTTCAT GACGACAGAA GCTAGAGTAC GGATCGAAGA AAAAAAGAAA GTGAAGGAGC AAACTCAACC TCAAGCTCAT CCCTTAGATT TGGAATGGGA AGAAGAGCTC ATGGAATCGG CGAACACTGT TGAATGGGAT ACAGCACTGT TGCTATTGGT ACCGTTGCGG CTTGGACTGA CTAGCTTAAA TGAAGAGTAC GTGCAATCTC TTGCCCACAC CTTCAGCTTG CCACAATCGG TAGGTGTTTT GGGTGGTCGT CCGCGTGGAG CCCGCTGGTT TTACGGAGCG CAAAAGGACG GGAGTAAAAT TTTCGGGCTG GATCCTCATA CGGTACAAAC AGCACCCGGT CGACAGACGG CACGCGTCAA CGGTCAAGCT TCGTCGGTCG TTGAGCTATC TGACGACTAC TTACGATCAT GCCACACAAC CTGCCCTGAA ATGTTTCCTT TTTGCAAGAT GGACCCAAGC ATTGCACTTG GATTTTATTG TCGGACGAGA GCTGATTTGA ATCACGTTTT GAATTCCATG GGGGCTTGGC AAAAAGAACA TTCATCTATT CCAGAGCTTT TTAGTGTTTT GGATAGGGCT CCAGATTACT CGGCCAACGT CGACGATCTT CTTTTGGGAG GGGATTCCTC AATGATGGAG ACTTCTGGCT TTGAAGACGA AGCAAGTGAC GCAGATGAAT ACGTTATGCT GTGA
|
Protein sequence | MSIVRNSTAD SSVVLVTAPG WSERSRSDGL VQVEAVRSEN VDIPKTNLIK HFVESRFGGE TKDGHLTEMK EMPLKVTPSP NQQTEADADF DACQQYNCET SGENEKEEAS DLNGPFGTMS AFASAVMSSI LLDGNNSESP QSNPSVESVS MSFVLGKTYH PLHDYSIRRD DERSLFWFTY RCDFPEIAPY NITSDAGWGC MLRSAQMMLG QALRLHFKSR DWRPPQLLAR RRQDSFIRSV LTWFADYPSS SESVYSLHNM VAAGLSKYDK LPGEWYGPGT ACYVMRDLVH IHEKQQALGK TRLDRRIFRV YVAPQGTVYR DTIHAFMTTE ARVRIEEKKK VKEQTQPQAH PLDLEWEEEL MESANTVEWD TALLLLVPLR LGLTSLNEEY VQSLAHTFSL PQSVGVLGGR PRGARWFYGA QKDGSKIFGL DPHTVQTAPG RQTARVNGQA SSVVELSDDY LRSCHTTCPE MFPFCKMDPS IALGFYCRTR ADLNHVLNSM GAWQKEHSSI PELFSVLDRA PDYSANVDDL LLGGDSSMME TSGFEDEASD ADEYVML
|
| |