Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47754 |
Symbol | |
ID | 7202738 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 737772 |
End bp | 739117 |
Gene Length | 1346 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182124 |
Protein GI | 219123627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGCTG TTTTGTACGG AGCGACCGAC AACAACAACA ACAACAACAA CGGCACCAAC AAAACAGGAA GTGATGGTTC GGGAAGTGAT AAAGATGGGC GCCCCGTTTT CCCGTTTACA AGTCTCGGCA ACGGCAAAGG GGATAACGGA ACCAATATTT CCATCCGGGG TCTTACCCTG CTGTTGACCT TCTTGACGGT TGGAGTGCTG TTTTGCGTCT TGTTGCCTTT TGTCATCCCG AGACTGCGTC GGATTTGGAG GCAACGCCTA GAAACGAACA ACGAGGAGGA TCATAGATCG AGCGTGGACG AACAGGGCAG GATTCAGTTA AGGTATAGTG CCATTGAATC ATGGTTGGTG AGCAAACAGG TCTTGGCGCA CGATGACTTG TGTCGCAGAG TTCTGCAGCA CAATGGCAAG GAATGTTGGG AACCAGACAC ATCGGCACAC GGCAAAAACC GCTCGAAAGA ACGTACCATG ACTGTAGAGA CGTTGATGTC GGCGGACGAC GACAGCAACG ACGAAAACCA CGGGGAGATT TGGAGTGAAT GTGACAATGA AAATGAAGGC CTCGAATGTC CTGTTTGTAT GGATTCCATA CAAGTGGGAG CCATTGTATC GTGGTCAGCC AATCCCAAAT GCAGGCACTA TTTTCACCAC GCGTGTATCA AGGAGGTAAG CTTCATGCTG TGCCGTACTG ATTGCAACAA CAGTTGCGAT GGCAGGTCCC AGCCTTAAAA TATTTCCTTA TTTCCCCTCA AAACATTTTC CTGCTCATTT CTGCGCTTTC TCTTCAGTGG CTCCTCAAAC ACTCCGATTG CCCCTTTTGC CGTGAATGCT TTTTACCGAC CGATCAAATG CCTAGCAAAC TATCCATGAA ACACATTTCG GAGCTCATTG TAGCACAACA GCACCGATCA GCACACTGCT TCTTTTGTCT CGAACACGGT GTTGTTGCAT TGCCTAACAA ATTAGAAGCC TGTTTTGCTG TAGACTCCGA CGGCACCAAG GCGCTATGGC AACGAGCGTC TGCCGTACCG AGCCGCCAGG CGCTCGTAAC GTTACGAAAA GTTAATCACG AACAAGATTA TTTGAATCGC ACTGGCTCGA GTCAAGAAGA CGAAGATGAT CTCGGCAATG ATTCCGTAGG TTTTGCACAA GACAGTGAAT TTAACGCAAC AGCTTTGAAC AGCAACGAGG ATACAAGCAA TGCTTTAAGT GCTGATGCCA GTAGTGAGGA GGTGGACATT GTTCCACAAG CAGGAAGTAG CGTAACGGTT GCCTTCGGTG GAGCCGACGG TACGGTTGTG GATGCGGATG ACGGCGGATT CAGACGCGAA AAGTGA
|
Protein sequence | MYAVLYGATD NNNNNNNGTN KTGSDGSGSD KDGRPVFPFT SLGNGKGDNG TNISIRGLTL LLTFLTVGVL FCVLLPFVIP RLRRIWRQRL ETNNEEDHRS SVDEQGRIQL RYSAIESWLV SKQVLAHDDL CRRVLQHNGK ECWEPDTSAH GKNRSKERTM TVETLMSADD DSNDENHGEI WSECDNENEG LECPVCMDSI QVGAIVSWSA NPKCRHYFHH ACIKEWLLKH SDCPFCRECF LPTDQMPSKL SMKHISELIV AQQHRSAHCF FCLEHGVVAL PNKLEACFAV DSDGTKALWQ RASAVPSRQA LVTLRKVNHE QDYLNRTGSS QEDEDDLGND SVGFAQDSEF NATALNSNED TSNALSADAS SEEVDIVPQA GSSVTVAFGG ADGTVVDADD GGFRREK
|
| |