Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47577 |
Symbol | |
ID | 7202800 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 160386 |
End bp | 161849 |
Gene Length | 1464 bp |
Protein Length | 461 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182013 |
Protein GI | 219123400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.346341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATC CTCTATTTAG AATGCTCACT TTAAGGAGGG CAGCGGTGCA CAAAACAACT AGCGACCATC GCCAAGTCCA TTATCTCGAA CGCGTATCCC ACATTTTTGG AGATTTTGTC TCCCGCGGAA GGGAAGAAAC GCGGAAAAGG TCCACCAGTC GTAAACCGAT GGTTCTCGTA CAGAGGAGGA CATTGGAGGC TACGTGTTTG CAGGCCGCTG AACCAAAAAA AGGTGGCGTT TCCCGACGTC GAAGTTCGTG TGAGGTCTTC TCGGGGACGG CGGAAGAACC GACATCTCCG AAAGAATGGG TATATTATCG AGCGAACCGG CCTTCGTGGA ACATCAACAC CAAGGCTCCG AGACGGACAA GTCCTTCTGC TAGATCCGGT AGGGGACTAG TGCTTCGATT TGTTTGTGGA ACTCTGTTAT TTTGGATTCT GTCGTTAAGC TTGAAAAGAC AGGTTGGACC GGCGCTGAGG GCATTCGAAG CCGAGCTTGC CCTCACCGTG CTCCAAGGAA ACCAAGCGAT CGACTTTTAC GCTGGCACAG TTGGAGAAGA AGCACAACAA ACCAAAAATC TCTTGAAGGC TCTGCAGAGA GCAAAAGCTG CTCTGGAGAA GCAGATAGAG GCACGGCAAA TAAAGATCTT TTACAGCTCC GGTGAAACAC AAGCCTCTTC CTACCAGCCA GTCGATAGCG AAATCACCCA ATGGCTCGAT GAGAAGGAAA TTGCTTTAGA GCACAAAATT GCGGTGCTAC AAACAAACTT GAAACAGCTT AGTCGAGATT TTGTCAATAG CCGGTGAGTC AGCATTCTTG AACGTACACG ACAGACGGTT TGCGTCTTGC TTACATGATC TTCTGTGGCT CGCCCTAACA GATACGGACC TGGTCCCCAT CGTGTCGAAT TTGCAATCGA GTTTCGCAAC CACGAAAATT TGCCCGTTCC TCAAAGTTTT ATCGTAGAAC TCGCTCCCCT GGATCTTTTA CCCCATGCGG TGCACTTTTT CTTAGATCTT GTTCATCACG GCATATGGAA TGACACTGTC TTTCTACACC ATGAGGATGT GAGACACATA ATTGCGGCGG CTCCTATCGA CTTTGACACC CAAGAAATTA AGTTTAGACA GCTAGATGAA CTCGAATGGA GTGGCTTGGG ATTCCCCGAA TATTCGAAGG AAATGGCGCA CGAAAAATAT ACGCTCGGAT TTGCCGATCG GGGTCCCACG TTCTATATCA ACACAATGGA CAACACGGTT GCCCATGGCC CAGGCGGTCA GGGACATCAC ACACTTCCGA ATGACGCCGA TCCGTGCTTT GCCAAGATAG TGGAAGGGAC AAAGGTCGTG GACTCGCTAG TCCGTATGGG ATTGTTACAC ACCAAGTTTG AAGAAGGTCA GAGCCATCCG TGGGCCGATA GTGAGCACAC CTGGACGCGT ATTGTCTCTG CCGCAATACT GTAG
|
Protein sequence | MNYPLFRMLT LRRAAVHKTT SDHRQVHYLE RVSHIFGDFV SRGREETRKR STSRKPMVLV QRRTLEATCL QAAEPKKGGV SRRRSSCEVF SGTAEEPTSP KEWVYYRANR PSWNINTKAP RRTSPSARSG RGLVLRFVCG TLLFWILSLS LKRQVGPALR AFEAELALTV LQGNQAIDFY AGTVGEEAQQ TKNLLKALQR AKAALEKQIE ARQIKIFYSS GETQASSYQP VDSEITQWLD EKEIALEHKI AVLQTNLKQL SRDFVNSRYG PGPHRVEFAI EFRNHENLPV PQSFIVELAP LDLLPHAVHF FLDLVHHGIW NDTVFLHHED VRHIIAAAPI DFDTQEIKFR QLDELEWSGL GFPEYSKEMA HEKYTLGFAD RGPTFYINTM DNTVAHGPGG QGHHTLPNDA DPCFAKIVEG TKVVDSLVRM GLLHTKFEEG QSHPWADSEH TWTRIVSAAI L
|
| |