Gene Information |
Locus tag | PHATRDRAFT_50026 |
Symbol | |
ID | 7198724 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 180760 |
End bp | 184287 |
Gene Length | 3528 bp |
Protein Length | 741 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184837 |
Protein GI | 219129315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.301248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTCACACTC GAATGGTTGA ATGGTGACTG ACAGTGACTG CTTGTCTTTA CTTAGAGAAT GGACAGATCT CCTCCCCGTT GCTTCAACTA ATTGTGTGTC CAGAGTACGA ACACCGATGG TGGGATAGGT TCCTTTGCTA GAATACTCTT GCGTAGAAAA AGTACCATCT GTTAGGGAGG ATGGATCTCA TTGACACAGT GAACGTCGCA TTTCAGGACC CATACGCCAC TCCATACCAG TTCGTTCGGG GGAAAGAAAA TCCACAAGTA GCCGTAAGTA CAGCAGTTCG TTGGCTCGAG TGAACGACAC TTACCGACGG TGTCATAAAA AGCAATCGAC ACCACTATCA ACAACACACA CACACACACA CACACAAACA CCACACATTC CTCAAGGTCC GTTGGTTGCG TTGCGCTGTT TCGGTTCACT GCCTCTACTT ACCTACTCAC CTGCGTACAC ACGTGCTCAC TTCCTTTTCC TTCCAGCGTT CCCCCATGGT TGCTTCGTTT TCAATGGGCC GAAACTCGGC GACGTGGTCG GTGTCACCGT GCACCCGCAT TCTTCTACTG ACGCTCGCAA CGTGGAACAC GTTACCATCA CTCGTCCAAG CCGAGTGCTT GGCCAATACG GAGTTCAACG ACTTTTTCGA AGGCCTCGTG GGAGCACCCA TTCCCAGTGA AGGCTCGTGC TGCATGTTTG ACGTGTGCGG ACTCGAATGT CCCGTCGATG TGCCCGAACC CAGTAACGGT ACGTTCCAAA GGTGCTGCCG TCCAGCTCTC GGGCGATGGA TGGATCTAGT ATTTCGTCCC TTGTTCCGGG GTCGATCGAC ACACACACAC ACACACACAC ACACCTTTCT TTCTACCCAA CGATACGATA CGGTCAAAGT CTGCCCACAG AAAAAACAGA CACGTGCCTT GGCTCGTGGA CAGGATTGGC AGTCGCCCCT TGTATTTACT AGGAATGCAA CGAATCCTAT CCATTCTCAC CTGTGCCTGT TGTGTTCTTT GACAGGCTTT GGAATTGCCG TGGCCATTAG CATCGTCATC TCCTTCTTGA TCGGCATCGC CACCTTCTTT GTTGTCAAGG GACAGTCCGT CAACTTCTTC GTCGCCGGTC GGTCGCTCCC GTTATGGATC GTATCCATGA CGCTCGCGGC CCAATCGATC GACTCCAACG CCATCCTCGG GAACGCCGAT TTGTCCTACA AGTTTGGCTT TTACGACGGC GCCGTCATTC CCATTGGACT CGGACTCAGC CTCATCCTCA ACGGTATATT CCTCGCCCAC CACGTCAACA ACGATGAAGT CCTGACGCTG CCCGATATCT TCGCCAAGCG CTACGGCAAG GTCGTCGAAA TCCTCGTCTC CTTGACCACC ATTTGCAGCT TTCTCATGCT CCTCGCCGGG AATCTCGTCG GATTCGGGGC CATTACCAGC TACGTTTGGG GTATTTCCGA TACCGCCGCC ATCTGGATAG CTGCCGGTAT CGTCTGGGTC TACACCGCCA GTGGCGGACT CTTCTCCGTC GCCTATACCG ACGTAGTTCA GGGACTCATG GGATGGTCCG GATGCATCGT CATGGCCTTT TGGTTCATCG CCAACGAAGA ACCCAACGCT CCTCCACCTT CGATTGGATA CCCACGTACG TAACAAGCGT CCCCACGCGT GCGGTAGACA CCACATGGCT CAATCCGCGT CTTGCGCCGT TGATTTTTGC CACTCACCGT AGTATTCTTT TTTGTTTCAA TGGCGTGTAG TCTACGTCTA CCCGGATAAT ATTGGGGACG GCGGTGTCTG 
CGACATGTAC GACGGAGTCC CCTGTGCCGT CACGGCCGAC GCGTGCTGCT ACAATATTGA ACGCTGGTGC CCGGAATTCA ACGTCACGGG ACAGTGCGAA CGCTTCGACC GCGGAGCCTA CCCCGTCGGT GATCAGCGCA TTTTTTCCGA CCAAATGTCC AATTTTCGTG CGCTGACACC CTTTCCCAAC GCCATTTTCT GGAACTGGGC CACCATTTTT ATTCTCGGAT TCGGCAACCT CGCCGCCCTC GATTTTCAAG CCCGGTGTAT GGCTAGCAAG ACGCCCCGAA CGGCACGTAT CGGCTGCATT ATTGGCGGGT TGTTTACCTT TTTGATCGGT ATTCCCTTTG CCTACATGGG CGCCATTTCA CGGTACGTGG CTAGAGTCGT GTTGTGTCCT AGTGGCACCC GTACCTTGGT CCTGGCTGCT GGTCGACGTC GTACGGTTCC CCATCACGTG TGTCTGACCC CGTATTTGTT TTATTGTTGC GTTCCGTTGC GTACAGAGTG TACTACGGTC CGGATTCCAT CCACGCCGAA TTCGAGGTGG ACACGTGTCT CAGTCAGTTG GCCCTGCCCA CCTGTGGCCT CTGGCTCCCC GACAAGAACG CCTTTATCAA GTTGTTGACG CACCAAGCTC CCGACTTTTT GGGTGGTTGG TGTTTGGTCG GGATCGTTGC CGCCAGTATG AGTACAGCCG ACGGAGCTAT TCTAGCCATG GGTACGGTCT TTGCCCATAA CATCATGCGT CAATTTGATG AATGGATTCC CAATTTGGTG ACGCCGGACA ATTTGTTGGT CACCGCACGC GTCGCCACTT TGCCCTTTAC TCTAATCAGT ACCTTTATTG CGGCCTTTTA CCGATCCTCC CATTCGGCCG GGGCGACCGG GTACTTGTTG ATTGTTGCTT TTGATATTGT CTTGGCCACC GTCGTGGTCC CTTTGTTTGG ATGCTTCTAC TGTAAAAACC CTTCTCCCCG TGCCGCTTTT GTGGCCATTA TCGGCGGTGC CATTACGCGT GTCGTGCTGG AATTCGCTCT GCCCAAGGAC GGTTTCTTGC TTTTGCCGTA CGATGCGCCC GAATTTCTCA ACACTGGCCC CGGCGCCAGT ACCGGTACGC CGGTCTTTTG GGACGTGGAT CCGGGCGACA TGTGGGACGA AACCGTCGAA CCCTGCGTCC AGGAATCCTT TGAAGACTTC ACCGGCGTGG ATTCCTTGTC CGCCTTTTTG GTTTGCATTC TTTCGTTCGT GTCCGTGCAA ACGATCGAGC ACTGTACGGG CAAGCCTTTG TTCAGTTTTG CCGGTATGCA GGGTTACCAC AAGGATACTA CGGAACATCC ACTCAAGGGT AGCAGTATTG ACAAAATGGA CGAAACGGCC TTTCAGGACG ATACCACGAA ACGCGGAAAT CCCGACGCCG ATGAAGGCGA TGCCTAAGGT CGACACGAAC TCGGTGTAAG ATGTCTCTTC TTTTGAAGTT CGAGAGTGGA ATGTTGTAAT ATCTACTTGA TCAGCAGTTT GGATGCCGGT GGGCTCATTG CTGGTTTTGT TGGCATATTC TATACCGTAC ATGGTTTTCG GGTCGAGAAA TTCAACGTGT GTTTCGATTT AATATGTTTT GTTGCTATGG GAACCGCTTC CTTGACTTCG GCAAGCAATT TGCGTTTTTG AATAATAGCT TGTCCGTCGG TTTTAATTTG GTAGTATAGC GTGATTTTTA TGTTTTGC
|
Protein sequence | MVASFSMGRN SATWSVSPCT RILLLTLATW NTLPSLVQAE CLANTEFNDF FEGLVGAPIP SEGSCCMFDV CGLECPVDVP EPSNGFGIAV AISIVISFLI GIATFFVVKG QSVNFFVAGR SLPLWIVSMT LAAQSIDSNA ILGNADLSYK FGFYDGAVIP IGLGLSLILN GIFLAHHVNN DEVLTLPDIF AKRYGKVVEI LVSLTTICSF LMLLAGNLVG FGAITSYVWG ISDTAAIWIA AGIVWVYTAS GGLFSVAYTD VVQGLMGWSG CIVMAFWFIA NEEPNAPPPS IGYPLYVYPD NIGDGGVCDM YDGVPCAVTA DACCYNIERW CPEFNVTGQC ERFDRGAYPV GDQRIFSDQM SNFRALTPFP NAIFWNWATI FILGFGNLAA LDFQARCMAS KTPRTARIGC IIGGLFTFLI GIPFAYMGAI SRVYYGPDSI HAEFEVDTCL SQLALPTCGL WLPDKNAFIK LLTHQAPDFL GGWCLVGIVA ASMSTADGAI LAMGTVFAHN IMRQFDEWIP NLVTPDNLLV TARVATLPFT LISTFIAAFY RSSHSAGATG YLLIVAFDIV LATVVVPLFG CFYCKNPSPR AAFVAIIGGA ITRVVLEFAL PKDGFLLLPY DAPEFLNTGP GASTGTPVFW DVDPGDMWDE TVEPCVQESF EDFTGVDSLS AFLVCILSFV SVQTIEHCTG KPLFSFAGMQ GYHKDTTEHP LKGSSIDKMD ETAFQDDTTK RGNPDADEGD A