Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50011 |
Symbol | |
ID | 7198710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 131058 |
End bp | 132846 |
Gene Length | 1789 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184896 |
Protein GI | 219129438 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.604892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACACTGTTG CGGTGCAGCG CTAATGGTTT CGATCTGGGG AACAAAAAGG AACCATGAAA CGTCGCGATG TTTGTCGGTC ACTTCACATC CCGGGGACCA TCGTTTGCAT GATCCTGTAC ATCGCATCCT TGTATTGGTG GACCCAGGGT ACGACCAATT GTTCACTGTC ATCAACACCA TCATTGAAGA CGCAGGGGTT CAACAATGGA TCCGTGCTGC TCCAATTTGA GGGGGGCGTT CCTGCCGACG TGACAACTAG AAATGGCAAC AACGATGACA ATGCCAAGCG ACGCGCCTGG GAACAGGCTA TGCTAGCAAA ACACCGCAAC GCGACTGTGG AATTAGAGCG AGTGGATTCG TACGCGCATA AGGATACGCG CCACCAAACG CATCCCGGAT TGTATCCCAT CGACCCGCAC GCTCGACCCC GTCCCGAGTG GCGGAACACT CACTTTCCCG ATATTAACGT GGTGGGATTA CCCAAGGCTG GGACTACGCA ACTCTACAAC ATTCTCGTTT CCCACCCCCG CACGGTCGCA TTTAATCCAC TCAACAAGGA GGAGTGTTTT CCTGGAAACT ATTCCGATGT ACTCCACGGA TGGGATGGCT TTGTTTCCGG GGAGCAGCCG ACAGCCAAGC AACGGGCGCT ACAGGGAGGC TTGTACTCTA CATTACAGAA CTATCATCAG GAAAACCCGC AATCCTTTCC ACATAAATTG TCCGTCAACG GATGTTTGAA CGGACGCATT GTCGCAGCAG TCTACGATTA TTTAAAACCT CCAGCGACCA AAAAGTTTAT TATTGCATTG AGAGACCCAG CGGATTGGCT GTGGGCCGTT TACAATTTTT GGGCCGTCCA GGACGTAGAC GTCGTCATTC CTGAGGGAGA GTGGGCTCAT CGCGAGCAAC ACTACCGATC GCCCGAAATG TTCCACGATC TCGTGGCCTC CAGTCACGAC ATGGTCTTTT TCGACAACAT GCTAGGACGT CGCGCTTTGG CTGCCTTTAA TTACGTTTGG CAGTTCGAAG CAATGGTGGG ACGGGAGAAT ATTCTCTACA TTCGCAACGA AGATCTTCTA CCAGGGGTAG TCGCGCGGCC GGGAGGAGTC CTCGACCAGC TTGCTGCATT TGCAGGTCTC AATCGCAAAG GTTTTGACTC GCAGACGTTC GGCGAGATTT CCAACTGCAA CGACCAGAAA GGGTTTGTGA AAAAATGTGG AACATCCAAG AGTAATGCAT ACGAAATCAC TGGAGGAAGA TCCATGCTTC CAGAAACGCG CACTCTGATA TATTTACTCT ATTACGAAGA ATGCAAACTG TGGTCGCAAA GATACGATGT TGTCTACGAG GACTGTTTGA ATGTGTTGGA GGCAACTAAA TCTTAGCTGA TTCAAGCTTT TACTGTGCCT TTTTTTCATG GAGCTTGGAA ACGAAAAAAT ATTTCTGATT GCTCATACTT GAGGTATTTT CAGCAACCCC CTTCAGCAGC AGCATTCCAA ACAGTTGCTG CTGTCTCTTA CCTCCAACCT GAGCTGTACA ACATTATTTT GTACTACTGG AGTACGGTTG GTCTATGCAA AGAGGAGCTT GTTGACTTCA AAGAACGCTG CTACACAGAT CTGTTGAAAA TTCGGTGCAG AAAATGAGGG CCTTAGGGTT GAAAAAATTT GCTCCGACGA TGATATTTTA CGGTGTAGGC TCTGCGTTCA CTGACAGACG GAGTCCTGCC TGATAGTGGT CATATAAGAT TTACATTACA TTAACAGTAA ATATAAAAAG ATGTTCTCT
|
Protein sequence | MKRRDVCRSL HIPGTIVCMI LYIASLYWWT QGTTNCSLSS TPSLKTQGFN NGSVLLQFEG GVPADVTTRN GNNDDNAKRR AWEQAMLAKH RNATVELERV DSYAHKDTRH QTHPGLYPID PHARPRPEWR NTHFPDINVV GLPKAGTTQL YNILVSHPRT VAFNPLNKEE CFPGNYSDVL HGWDGFVSGE QPTAKQRALQ GGLYSTLQNY HQENPQSFPH KLSVNGCLNG RIVAAVYDYL KPPATKKFII ALRDPADWLW AVYNFWAVQD VDVVIPEGEW AHREQHYRSP EMFHDLVASS HDMVFFDNML GRRALAAFNY VWQFEAMVGR ENILYIRNED LLPGVVARPG GVLDQLAAFA GLNRKGFDSQ TFGEISNCND QKGFVKKCGT SKSNAYEITG GRSMLPETRT LIYLLYYEEC KLWSQRYDVV YEDCLNVLEA TKS
|
| |