Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49604 |
Symbol | |
ID | 7198214 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 172732 |
End bp | 174858 |
Gene Length | 2127 bp |
Protein Length | 608 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184413 |
Protein GI | 219128424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGTCAAGCA ACGCACCCTT GTTCTCCATA CCATGACCGT AGCTACTCAA GAAGTATCCA ACGCCCCACT GCATCGCGTA TGGAAAACAG CGGTGACGCT GACTATCGGT GCCGCAGTAT TACTCCTAGG TTGCGCCGCT TCGTCCCTTA GTCCAACGGG AAAAGGAGGC GTTGTGGGCG CTTCGTACGA TCATCTCATC TCCGCGAGTA CCGTGACTAC TTACAGCGAT CCTCCTCTCT CTCGTTACCT TCAAGACACT GACTTGGTAT GCCAGGTCAC CGTCTTTGAA ATCGTCAACG ACGCCCTGAA CGGTGTTACC GATTCCGCGG ACTTTTATGA ATGCCAGGTA GTCCTCGGAA CAACCTTGGA CAGCGTGCGT TTTACGCTGG AACTCCCCGA ATCCCTCGCA GCGGAACAGA AGGCATTGGT ACTCGACGAC TCTCAAGATG TCTTTTACGT CAAGATCCCC AACGGTAGTG TGGACCTCAC CCAGGCTTCC ATTGTGGTTC CTGACGCCAA CGCAATCTCC CTAATCTCTC CTCCGCACGG TCTTGCTCCC CCGGGACGCC GTCTTGCTGC TCCTACCGGT ACCATCTCCG CAGTAGTTAT TCGAATCACC GATAACAGTC GTTCCAGTCC CTCATTTTCC GCCTCGGAAC TCTACCAGAG CCACTTTGGC AACCAAGTGT CTCTCAAATG GCAGTACCAC AAGTGCTCCG CGGGCAAGCT CCGAATTGAA CCTGCTCCCG TTGCGATTCT GGATGTCACC GTTAACCGTC GCGCTGCTTC TTCAAACTCC CAACAGATGG TGCAAGCTGC CGAACAAGAA GCTCTGAATG TACTGCGTAC TCGTCACGGA AGAAGCCATT CGTCATTGCG AGACTACGCC GGTATCACTA TCTTTGTGAC ACCTCCTATG GGTAACTGGT TGGCGTACGC TTCGATTGGA GGCGGCCTCT CGGTCTACAA CGATAAATGG GGAGGTTACA TTGCTTCTAT TATGCACGAA GTAAGTTGGG TAATCCTTGT TCACGCACCT TGCTAGCGTT TTTTGTTTCA GCGTACTCAC ACAACCTGTG GAAATCTTTG TCTATAGGTC GGACACAACT TCAAGCTTTT GCACGCCAAC GAACGTGGCG AATACCAAGA TCACACCGGG TACATGTCCG GAGCGGTTCT CGGATACCAC GATGCTCAAC ACTTTCCCGA AATGTGCTTC AATGGAGCCA ACCATTGGGC TCTCGGTTGG TACTCCGACA AGACCCAATC AGTCAGTCTC AACAACTTGC AACAACCGAC TCTGCTTTCC ATTGCGCCCT TTGTGGACTA TCAACGTGTC CCTGGTGGCT ACCCAGTCGT TGTCCAGATC CCCGAGATGA ACTACTACCT CCAATACAAT CGCGCTAAGT CCTTCAATGC CGATACGTAT GAATACGCTG ATCACGTTAC CATCACTCTG CAGCAAAACT CTGCTACGGA ACTCGTGGAA GCCATTAGTC CATCTGATAC CCCCAAGTTT GAGGCCAACT ACGGATCACA GAAGGTAACC ATCCAAATCT GTTCCGCGGT GGCTGGAAGC GGAAATACCC CCGAGCATAT GCTTGTCTCC GTTGGCCTGA ATGGCGCCTC GGCCTGTAAT TCTGGTGGTG GCGGCGGTGG TGGTGTTGGT GCACCCGACA TCTTTACACC AAACCAAGAC AATGAGGTTT GTGGACTGCA GAGCGGAGCC AAGTGTACTT CCGACGGGCA ATGCTGTTCC AAACGGTGTC GCACTTCCTG GGATCCTGCT TACAACATTT GCAGCCACGA CACCACCAAC CCATTTGCGA TAGACCGACA AGATGTACTG GACGCCCTGA ATCTCGGCGG CTCGGACCGT TCTCGAACTC GAGGCGGCAA CGTACGGAGC TTGCGTTTGC AAAATGTGGA ATCTACTTCG AACTAGGTCA CTACCGACAG TTCCTAGATT GCGCACGGAA TACTACCTGC AATCTATGAT GTTACATATC CTTTCTATCA TTCGTTCCAA CTTCTCTCTA CCGTTCAAAA CCAATTTCAT GGTTCTGAAC TAATGTTGTC GAAGTGTGTA AATGATCATC ATAGCTAATG TAAGAACCAC GATTTGT
|
Protein sequence | MTVATQEVSN APLHRVWKTA VTLTIGAAVL LLGCAASSLS PTGKGGVVGA SYDHLISAST VTTYSDPPLS RYLQDTDLVC QVTVFEIVND ALNGVTDSAD FYECQVVLGT TLDSVRFTLE LPESLAAEQK ALVLDDSQDV FYVKIPNGSV DLTQASIVVP DANAISLISP PHGLAPPGRR LAAPTGTISA VVIRITDNSR SSPSFSASEL YQSHFGNQVS LKWQYHKCSA GKLRIEPAPV AILDVTVNRR AASSNSQQMV QAAEQEALNV LRTRHGRSHS SLRDYAGITI FVTPPMGNWL AYASIGGGLS VYNDKWGGYI ASIMHEVGHN FKLLHANERG EYQDHTGYMS GAVLGYHDAQ HFPEMCFNGA NHWALGWYSD KTQSVSLNNL QQPTLLSIAP FVDYQRVPGG YPVVVQIPEM NYYLQYNRAK SFNADTYEYA DHVTITLQQN SATELVEAIS PSDTPKFEAN YGSQKVTIQI CSAVAGSGNT PEHMLVSVGL NGASACNSGG GGGGGVGAPD IFTPNQDNEV CGLQSGAKCT SDGQCCSKRC RTSWDPAYNI CSHDTTNPFA IDRQDVLDAL NLGGSDRSRT RGGNVRSLRL QNVESTSN
|
| |