Gene Information |
Locus tag | PHATRDRAFT_44839 |
Symbol | |
ID | 7199556 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 393597 |
End bp | 397124 |
Gene Length | 3528 bp |
Protein Length | 928 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178992 |
Protein GI | 219116394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0126188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCCG TCGATGGGGT TGAGCATCTG TCCTGCTCGC TGAGACCACC CACATTGGAT GATATTCAGA GTGCGTTTGT TGAGACGTCC GTCTCGGATA CTATTGACAA CAACGATGAA ACCCAGGCGA GTACTTTAGA CCAGCCCGCG AACGGAGATT GCACTGCTTG TCCGGGGATA AACAGCATCG ACAGTATGGA GCGCGGCGAA AGTGTGGAAG AAATTTTTAG AGCGTCACTA AATTTCGACC GATCTAGTTC TTTTTGCTTT TCAGGAAGTC TACCATTATT TGGTGCCTCT CTTTATCCGA ATGTTGCGCG GATTGAAGAT GCTGATGGAA TATCGACAAC AGCACAACTT GAACAAGGAA AACGAGAGCA GTCGGAAAAT ATCAGAGTGG AATCAACCTA TGCTGAAGCC TCGCTGAACA CTCTCACTGA GGAAGCAGTT GTACTCGGCT CGGCTAGAAT GTCCGATGAT CCCTGGGACG AAATGACCGT CGAAGACAGA GAAATTCTTG CCATGAAGAG GGCAGCTTAC GCCGGAGAAA GGAGCGATCC ACTGGTGGCA CAGAACATCA AGAAGCATGG ACAAGAAGCG ACAGTCGTCG CTATATCGGA GTATGACGTG CATCCGAATG AAATTGTCCA AGATGCCGTT AGAGCGGAAC TGGTTGGACA GGACTTCACC AGGACAGTCA GCAATGCAAT GGACAACGAA AACGTGGAAA GTACAGAAGC GGATGAATTC ACAGTCAGTG CAGTCTTTTC AGCATTGAGA ACGGTCGACA ATGAAGCTAC CGAGGCAACT GTGATTGACA GTGCCCCCCT CGCTTCGGAA GAAGATATCC CTTCATGGAA GCACACGGAA GAAGCCCAGG TTCTCGTTCT GGATACGTCC TTGCCTTTGA TTGACGTTAA ACCAGCAGCT GTTGACTTCT CCGAGTACAG CAACAGTGGA ATAGAGCCTA TCGCCGACCA AGAAGCGGAA GTTCTTGGGA TTCAGGAAGA GATACATCCG TCCGAGGTTT CAGAGAACGA AACGGAAGCT GAGCTCGTGG GAACGGACCA CAATTTTGCC TTCACGCTGG CTAATGAGTC ACATCTTGAC CGTAATACAG CAAGAAGCGA CGAGCCAAAT GAATTGCGGC AAGTAGATTT TGTAGTTGAG ACTGTAGATG GCGGTTCTGA CGGTGAAGAG CCTATTAGTT TTGGTAATGT TCGCGCAGCT CTTCTGGTTC CAACTATTTT GAACGCTGAA ACCGTCGATT CCATTCCAAC GGTAGCACCT TTCACACGCT CACTGTCTAT GGAAGGCACA GATACTTCCT TTAGAGATGA TACATCGGCA CAGACGGGGA ATGGGTCCGC GTTCCCGTTG CCTCTCCCAC CTCCTAGACA AACTGCTGAA CCCAGTACCA GAAATTCTTC TGTTGGGGAA GAAGACAACT CACGTCCTGA TTGGCTTCGA GATTCTCCTG AGCAGCTTCC TTTGGGGAGC GAAACGGTAC AAAGAGAAGC TTCGAACGCC AGTGGCAGAA GTGCCGGCAG TACAGGGAAT CAGATAGTCC AGAGGACGTC CTCACAACTA CAAGTGGTAA TGTCATATGG CGCATAGCAT CAGAATATCC CCTTACTCTA CCACCGAACT CACTGAAGTT TTTTTAATTG TCGAAAAGCT TTCCAGCAGC ATAGCTCGCG GTACTAACAA GGCCTTTGAG GCCTTGTTCG GTGATGCAAA ACCTCCGTTT GTCAGTCGAA GGCGTCTTGC TGATGGGGAA GCTGCGAGGT ACATCGTTTC 
CCGGACACTT CTTCCCGCTT CCGTTATCTT CAGCCAGCCC ACAAAAATGG TGCGTTCTCC AGCGCTCGTG TTGGATGCTC TATCGTAGCT TCGTCTCGGG TTGCTAATAC GTAGTTCGTT GTTCTTTTAA AAGTGGATCG CAACTCTGCA GACGAATCAA AAGGCTCTGG ATAGTAACGA TGTCATGGAG GCGTCTAAAT CTTTGCGGGC TTTTAGTTTG CCGTCCGAAC GCCAGGCAAA ATGTTTGGCC CAAGCTTGGA CGCCTCCGCG GATGGAGCCG TTCGCCAACC ACCCACTGTG CAATACCTGC CAGTCCAAGT TTGCTGTCTT CCGTCGAGCC TGTCACTGCC GAAATTGTGG TGTTTGCGTC TGCAAAGACT GTACCGTGAC TTGGCCGGCA AAGATGGTAC CGGAGACTTA CAACATAAAG AAAACAGCGA CAGTAAACAT ATGCAAGGCC TGTGATTGGC TTTGCAACAG CTTTCGTCTT GCTCTTTTGG AGGGTGACCA GGACAAGGCA GTTGCTCTTT ACGCTACTGG GAACATCAAC ATCGTGTGCC CTTTTGGAAA CGTACGTAGT CAGCCTACGG AATCCGGCTG CATTGCAGTG CCACCACTCA TTTTCATTTC TTCTAGGTCA AAGGAGAGTT GTTCTACCCT GTTCACGCAT GCGTTCTCGG TGAGTCACTT TCGATCTTGC GATGGTTAGT CGACGAGAAT TGCTGCCCCA TAAAATCCGT TCGAGTCAAT GGAAGGACGA AGGATGGAAT CTGTAACTAT ACGGCAATTG TGACATCAAA GGGCCGGTCG TTGCTGGGAA TCGCAATGGA AAACAACTTG ATACCAATCG TTCGATATCT GGTTGTAGAG AAGGGCCTCT CTTTGGCAGA GGAGAAGTCC CTAACCCGCG AAACGCTTGT CCAAAACCTT CAGCTTGCTC TAAGGGCCAT ACCAAACGCC ACAACATCAA CCGAGGCCAT CGAAATGGAT GTGTCGGAAG CATTGTATCA CGACGCCACA GTCAATGACA GCGCCGAGGC GGATTTGGGG GACACGACGA GCCCGCTTTC TTCAGAGTAT CAAAACGTTG TACCAGTGCC TATTCCTGAC CGCGAACAGG GTAGCGAGAG AGATTTGCCA GGTGGGCGTA CTCTAAGTGA AGAAGCTCGC AATTTCGGAG CAATTAGCCG ACCCGGTAGA GGTTCTTTTT CGTACGATGG TCGGCAGGAT GAAAATGAAT GTAGGTGTAC TTACTGAAAC CCGTCTTAGC CTCAGAAGAT TCTCATTTAT CCAACTTGTT TGTCCACAGG TATCATTTGC TTTGACGCCA ATATCGAGTA CGTCCTTCAC TCTCGCCTGA ATATGTTCGT CTCTACCTAG CGAGACTTCG TTGCTCACCT TTAATTGTTA CTCTTCGCTC TGAAGTTGCG TGGCTACCCC TTGCGGTCAT CAAGTCTGTT GTCTGGACTG CAGCGAGCAC TTGTCCCGTT GTCCAGTTTG CGCCATGCCG ACCTCCTTCA TGCGAGTATT TAAAGTATGA AAAGAAGAAA TTAAGACCGA AGGTACGCTT GCACTTCGTG TTGTTGGCCA TGCCATACCG AAACGACCCC ATCAGCAAAA ACACTTAGTT CAGAAGGATC TGTTGCGTAT AGTTTTGTCA CACGTAAGGC TTTTATGTAG ATTTTTATGG TAGTGTATGA GCTTGTCTTT TGGTCTAGTA GTAAGTAG
|
Protein sequence | MPSVDGVEHL SCSLRPPTLD DIQSAFVETS VSDTIDNNDE TQASTLDQPA NGDCTACPGI NSIDSMERGE SVEEIFRASL NFDRSSSFCF SGSLPLFGAS LYPNVARIED ADGISTTAQL EQGKREQSEN IRVESTYAEA SLNTLTEEAV VLGSARMSDD PWDEMTVEDR EILAMKRAAY AGERSDPLVA QNIKKHGQEA TVVAISEYDV HPNEIVQDAV RAELVGQDFT RTVSNAMDNE NVESTEADEF TVSAVFSALR TVDNEATEAT VIDSAPLASE EDIPSWKHTE EAQVLVLDTS LPLIDVKPAA VDFSEYSNSG IEPIADQEAE VLGIQEEIHP SEVSENETEA ELVGTDHNFA FTLANESHLD RNTARSDEPN ELRQVDFVVE TVDGGSDGEE PISFGNVRAA LLVPTILNAE TVDSIPTVAP FTRSLSMEGT DTSFRDDTSA QTGNGSAFPL PLPPPRQTAE PSTRNSSVGE EDNSRPDWLR DSPEQLPLGS ETVQREASNA SGRSAGSTGN QIVQRTSSQL QVVIIARGTN KAFEALFGDA KPPFVSRRRL ADGEAARYIV SRTLLPASVI FSQPTKMTNQ KALDSNDVME ASKSLRAFSL PSERQAKCLA QAWTPPRMEP FANHPLFRLA LLEGDQDKAV ALYATGNINI VCPFGNVKGE LFYPVHACVL GESLSILRWL VDENCCPIKS VRVNGRTKDG ICNYTAIVTS KGRSLLGIAM ENNLIPIVRY LVVEKGLSLA EEKSLTRETL VQNLQLALRA IPNATTSTEA IEMDVSEALY HDATVNDSAE ADLGDTTSPL SSEYQNVVPV PIPDREQGSE RDLPGGRTLS EEARNFGAIS RPGRGSFSYD GRQDENEFAW LPLAVIKSVV WTAASTCPVV QFAPCRPPSC EYLKYEKKKL RPKIFMVVYE LVFWSSSK
|
| |
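The derived fields in the record above can be sanity-checked with a few lines of Python. This is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based, inclusive coordinates (the listed values are consistent with this: 397124 - 393597 + 1 = 3528) and that GC content is the plain fraction of G and C bases in the gene sequence. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Drop the whitespace used for 10-base grouping and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (assumed GC definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the record above.
print(gene_length(393597, 397124))  # 3528

# First two 10-base blocks of the gene sequence, as a small demonstration.
print(round(gc_percent("ATGCCATCCG TCGATGGGGT"), 1))  # 60.0
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the listed GC content. Because the gene is spliced, the gene length includes introns and is therefore longer than three times the protein length.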