Gene Information |
Locus tag | PHATR_43937 |
Symbol | |
ID | 7204168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 476811 |
End bp | 479810 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186068 |
Protein GI | 219112969 |
COG category | |
COG ID | |
TIGRFAM ID | |
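
The coordinates and lengths listed above are mutually consistent if positions are read as 1-based and inclusive, and if the gene length includes the terminal stop codon while the protein length does not. The record does not state this convention, so it is an assumption here. A minimal sketch of the check follows; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 476811, End bp 479810.
length_bp = gene_length(476811, 479810)
print(length_bp)                  # 3000, matching "Gene Length"
print(protein_length(length_bp))  # 999, matching "Protein Length"
```

The subtraction of one codon reflects that the listed gene sequence ends in TAG, a stop codon that is not translated into a residue.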
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.499306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTGT ATATTGAGCC TAACAGATGT CACAGCGGCT CGCCACGCCA AAGGGTGGGT AACCCAACCC TCGAGAACGT GGCGGAAGAA ATGCTGAACG GGCTCTTCTG CCATTACGAG CCTCCCGACC AGTCTCGGCG AGATCGAAGT GATTCGGGCC TGACCAGGTC CATCCTCCTC CCTCCTCGTG ATTTCCGCCA GCAAAGTGTC CGCTCTAAAA AAGAAGTGCG TTGGTCCGAC AGTATCAAAG AATCGCAAGA CAACGGGTCT CCCAGAGCGA TTCACTGCAA CGTCGGTTCA CACTGCGTCG ACGCTGGCAT CGAAGCCGGA TGCGTCGACC CTGAACACAA GGGAGTTACC CGCGATTACT TTGCCAAGTA CGACTACGAT TTGCAGCAAG TCGACAGTAC TGAAAGTGGC GCGCCGTTTC GTCCTTCTTG TCTCCACAAG AATGTAGTCT TTGATGACGA CGGGAACCCG ATTGCATCCG TTCCCGGGAC CCCGATCCCT ACGCTCCCCC CTCTCCCTCC GCTTCCATCC AGGAGGGAGC GATTTATTCC CGAGCTTTGT GGTATTGGTA TCAATTGTGG CGAAGACGAA GAGGAATTGC GTGAGATGTG CGCTTCCAAG CCGTATCGTA GGCCTTCGCA ACCGTACGAC TACTCCGAAG ACGATGAAGA CAACTCTCTT CTTCTTGAAG AAAACAACTA TGAAATTCAA GGAAGGGCAA CATTCAAGAC TCGCGGGCGC AAATCTCTTC CCACCCCCGC CTCCAGTCCG CCCACCGAAA CCAAGTCGTC CCCAACAAAG AACAACGTAA ATGGATCAAC ATTACACGAG GCCGACTGGG AGACTACTTC CAACGCATCC TCGTCACTGT TTCGCCGTAT ACTTCCTCGT TGGAAACGTA ACGGCAGCAA GCATAGAGAA GCAGTTTCTG AAGAGCAGGC AGTTCTCGAT CCAATTGAAC GGCAGCGGGC AGAAGACAAT AGAAAGGCTG AAAAGGCTTC CTTCTTGCTT AAGGATTCGA AACTAAGAGC TTCGCAAGCA TCCACATGCG AAGAAAACAA CGACTGGGAC GACATTGTAC CAGTCGCTCC ACTTTTGAAA ACGGAAATTT CGAGTCAGCT CCCTCATAGG CCCGCTGCGT GTCAACCCAC TAGTAACGCC GCTGCCCTTT ATACCGTTCC AAAATTGGAA GCAGCAAAGC GCGACCCGTC TCCTAGCCGG GGGCGAGGCC GCAGTGATAG TCTCGACAAG AAAGAGAAAC CCAGACGCGG TTTTCTGAAA CGACGAAAGA GATCCAAGTC GCCCTCGTTG CCCGGAAATT CTTTAGTTGT CTTGAACGCG CTTCAAGATA AAACCGACTG TGAGCGCAAA AGTGCCATTG TGGCTCACAG GCAAGGACTA AATCAAACGG GATTGCAACA GCCTCTTCCG ATGGGAAAAA ACCCTTATCA GCCTTCTCCA GCAAACCCAT CCCGTGAAGA ACTTCTTCTC AAAAGGCAAC AACAGAAATC TGCATCTATT CCTCCAAATT TGATTTCGAG GGAACATGTG GACCAACAAC AACCGCCAAT CATACAACAG CAACGCAATG CTTTGGCGCA ACCGCAGTTA ATCCATGATT CTCGCTTTCA TCCAGCACAT CCTTTGAATC GTTCTGTTAG CGGCAGCAAT GGCAACAGTC ATTCTCTTCA TGCAACCCAC AAACAGCTTG TGGATGCAGT AGATGATCGA ACAGAGCGAA TGAGTTTCAT TGCAGCTCAC CGTCAGGGCT TCGATGAAGA CATAAGGGTG 
GCCACTCTCG CCCAGTCCTC CCGTCACGAT GTACCAGACT GTCTTGACGA TCAAATGGAG AGAATGAGTT ACATCACCGC TCATCGTAAG AGTTTTGGCG ACGGGTCATT TCATTCTGCG AAAACGCCGT CGCAGATGAC ATTGATACAT GAATGCCAGC CTCTTGCAGA TTACCGCGGA GGTATACCAT CGACCGGGCG TCATTATTCT CCCCACGATC AACGCATTTC CTTAGAGCAA CCTTTCAAAG ACAAACGGAA AATAACAAGG GTGGAGCAAA AAGCTCTGAA GAAGCTCGAT AGGGAACAAC GCAACGAAAT GCGAAGCGTA GAGCGCTTGA TTCGAGTACG TAATCCTACG CAGCACTTAA AGACTTTAGG GTCATCGAAT CCGCAAAATT CGGCCATATT CCGAAAAGTT GATGTACCCG CTCGAAGTCC AAGCCATCAA TCTTCCTACC AAGAACCACC GTTGCCTTTC CGTCCAGTTG AGAATGTGGA TCAAACCAAC CCCAGCCAAG ATCCTCCAGT CCAGACCACT GGCTCTTTTG AAAAGTTTGA GGGAGAGAAA CCTATTGAAA CTACTTGGGA GGAAAGAACA CGATTAGCTT GGGAGCGTCT GCGGGGAAGC TTCACCATGG AGAGCGCGAC GGAAAAATCC AACGAAGAGG CCCAACTGTT ACGCCCGCCT ATATCGAACC AGTCGCCCTC CCAGCAAGTA CGGGGCATTC TGAAGATACC GTCGCACGAG AAGCGCGTGA CCTTCGGTCA AGACACAGAA CACATTTTTG ACAAGGCAGA AGAGAACATA CCAATGCAGA GTGAAAACGG GTCTTACGTT CAAAATACCG GTAGGCTATC CACGACAATG CCACAGCCAC TGGGTATGGT CGACCTTACG GATCAGCACG GTCGAGTCAC CATGCCATCC ATGTATCCCC AGAACCTGCC ACCCATTTCA TCCAGCAAGA AAACAAAAAA AGTCTTTCGC GGCGCAAGGA TTTTCAGCGA ACTATTTCGA AAAGGGAAGA AGCGTAGCAG CCGTAGCAAA ATGACAAAGG GATCGTTCCG AGTCAACCCT ATGGAAATCG TCAATTCACT CTCAACTGAA ATTACCACCC CGTCTATCTC ACGTTCTGGG GATGAGTATG ATATGCTTGG AAATACCGCG ATGGGCTACA ACGTAATGCA CGCCGTATAG
|
Protein sequence | MALYIEPNRC HSGSPRQRVG NPTLENVAEE MLNGLFCHYE PPDQSRRDRS DSGLTRSILL PPRDFRQQSV RSKKEVRWSD SIKESQDNGS PRAIHCNVGS HCVDAGIEAG CVDPEHKGVT RDYFAKYDYD LQQVDSTESG APFRPSCLHK NVVFDDDGNP IASVPGTPIP TLPPLPPLPS RRERFIPELC GIGINCGEDE EELREMCASK PYRRPSQPYD YSEDDEDNSL LLEENNYEIQ GRATFKTRGR KSLPTPASSP PTETKSSPTK NNVNGSTLHE ADWETTSNAS SSLFRRILPR WKRNGSKHRE AVSEEQAVLD PIERQRAEDN RKAEKASFLL KDSKLRASQA STCEENNDWD DIVPVAPLLK TEISSQLPHR PAACQPTSNA AALYTVPKLE AAKRDPSPSR GRGRSDSLDK KEKPRRGFLK RRKRSKSPSL PGNSLVVLNA LQDKTDCERK SAIVAHRQGL NQTGLQQPLP MGKNPYQPSP ANPSREELLL KRQQQKSASI PPNLISREHV DQQQPPIIQQ QRNALAQPQL IHDSRFHPAH PLNRSVSGSN GNSHSLHATH KQLVDAVDDR TERMSFIAAH RQGFDEDIRV ATLAQSSRHD VPDCLDDQME RMSYITAHRK SFGDGSFHSA KTPSQMTLIH ECQPLADYRG GIPSTGRHYS PHDQRISLEQ PFKDKRKITR VEQKALKKLD REQRNEMRSV ERLIRVRNPT QHLKTLGSSN PQNSAIFRKV DVPARSPSHQ SSYQEPPLPF RPVENVDQTN PSQDPPVQTT GSFEKFEGEK PIETTWEERT RLAWERLRGS FTMESATEKS NEEAQLLRPP ISNQSPSQQV RGILKIPSHE KRVTFGQDTE HIFDKAEENI PMQSENGSYV QNTGRLSTTM PQPLGMVDLT DQHGRVTMPS MYPQNLPPIS SSKKTKKVFR GARIFSELFR KGKKRSSRSK MTKGSFRVNP MEIVNSLSTE ITTPSISRSG DEYDMLGNTA MGYNVMHAV