Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47704 |
Symbol | |
ID | 7202708 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 583076 |
End bp | 586664 |
Gene Length | 3589 bp |
Protein Length | 1183 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182093 |
Protein GI | 219123565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGTC CCAACGGCAA AGGCGGTCGC AACCCCAAAA AGAAGAAGAA AAAGAAGACT TCTCCGTCGA ATCGACTCGT TGACGGGGAT TCTTCCGTCC CGCAAAACGC AGTGACTGTG AATAGTACCA TTCCGACGAA TACGATCACG ACGAACGGTA CTTCCGAGGC GGCCACGGAC TGGTACCACG CCGCACTCGC TTGCAAAGAA CAAGGCAACG CCGTCTTGGC GTCGACCACA CCGCTGCACC GGGACCCTAC CACTACCAAC CACACGAATC CGATACAACA AGCGGTAGCG GACTACCAAC GAGGCTTGGC GTGCCTGTCC GCCGTGGCCG ATACCCCCGC CGCGGCGACG GAGTGTTGGC GGGATTTACG CACGCAATTG CACGGCAACT TGGCCATTGC CTGGGCCAAG CTTGGTGACT ACGAGGCGGT CGAAGCCGCT TGCTCTCTGG TACTGGACAG TCCCACGAGT GCCGCGGATG TGGCGACCGC CAAACTCTGG TACCGTCGGG GGACGGCGCG GTACGAACGA GGCCGACACG GACCGGATGA CGCTCTCCTA CACGCGAGTC ACGACGATTT GCGGCACGCA CAAATTCTAC TCGAACAAAT CGACAACAAC GCTAGTAAGA GTCACCATCA CAACAATAGC AACAACACTA TGCAACAGAG CGTACAGAGT GCCATGCAAA AGACGACACG AGCTTTGGAA GAATGCGCAC GTATGTGCAA CCGAAGTTCC AACTATTCTT CGAACGGCAC TAGCGAAGCG ACCGTCGACA TCTCCATGTC ATTGGCAAAC GTTACAGCAG TCCGTCCCGA ACGACCCGAT CCGTTGACGC AACGCGACGA CGTACGTAAA TTATTGTTGG CTCGACACTG CGGATTCGCT CAGCAGCAGC AGCCTCCATC ATACGGAAAC GGCCACGGTC ACTGCGCGAG CGAAACAGTC GAATCCACTG CAGGAGAAGC ACTCTTTTTG ATAGATTGGG ACTGGTGGTG TGACTGGTGT TACCACGTCG GCCTTTACGC CACCAACGCT CCACAAATAC AATACTACAT GGTGCAGGGT GCCGTCTTGC CGGATGAAGA AGAGGACCGC GATATGGATC AGTCCGACGC GCCTCCCGGT CCCATTGACA ATACCGCACT CTTCCTCTTG GCACCCCAAG TATGGCACGC CAAGAAAACT CTTTCTACCG CCCAACACTT TTACAAAACC TGGTATCTTT CCTACACAAC CACGCATGGC CATACCACCA CTGTAGACGA TATTGTGCCA CCACTCCAAC CCCACCTGGT CCGCGGCTAT CACTACGAAT TATTACCCCG GGAAGTTTAC GCCGCGTTGC GTTTGTGGTA CGGGGAACTG ACACCCAGCA TTTGTCGTCG CGTATCGGTG TCCCGTCACG TGCCCACGGT ACACCTCCAT CCCCAATCAC CAACACTACA ACCAGCGACC GGACCAACGT CGTCTTTCTG TTCCGCTTGT TACCGAGCGG GGGCGACCAT GCGTTGCAAA CGGTGCATGT CCGTTTACTA CTGTCAGCGC TCGTGTCAAG AATCGCACTG GCAATTTCAT AAAGCTCCCT GCAAGCGGTT GGCAGCCACC AACACGGAGG AGACCGTAGG ATCGCCCGTC TTACCGCCGC CATCGTACGG TCGCGTTGGT TTACATAATC TGGGCAATAC TTGTTTCATG AATTCCGCCT TGCAATGTTT GAGTCACGCG ACGCCACTGA CTCGGTCCTT TCTGTCTAAC CTGTACTTGA TCGACGTTAA CGTTGACAAT CCCTTGGGGA GCGGTGGGAA CCTCGCACAC GCCTACGGCG CGGTCTTGAA GGATCTGTGG ATGAAATCTA ACACGACTTC TCTCAGTCCC ACCGCGCTGA AACGAGCAAT CGCCATGTTC GCCCCGCGTT TTGCCGGATG CCTGCAACAC GACGCACAAG AATTCCTGGC CTATTTGTTG GACGGCTTGC ACGAAGATTT GAACCGGGTG CGACAAAAAC CGTACGTGGA AATGCCCGAC ATTACTCAAG GGCAAAACAT GGCCGTCGCC GGTGCACGGG CTTGGGAGGC CTTGCGCCGG AGGGATGATT CGCTCGTCAT GGATACCTTT TACGGACAGT TTCGATCAAC CTGTGTCTGT CCACGATGCC AACGAGTGTC CGTCTCCTTT GACGCTTTTA ATCACGTGAG CTTGCAAATC CCGACATCGG TAAACGCAAC AATCTCCGTC GGGGTATTTG TTATGGGGGA AAGTGGACGT TGGACAAGAT ACGGGGTCAG CCTACCTAGG ACCGCCACCA CCGCGACTTT GCGATTGCAC TTGACAGAAT TGTGCGGTGG GAAAGATTTG GCGCGGCTGG TTCTTTTGGA AGTATTCCAC AATGCCATTG TTCGTGTCGT AGACGAAACG AAATCTGTGG GGCAGTTGCA TCCCAACACC GTCCTGGCCG CTTTTGACGT GGATCCTCTG ACGGGCAATG CTGATCCAAC CTTTCACGTG TGTGCCAGCC ACAAGCTACT CCCGGAGGAT GGGGACAACA ATTTGGACCA GCCAGAGCTG TTTGGCTTTC CCTTTATGAT TTCCTTCTCG GGGAAAACGA CGTGTCGGCA AGCCTGGGAA CATCTTTGGT CTAAGGTGCA ACATTTGGTG GCGCACGGAA GCGACGAACC CGACTCGAGT GCCCGCGATT TACTGCAAAT TCATCTGCAC GATCACCGGA ACCAGCGCCT ACCCGTGTTC CCAGTGGCCA ATCTCGATGT CACCTTGGCG GAGATGGACA ATGCAATGGA CACCGAATGC ACGTCCGCTC TCCCTCGAGA TTCGGACCTC AGGCTTATCG ACCTTCTGGG CCCCAAATCC ACTGACAACT ACATATTTTT CTGGCTGGAA TGGCAAGAGA GCCCAGACGT TGTTTTAGGA AGTGTCCCAG GAGAAAAAGG ATTGGAATCC AGGATTGATG AAGAACGATT TCTAGCTTTT GAAAGTGATG CGAGCTGGTT GTTATGCCAG AAGAGGCAGA GAGCACAAAG TTTGGCAAAA GGAGTGACGT TAGACGAGTG TTTCGAAACC TTCATTCAGC CTGAACGTTT GGATGACAAC AATATGTGGT ACTGCTCGAA TTGCAAGGAT CACGTTCGAG CCATGAAGAC TATGGAACTC TGGCGGTTGC CAAATGTTCT GGTCGTGCAC TTGAAGCGCT TCGAGTTCCG CAATGTGCTG CGGCGAGACA AATTAGAAAC TCTGGTCGAT TTCCCCCTGG ATGGGCTGGA CATGAGCAAG CATTGCGGGT CGTATTCGTC CAGGTCGTTT GAAGACGAAC ACGTTCCGGC CACTTACGAT TTATTTGCCG TGACGAATCA CTTCGGACGA ATGGGATTTG GCCATTACAC CGCATTTGCC CGACGATGGG ACGAAGAGGG CATCCATAAC GAGCACTGGG CACTCTTTGA CGATTCAAGC GTACAGGAGG TCACCGATGA GAGGAATATA GTGTCATCCG CAGCGTACGT ACTCTTCTAC AGACGTCGAA CCTTTCATTA GATGGTGGAT CTTTTGCGAA GGGAATTAA
|
Protein sequence | MAGPNGKGGR NPKKKKKKKT SPSNRLVDGD SSVPQNAVTV NSTIPTNTIT TNGTSEAATD WYHAALACKE QGNAVLASTT PLHRDPTTTN HTNPIQQAVA DYQRGLACLS AVADTPAAAT ECWRDLRTQL HGNLAIAWAK LGDYEAVEAA CSLVLDSPTS AADVATAKLW YRRGTARYER GRHGPDDALL HASHDDLRHA QILLEQIDNN ASKSHHHNNS NNTMQQSVQS AMQKTTRALE ECARMCNRSS NYSSNGTSEA TVDISMSLAN VTAVRPERPD PLTQRDDVRK LLLARHCGFA QQQQPPSYGN GHGHCASETV ESTAGEALFL IDWDWWCDWC YHVGLYATNA PQIQYYMVQG AVLPDEEEDR DMDQSDAPPG PIDNTALFLL APQVWHAKKT LSTAQHFYKT WYLSYTTTHG HTTTVDDIVP PLQPHLVRGY HYELLPREVY AALRLWYGEL TPSICRRVSV SRHVPTVHLH PQSPTLQPAT GPTSSFCSAC YRAGATMRCK RCMSVYYCQR SCQESHWQFH KAPCKRLAAT NTEETVGSPV LPPPSYGRVG LHNLGNTCFM NSALQCLSHA TPLTRSFLSN LYLIDVNVDN PLGSGGNLAH AYGAVLKDLW MKSNTTSLSP TALKRAIAMF APRFAGCLQH DAQEFLAYLL DGLHEDLNRV RQKPYVEMPD ITQGQNMAVA GARAWEALRR RDDSLVMDTF YGQFRSTCVC PRCQRVSVSF DAFNHVSLQI PTSVNATISV GVFVMGESGR WTRYGVSLPR TATTATLRLH LTELCGGKDL ARLVLLEVFH NAIVRVVDET KSVGQLHPNT VLAAFDVDPL TGNADPTFHV CASHKLLPED GDNNLDQPEL FGFPFMISFS GKTTCRQAWE HLWSKVQHLV AHGSDEPDSS ARDLLQIHLH DHRNQRLPVF PVANLDVTLA EMDNAMDTEC TSALPRDSDL RLIDLLGPKS TDNYIFFWLE WQESPDVVLG SVPGEKGLES RIDEERFLAF ESDASWLLCQ KRQRAQSLAK GVTLDECFET FIQPERLDDN NMWYCSNCKD HVRAMKTMEL WRLPNVLVVH LKRFEFRNVL RRDKLETLVD FPLDGLDMSK HCGSYSSRSF EDEHVPATYD LFAVTNHFGR MGFGHYTAFA RRWDEEGIHN EHWALFDDSS VQEVTDERNI VSSAAWWIFC EGN
|
| |