Gene PHATRDRAFT_47704 details


Gene Information

Locus tag: PHATRDRAFT_47704
Symbol:
ID: 7202708
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 583076
End bp: 586664
Gene Length: 3589 bp
Protein Length: 1183 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002182093
Protein GI: 219123565
COG category:
COG ID:
TIGRFAM ID:
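
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a stop codon. The function names are illustrative and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_bp = span_length(583076, 586664)  # Start bp and End bp from the record
cds_bp = coding_length(1183)           # Protein Length from the record
non_coding_bp = gene_bp - cds_bp       # bases in the span that are not translated
print(gene_bp, cds_bp, non_coding_bp)  # prints: 3589 3552 37
```

Under these assumptions the span is 3589 bp, matching the Gene Length field. The 37 bp not accounted for by the 1183-residue protein are consistent with the gene being marked as spliced.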


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGGTC CCAACGGCAA AGGCGGTCGC AACCCCAAAA AGAAGAAGAA AAAGAAGACT 
TCTCCGTCGA ATCGACTCGT TGACGGGGAT TCTTCCGTCC CGCAAAACGC AGTGACTGTG
AATAGTACCA TTCCGACGAA TACGATCACG ACGAACGGTA CTTCCGAGGC GGCCACGGAC
TGGTACCACG CCGCACTCGC TTGCAAAGAA CAAGGCAACG CCGTCTTGGC GTCGACCACA
CCGCTGCACC GGGACCCTAC CACTACCAAC CACACGAATC CGATACAACA AGCGGTAGCG
GACTACCAAC GAGGCTTGGC GTGCCTGTCC GCCGTGGCCG ATACCCCCGC CGCGGCGACG
GAGTGTTGGC GGGATTTACG CACGCAATTG CACGGCAACT TGGCCATTGC CTGGGCCAAG
CTTGGTGACT ACGAGGCGGT CGAAGCCGCT TGCTCTCTGG TACTGGACAG TCCCACGAGT
GCCGCGGATG TGGCGACCGC CAAACTCTGG TACCGTCGGG GGACGGCGCG GTACGAACGA
GGCCGACACG GACCGGATGA CGCTCTCCTA CACGCGAGTC ACGACGATTT GCGGCACGCA
CAAATTCTAC TCGAACAAAT CGACAACAAC GCTAGTAAGA GTCACCATCA CAACAATAGC
AACAACACTA TGCAACAGAG CGTACAGAGT GCCATGCAAA AGACGACACG AGCTTTGGAA
GAATGCGCAC GTATGTGCAA CCGAAGTTCC AACTATTCTT CGAACGGCAC TAGCGAAGCG
ACCGTCGACA TCTCCATGTC ATTGGCAAAC GTTACAGCAG TCCGTCCCGA ACGACCCGAT
CCGTTGACGC AACGCGACGA CGTACGTAAA TTATTGTTGG CTCGACACTG CGGATTCGCT
CAGCAGCAGC AGCCTCCATC ATACGGAAAC GGCCACGGTC ACTGCGCGAG CGAAACAGTC
GAATCCACTG CAGGAGAAGC ACTCTTTTTG ATAGATTGGG ACTGGTGGTG TGACTGGTGT
TACCACGTCG GCCTTTACGC CACCAACGCT CCACAAATAC AATACTACAT GGTGCAGGGT
GCCGTCTTGC CGGATGAAGA AGAGGACCGC GATATGGATC AGTCCGACGC GCCTCCCGGT
CCCATTGACA ATACCGCACT CTTCCTCTTG GCACCCCAAG TATGGCACGC CAAGAAAACT
CTTTCTACCG CCCAACACTT TTACAAAACC TGGTATCTTT CCTACACAAC CACGCATGGC
CATACCACCA CTGTAGACGA TATTGTGCCA CCACTCCAAC CCCACCTGGT CCGCGGCTAT
CACTACGAAT TATTACCCCG GGAAGTTTAC GCCGCGTTGC GTTTGTGGTA CGGGGAACTG
ACACCCAGCA TTTGTCGTCG CGTATCGGTG TCCCGTCACG TGCCCACGGT ACACCTCCAT
CCCCAATCAC CAACACTACA ACCAGCGACC GGACCAACGT CGTCTTTCTG TTCCGCTTGT
TACCGAGCGG GGGCGACCAT GCGTTGCAAA CGGTGCATGT CCGTTTACTA CTGTCAGCGC
TCGTGTCAAG AATCGCACTG GCAATTTCAT AAAGCTCCCT GCAAGCGGTT GGCAGCCACC
AACACGGAGG AGACCGTAGG ATCGCCCGTC TTACCGCCGC CATCGTACGG TCGCGTTGGT
TTACATAATC TGGGCAATAC TTGTTTCATG AATTCCGCCT TGCAATGTTT GAGTCACGCG
ACGCCACTGA CTCGGTCCTT TCTGTCTAAC CTGTACTTGA TCGACGTTAA CGTTGACAAT
CCCTTGGGGA GCGGTGGGAA CCTCGCACAC GCCTACGGCG CGGTCTTGAA GGATCTGTGG
ATGAAATCTA ACACGACTTC TCTCAGTCCC ACCGCGCTGA AACGAGCAAT CGCCATGTTC
GCCCCGCGTT TTGCCGGATG CCTGCAACAC GACGCACAAG AATTCCTGGC CTATTTGTTG
GACGGCTTGC ACGAAGATTT GAACCGGGTG CGACAAAAAC CGTACGTGGA AATGCCCGAC
ATTACTCAAG GGCAAAACAT GGCCGTCGCC GGTGCACGGG CTTGGGAGGC CTTGCGCCGG
AGGGATGATT CGCTCGTCAT GGATACCTTT TACGGACAGT TTCGATCAAC CTGTGTCTGT
CCACGATGCC AACGAGTGTC CGTCTCCTTT GACGCTTTTA ATCACGTGAG CTTGCAAATC
CCGACATCGG TAAACGCAAC AATCTCCGTC GGGGTATTTG TTATGGGGGA AAGTGGACGT
TGGACAAGAT ACGGGGTCAG CCTACCTAGG ACCGCCACCA CCGCGACTTT GCGATTGCAC
TTGACAGAAT TGTGCGGTGG GAAAGATTTG GCGCGGCTGG TTCTTTTGGA AGTATTCCAC
AATGCCATTG TTCGTGTCGT AGACGAAACG AAATCTGTGG GGCAGTTGCA TCCCAACACC
GTCCTGGCCG CTTTTGACGT GGATCCTCTG ACGGGCAATG CTGATCCAAC CTTTCACGTG
TGTGCCAGCC ACAAGCTACT CCCGGAGGAT GGGGACAACA ATTTGGACCA GCCAGAGCTG
TTTGGCTTTC CCTTTATGAT TTCCTTCTCG GGGAAAACGA CGTGTCGGCA AGCCTGGGAA
CATCTTTGGT CTAAGGTGCA ACATTTGGTG GCGCACGGAA GCGACGAACC CGACTCGAGT
GCCCGCGATT TACTGCAAAT TCATCTGCAC GATCACCGGA ACCAGCGCCT ACCCGTGTTC
CCAGTGGCCA ATCTCGATGT CACCTTGGCG GAGATGGACA ATGCAATGGA CACCGAATGC
ACGTCCGCTC TCCCTCGAGA TTCGGACCTC AGGCTTATCG ACCTTCTGGG CCCCAAATCC
ACTGACAACT ACATATTTTT CTGGCTGGAA TGGCAAGAGA GCCCAGACGT TGTTTTAGGA
AGTGTCCCAG GAGAAAAAGG ATTGGAATCC AGGATTGATG AAGAACGATT TCTAGCTTTT
GAAAGTGATG CGAGCTGGTT GTTATGCCAG AAGAGGCAGA GAGCACAAAG TTTGGCAAAA
GGAGTGACGT TAGACGAGTG TTTCGAAACC TTCATTCAGC CTGAACGTTT GGATGACAAC
AATATGTGGT ACTGCTCGAA TTGCAAGGAT CACGTTCGAG CCATGAAGAC TATGGAACTC
TGGCGGTTGC CAAATGTTCT GGTCGTGCAC TTGAAGCGCT TCGAGTTCCG CAATGTGCTG
CGGCGAGACA AATTAGAAAC TCTGGTCGAT TTCCCCCTGG ATGGGCTGGA CATGAGCAAG
CATTGCGGGT CGTATTCGTC CAGGTCGTTT GAAGACGAAC ACGTTCCGGC CACTTACGAT
TTATTTGCCG TGACGAATCA CTTCGGACGA ATGGGATTTG GCCATTACAC CGCATTTGCC
CGACGATGGG ACGAAGAGGG CATCCATAAC GAGCACTGGG CACTCTTTGA CGATTCAAGC
GTACAGGAGG TCACCGATGA GAGGAATATA GTGTCATCCG CAGCGTACGT ACTCTTCTAC
AGACGTCGAA CCTTTCATTA GATGGTGGAT CTTTTGCGAA GGGAATTAA
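
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full listing for the whole gene.
first_line = "ATGGCTGGTC CCAACGGCAA AGGCGGTCGC AACCCCAAAA AGAAGAAGAA AAAGAAGACT"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 2))
```

Running `gc_fraction` on the complete listing should give a value that can be compared with the reported 54%, although the database may compute GC content over a slightly different region.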
 
Protein sequence
MAGPNGKGGR NPKKKKKKKT SPSNRLVDGD SSVPQNAVTV NSTIPTNTIT TNGTSEAATD 
WYHAALACKE QGNAVLASTT PLHRDPTTTN HTNPIQQAVA DYQRGLACLS AVADTPAAAT
ECWRDLRTQL HGNLAIAWAK LGDYEAVEAA CSLVLDSPTS AADVATAKLW YRRGTARYER
GRHGPDDALL HASHDDLRHA QILLEQIDNN ASKSHHHNNS NNTMQQSVQS AMQKTTRALE
ECARMCNRSS NYSSNGTSEA TVDISMSLAN VTAVRPERPD PLTQRDDVRK LLLARHCGFA
QQQQPPSYGN GHGHCASETV ESTAGEALFL IDWDWWCDWC YHVGLYATNA PQIQYYMVQG
AVLPDEEEDR DMDQSDAPPG PIDNTALFLL APQVWHAKKT LSTAQHFYKT WYLSYTTTHG
HTTTVDDIVP PLQPHLVRGY HYELLPREVY AALRLWYGEL TPSICRRVSV SRHVPTVHLH
PQSPTLQPAT GPTSSFCSAC YRAGATMRCK RCMSVYYCQR SCQESHWQFH KAPCKRLAAT
NTEETVGSPV LPPPSYGRVG LHNLGNTCFM NSALQCLSHA TPLTRSFLSN LYLIDVNVDN
PLGSGGNLAH AYGAVLKDLW MKSNTTSLSP TALKRAIAMF APRFAGCLQH DAQEFLAYLL
DGLHEDLNRV RQKPYVEMPD ITQGQNMAVA GARAWEALRR RDDSLVMDTF YGQFRSTCVC
PRCQRVSVSF DAFNHVSLQI PTSVNATISV GVFVMGESGR WTRYGVSLPR TATTATLRLH
LTELCGGKDL ARLVLLEVFH NAIVRVVDET KSVGQLHPNT VLAAFDVDPL TGNADPTFHV
CASHKLLPED GDNNLDQPEL FGFPFMISFS GKTTCRQAWE HLWSKVQHLV AHGSDEPDSS
ARDLLQIHLH DHRNQRLPVF PVANLDVTLA EMDNAMDTEC TSALPRDSDL RLIDLLGPKS
TDNYIFFWLE WQESPDVVLG SVPGEKGLES RIDEERFLAF ESDASWLLCQ KRQRAQSLAK
GVTLDECFET FIQPERLDDN NMWYCSNCKD HVRAMKTMEL WRLPNVLVVH LKRFEFRNVL
RRDKLETLVD FPLDGLDMSK HCGSYSSRSF EDEHVPATYD LFAVTNHFGR MGFGHYTAFA
RRWDEEGIHN EHWALFDDSS VQEVTDERNI VSSAAWWIFC EGN
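
The start of the protein sequence can be reproduced from the start of the gene sequence by codon translation. The sketch below assumes the standard genetic code, since the Translation table field is blank in this record. The `translate` helper is illustrative and marks any codon it does not recognize as `X`.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide string in frame 1, ignoring whitespace."""
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCTGGTC CCAACGGCAA AGGCGGTCGC"))  # prints: MAGPNGKGGR
```

The output matches the first block of the protein sequence. Because the record marks the gene as spliced, translating the full gene sequence straight through is not expected to reproduce the whole protein; intronic bases would need to be removed first.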