Gene PHATRDRAFT_43482 details

Gene Information

Locus tag: PHATRDRAFT_43482
Symbol:
ID: 7197537
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 578227
End bp: 582120
Gene Length: 3894 bp
Protein Length: 1225 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177644
Protein GI: 219111785
COG category:
COG ID:
TIGRFAM ID:
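
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are inclusive, which the listed gene length is consistent with, and that the coding region ends in a single stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 578227
end_bp = 582120
protein_length_aa = 1225

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding length implied by the protein: 3 nt per residue plus one stop codon.
coding_length_bp = 3 * protein_length_aa + 3

# Remainder of the gene span not accounted for by coding sequence;
# presumably intronic, since the record marks the gene as spliced.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

The computed span matches the listed gene length of 3894 bp. The gap between that span and the coding length implied by the 1225 aa protein fits the record's "Is gene spliced: Yes" entry.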


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTTGT TACGAAGACT ACATAGAACT TCACGTCCTA ATATTCCACT AAGGGGGAAA 
AGAAAATCGG GAAGCGATGA AGAAACGGCA AGTGATTCAA ACGAGGATCT TGGCCGGAAG
CGGTTGCTTT CTAAGGATCG GGCCATATCC GCCCTTTCCC AACCAAACGC TCCTACCCAA
ACTCCAAACA CGAACAACGA TTCTACAAAT AAGCAGTCAA CTTCCAAAGC AGCTAGCTTA
CTGCCATCAA AGAAGGTTAA GAGTGATCGG CAGGAGGATG AGAGGACAAG GATAAAGAAA
AACAACGCTC CAAAAGCTTC CACAGTGCCC TTGCAATCAA GATCTTCCGA CCGCAGGTGG
AAAAATTCGA AGCGGAGTCG AGAAATAACA CCGAAAGTAC CCAATACTGC GAAAGCCGCC
TATTCATCAC TGCTACCGCC ACCCCTATCG GGATCGAATG GCAGTGACGA CGATTCATTG
AGTACAGGCC TATCATCGTG GGAAGAATTT CTTGGCAAAG GGAAAGAGAG TAGTGGCTCT
AGGACAACGC CAATTGTAGG ACGTTCGCAA GAGAAGCGAT CTCCTTCAAG AGTGGAAGCA
AGGAAGCGTG AGAAGTCGAA GGAGGAGCAG GCTGACGAAA GTGACAACAC TGACAAGCAT
CGGAATCTGC CGTCGATTCT GGATCTGTTT CCGTCAGCTC TCTCAACAAA CCCAAACGAA
AGGTCTGCAT CCAGCTCAAA AGTGCTGTCG GAAGACGCAG GGAAATCGTA TACATCGCTG
GATGGAGTAC TGCCTGTGTC GGATCTGTTT TATCGCTCTT CGATACCGCA AGCGCAACGT
ATTGATGGGT CCGAGCAGGC GACCAATGCA ACTGACGATG GCGAGGAAAG CCCGTATCGT
AGGGCCGTTT CGCAAACACC CGGCAAGCAA GAAATGGCCA AAAAGCCGTC TTCCACAAAA
AAACGCAGTG GTAGAAAAAT GGTTCGACGC GGTATGGAGA TGCTGGTTGG CGGTGTACCG
ATCAACGCCG ACCCCCCGCA AAGAAACGTC GATCTCTGTT ACGATAGATT GGCAGCTGAC
TGGGCCACTT CTATAAGTTT AAATACCCGA GAATTTGGGC CTCTTTTACA TGGTGCCAGC
ATCCCAAAAG TATCGTTGAA GGAACGAGGC TTGTTTTGCG AGTACTTTTG CCATGCCGCC
ATGAAATGGG ATGTCTGTCC GACAGATTTA CAGTCCATTA TCGATTCGCA TTCAAAGCAA
ATTTCAGGCT TTGAGGTAAG TGCCTGGAAA AAGACCTCCG CACACCTTCC TGGAGGCAAG
GACTTGGACA AATTAGCTGA AAGTATTAGC AAGGATATAT TGTCAAGGTT ACACGAAGAC
AGCGACACTA GTGATTCTAT GATCACAAAA GATACTCGAC ACGACGGCAA AAGCTCCGGC
GACGGAATTC CACGAATAGC TGCAAAGCTT GATTTACACG CTAAGGATTC AAGAAAGCAA
AGGTCGAGGA GGTCAAAATC GAAAGCATAT GCTCAGTCTA AAGCAAAGGG ATTTGGAAAG
GATAAAAATG GCGCGAGGAG ATCAAAAAAG GAATTTGTGG ACACCGATGC TTTGGCCTTG
TACAGCAATG AATTCAAAAT AATGGCACAG CCTATTGGTA TCTCGAAGTC CGATCTGGAA
AGCGGCGGAC AAGGCACTCA GGTTTTCGAG TCAGTTTTGC GGAAGGCTTT TGAAACCCAT
CCAGTGCTCA TTGAGGCAAT TGGAGAGTTC CACGTCGATA TTTACAACTG TACGATCGAG
GGAACTGGTC CAGATTCTTC ATTGTATTCG GTCGAATTTG GGGTATTTCC AAAAAAGGTA
ATTCACCAAA GCGAGAAGCC GGAATTACTT CATAAAATGA TAGAAGCTCT GCGTTTTATA
TTGGATACAG ACGACTCAGA AGATACATTG AACTCTACCT TGGCAAGGAT CGCAAGCGAA
GAATATCGGT GGTCTCCCAG TATTAGGGAG CGTGTTGCTC TGCAGTTTAA TGATCAAAAC
GCACCCAACC AAAGGCTTCT CTATACTGTC AATGCTGGGG TACTAGAGTT TGAAATTGGA
GTTTCCCGCG CAGAGCTGGA ATCAGGAGGT GATGGTGGCG AAATTTTTCA ATCCGTGTTA
GAGAAGGCCA TCGGTGGAGC TATGCGGAAC TCTCTGGCTG GTTTTCACTT TTCTATTACA
CATTTCACGC TTGATGATCA CGACGATGGA ACATCGTTAG TTTCTGCTGA TGTACAAATG
GAGACTTCTG AGCCGATTGC CCGATCTGAA AATCGTTTGA TCGAAAAAAA TCTTCGGGCG
GCTTTGGCTC AAGCTTTTGA AAATGGTAGT ATCATTTTGA ATTTGGCCGC AGAAGCGAAG
AAAGAAGAAA GATGGCCTAA GGAAGTACGA GATCGAGTTG TCGAAGAATG TCTATTTGAA
GACGATGATG GAGACGAACC TGTATCGGGT CTCGGGCCCG TTTCCTATCC TTTTGGAGCT
ACACGGGTTT TGTTGACAGA AGACAACGAC GAAATCGGAG ATACCTTCGA AGTCGATAAG
AATGATTACT CTCAGAACGA TTTATTCTTG GGTGGAGGCA ACGACGGTGT CTTCTTTGAC
TACTCGGAAG AAAATGCGTT CCGGGCTCCT TTCCGAGGAC AGCTTGGCTT GCGGCTGGTC
GATGCGGTCA CGGAACGTGC CAAGCAGCGG CAGCCCCGCG TAATCGCGAT AGGTGATGTC
CATGGATGCA TCGATGAGCT ACAAGACTTG CTACGTCAGT GCGACTATCG ACCAGGCGAT
CTGGTCGTTT TTCTTGGCGA TCTAGTATGC AAGGGTCCTG ACAGTATTTC GGTCGTTCAA
ATGGCCCGTG AAATCGGGGC TTTTGGTGTA AGGGGTAATC ACGACTTTGA AGTAATTCGG
TGGCACCAAG CTATCAAGTC CGGAGTAGAC CCTCCGGTAG TAGGCTCAGA GCATTTTCAC
ATTGCGTCTT GTTTGAGCAA GGCCGACATG AAATGGATGA ACAGTCTCCC TTGGTACTTG
TCTAGCAAAG AACTTGGGTC GCTTTTTGTG CACGCCGGCT TTGTTTCTGG GATCAGGCTT
GCAAAGCAAA ACCCTCGCCT GATGATGAAC ATGCGCAGCA TTCTTCCTGA CGGTACGGTT
ACATCAAAGT TCTTCAACAA CTGGCCCTGG GCACGTCTCT GGGACGGTCC GCAAACGGTT
TTATTCGGCC ACGACGCCGA CCGGGGCTTA CAGCAATACG AGCACGCTAT TGGACTCGAC
ACTGGCTGCG TGTACGGTGG ACGATTGACC GCTTGCATAC TTCCCGAAAA GAGGTTAGTC
AGTGTGAGTG CAAAGCGGGA GTACTTCAAA TACCGTCGAA AGCACTATGA TTGATGTTAG
CTTGGTATTA GCTTATTCCA ATGTCTATAT TGGAGGTAGC TACCGTATAA AAGTGTCCGA
GGTAATCCGT CCTCTCTCGC CTACTTAGTT GCTCACTATA ATTATGTCGC AGCTCCTTAT
TAACAGCAAA AGATATTGTG ACAGCAAGCA AAGCTGTCAA AAAACGTACT GTATGAGTTC
GGTTCCCGGT CACAGCTCCA GTCGAGTCCG TGATCAGGAT TCGATCTCCA TAACCGAGAA
CGTGAGCAAG GTCTATTACA TTCTTTACAG TTAACTTTCA ATTTTTAGAT TTCACTACTG
ACCGTTTTCT CTGTCGGTGA GATTCTGACG TACGGTGCCA GACACACGCA CGTTGCCGAA
GCACTCTCGA ATTCTTTTTT TGCTGGCGTG ACGATTCGCG AATCTTGCTA TCGATTCTGT
AGATTCCTAC GGAAGCAGAA TTTCCAAAAC AAGCGCGAAG GCATCAAATT TTGA
 
Protein sequence
MILLRRLHRT SRPNIPLRGK RKSGSDEETA SDSNEDLGRK RLLSKDRAIS ALSQPNAPTQ 
TPNTNNDSTN KQSTSKAASL LPSKKVKSDR QEDERTRIKK NNAPKASTVP LQSRSSDRRW
KNSKRSREIT PKVPNTAKAA YSSLLPPPLS GSNGSDDDSL STGLSSWEEF LGKGKESSGS
RTTPIVGRSQ EKRSPSRVEA RKREKSKEEQ ADESDNTDKH RNLPSILDLF PSALSTNPNE
RSASSSKVLS EDAGKSYTSL DGVLPVSDLF YRSSIPQAQR IDGSEQATNA TDDGEESPYR
RAVSQTPGKQ EMAKKPSSTK KRSGRKMVRR GMEMLVGGVP INADPPQRNV DLCYDRLAAD
WATSISLNTR EFGPLLHGAS IPKVSLKERG LFCEYFCHAA MKWDVCPTDL QSIIDSHSKQ
ISGFEVSAWK KTSAHLPGGK DLDKLAESIS KDILSRLHED SDTSDSMITK DTRHDGKSSG
DGIPRIAAKL DLHAKDSRKQ RSRRSKSKAY AQSKAKGFGK DKNGARRSKK EFVDTDALAL
YSNEFKIMAQ PIGISKSDLE SGGQGTQVFE SVLRKAFETH PVLIEAIGEF HVDIYNCTIE
GTGPDSSLYS VEFGVFPKKV IHQSEKPELL HKMIEALRFI LDTDDSEDTL NSTLARIASE
EYRWSPSIRE RVALQFNDQN APNQRLLYTV NAGVLEFEIG VSRAELESGG DGGEIFQSVL
EKAIGGAMRN SLAGFHFSIT HFTLDDHDDG TSLVSADVQM ETSEPIARSE NRLIEKNLRA
ALAQAFENGS IILNLAAEAK KEERWPKEVR DRVVEECLFE DDDGDEPVSG LGPVSYPFGA
TRVLLTEDND EIGDTFEVDK NDYSQNDLFL GGGNDGVFFD YSEENAFRAP FRGQLGLRLV
DAVTERAKQR QPRVIAIGDV HGCIDELQDL LRQCDYRPGD LVVFLGDLVC KGPDSISVVQ
MAREIGAFGV RGNHDFEVIR WHQAIKSGVD PPVVGSEHFH IASCLSKADM KWMNSLPWYL
SSKELGSLFV HAGFVSGIRL AKQNPRLMMN MRSILPDGTV TSKFFNNWPW ARLWDGPQTV
LFGHDADRGL QQYEHAIGLD TGCVYGGRLT ACILPEKRLV SLLINSKRYC DSKQSCQKTY
CMSSVPGHSS SRVRDQDSIS ITENISLLTV FSVGEILTYG ARHTHVAEAL SNSFFAGVTI
RESCYRFCRF LRKQNFQNKR EGIKF