Gene PHATRDRAFT_42571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42571 
Symbol 
ID7195954 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp466297 
End bp470958 
Gene Length4662 bp 
Protein Length1319 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176588 
Protein GI219109668 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGGAGTCAAG GTCCTTCTTT TTTGGCCGTA CCGAACGATA CGTTCGACAA TGTCGAGTAT 
TGCGGGGGCT GCGGCAAGTA AGACCCTTTG GGGCAAGATA AAGTCGGTAC CGATCCAACA
CCCCTTCGCC TTTGGTTTCT TCCTTTCCGG CTTTAAGACT TCCTTTTCGG ATTTGCTCGT
TCAGAAGGTA GTGGAACAAC GAGAAACGAT TGACTGGAAG CGCAACGCCG CTTTCGCGTC
CTTTGGATTC TTCTACCTAG GAGGAGTGCA ATATGCTATT TATGTACCAC TCTTTAGTCG
CATGTTCCCG GGAGCAGCGG GGTTCGCCGC CAAGTCGATT CGGGACAAAC TCAAGGACGC
CAAGGGAATG TTTCAATGCG GAGCCCAAGT CGTCCTGGAT CAGTGCGTGC ACCATCCGCT
CATGTATTTT CCCGTCTTTT ACTGCACACG AGAATTGGTG GTACACGACA AACCCGACTT
GAAGCGTTGT CTGAACGAGT ATCGGGGTAA TATGAAAGAA GATTTGGTGG CTCTCTGGAA
GGTGTGGGTG CCGTCGACCA TTATAAATTT TGCCTTTATG CCCATGTGGG CGCGAATTCC
CTTCGTGGCA GCTACGTCTT TGCTATGGAC GAGCATATTA TCCGCCATGC GGGGTGGAGA
CGTACGTTCG TTCACCTTTT TCCCCTGGGA AACGGGGCTT GCCTGGTTCG TTCGTCCGTT
TTTGCTCACG AGCATTCTTT CTTTTGTTTG GTGGCAACTT TTTTTAGGTC TCTCACGGCG
AGGAAATGGC GGGAGGTGCG GTAACGGGAG CTACGTTGAC CATGGTAGAA GCTGGCTTTG
AGTTGTTGTT TACGTCGAAC GTCGAGCTGG ATCGGGATAG CAACCATCTC GTAATAACAG
CGTCGGGACC CGATCGCGTG GGCTGGATTG CGACACTGAG TCGAGCGATT GCTGATCAAG
GTGGGAATAT TACGCATTCA AAGCAAGTGC GGTTGGGGTC CAGTTTCATC TGCGTCTTGC
ACACGGCAGT GGACCCCGAA CTGCAACATG CGTTGATGAA ACGATTAGAA AAGATTCCCG
AGCTAGAGGG ACTTTCCTTG CAGTGCAATA TGTTGACCCG ACGAGCTACG GGGAGCTTCG
ATCCGCCCGT CATGGGAGTG CGTTTGCATT GTGCGGGAGA AGACAAGTAA GTGCTTTTGG
GTGCAACTGG ATTCTTCGCT GATGCCCTAG TCATTGTTGT TGGTGCGTAA CACTGACGTT
GCTTTATTGT GTTATTATTA TTGCTGGGAT CATGTTCAGA CCAGGAATGT TGGCGTCGAT
CACCGAAAGT CTTGCCAACC ACGGCTTGAG TTTGGAAAAT GTAACCACCA GCGTCCGACA
CAACAAAAAA AGCGGCTGCG ACTTTGTGGT GGACGCGGAC TGCACGTTGA CTCGTCATTT
GGACCAGGAC CAAATCAAAG CCATGGTGGA CGATCTCAAC CACTTGAAAC AAGAACTGGA
CCTGAGTACG GTCGATATTC GCGTGCAACG CTTGGCCGCG GAACGACGCG GTACCATCCG
GACCAGCGAG TGATCTTCGT CCCGTGTGGC AGTCGCACGC TGGCGGGTCC GTGATCGTGA
TCGTTTTCCC ATTTTTGTAA ACCTCTCCTT TTTACGTTTA TCCTAATCTT TACTTGGAAT
AGTTCCGTAT GGACCACGTT GGGATCGTGA GTTTTCTAGA GTGTCAAGAT AGTAATGTTT
TCTGAGTGTA AATATAGGGT AGTATGGGTC GGCTGGTGGT TGGGCATTGG TAGATGATGG
CTGGGCTTCG GTGCCAGTTC CGCCCTTATG GTATTGTCCG AATTGACATA CCATAGACAC
AGGATTAGAA TACGATTGGT TCCCAATGGT CGAGACGTTT CCGACTGCGC CCAAAGCAAA
AATCGCACGC GGTTTTAAAT TCCATGGCGG GAACGCGCCC ACCCGTCCCT GGTTCCCGGC
ACGCCGGACC GCTCGCGCCG GAATCCGCGT CCCACGACCA CGCAACGTCC GCACCGTGTG
GTACGAGCTG TACACTGGTT CCATCCGACC CGACCCGTGG CGCACCAACA CTTTCGGGGT
CGCCATGCCT GGCATTGTGT AGTCGTTACT ATCTCTCTCT CTCTCTCTTT GTGTCTATCT
GTTGCGCAAC CAAACAGTGT GGCATGGGTC GAGACGTGAC GGTGTCGTTC GCGTCGATTC
GCGTCCAAGT CCACCGCGGG GCGTTGCGGG AAACCAAACC GCTCCTGACG GACACCGATG
CCGTCGCGAC GCTGACGGGC CAGACGCTGT TGATCGGCGC CAGAGGCAGC TGGTACATGA
AAGCCAAAGA TCTGACGGCT CTGGACGTAT CGGATCGTGT GATTACCGTG TCGTGTGCTT
CCAAAACTGT GGAATTGCGG GACGCCCGCA ACGAACGGGA CTTTCGGATG TTTCGGCAGA
AACTGCAACA CTGGTACGAA ACGCAACGTT CCGAAACTGC CTTTGGTGAT TTCTTGCACA
CCAGTGCCAA AGCAGCTTTG GGTAGTCCGC GGAGTCGCGT TTCCACGTCC CGATCGACCC
AACGCCGGCG GATTCGGGGA ACGTACGGAT CGCAAGCGAC TCGAAGTGTA TCCACTGCCG
TGGCTCCGAC CAATCGATCC TGGTTGCCCG AAGTCTTTTC GGAAGATGAG GGAGAAGACA
CCCACCTGCT GACAGATAAC GCAGTGGAAA CACCCCGCGA GAAAGCGGTG GAGAAAGCGG
ACGACGACGC ACTCTTGCCG GCGGTTTCCA ATGGTAACTC GGTTGGCAAG AAACGCCCCC
GCCTACAGAA ACTCAAAAAG GCTCTAGACG ATACCCAGAC GGCGCTAGAA GACGACTCGG
ACGACGATGC TTTATTCGAC GACGTCCATC CATGGACTAC ACCCGCCACA CAGCACATTG
TGTCGCCGGG AGGAGCGCTC ACGGAACGCA AGTCGCCCGG AAGGAAGCAA AAGAGACTGT
CCAGTTTTTT TCCACGCAAA CCTACCCAGA AATCTTTCGA TCCCGCGACT GCGGTCACCA
CCCCACCACG CCCGGTACCG CGGACCCCGA CACGACTGTC TTCGCCTGCC GCCACCCGTC
TCGTCAAGTC GGCACGCAAA TCCCTGACCC ACGACGCGGC CTGGCTCGAA CGCTCGCCTG
CCGCCAAATC GCCGCACCAA AGCAGCGCGG AACGACTGTT TGGGAAACAC AATTTCTTCC
ACTCGTCTCA TCGGAATATT AAAAGTGAAC CCGAACCAGA AGACCCCATT CAAGAATTTG
GTGATCCCAC GTCCGCGACA CCAACATTCC AGCTCAAATT GAAGCCGCCT TTGTTCTCTC
CGTGTGATAC TAGTACACCC ACCAAGAATT TGTTTCCCGA GGTAGATTGT TCGCCTTCGT
TGCAACCCGA AGAGACCAGT AAGCTGACCC CACCTCCACT CATTCCTCGT TATCCATGCC
GGGGCTTACG GAACCTCGGC AATACGTGCT ACCTGAATTC GTCGGTCCAA ATGCTCTGTA
CCGTGCCGGA TTTTACCTCG CGACTAGACC AAATAAATGA CAACGCTCCA CTCGCGACGA
GCCTCGTTCA AGTCGCTCAC GAGCTGAGAG ATACCAATGC TCCTCTGTCC GTAAGACCAC
GAGCAATCAA AGACGCTATG GACGAGAAGA CGCACAAATA TCAAGGATTC GAGCAACGTG
ACGCCCACGA ATTTCTTAGT GATTTAATCG ATCACGTCCA CGAAGAGTTG ACCGAGAAGA
GCAAAGCGGA ACCGTCCAAA GAATCGGAGC AAACTCCGCC AACCGACGAC TTTCGTTTGG
TCGTGCGGGT GTGCCTCAAG TGTACTTCGT GTGGGTATTC TCGGTAAGCA TTTCAATCGG
AATTGTCTTT TTTTTTGTAA CGCAACAGGC TGCTGACGTA TGCTTCACTC CGTCTCCTGA
TATTGTAGGA ACAAGGACGA AATTTATCGA CACCTGTCTA TTGATGTCGT GGGCGATGCC
ACCTCGGAAG AAGTATCGGA TGTTTCGCAA GCTTCAGTGG AGCAGGGACT GGCCCGGTTC
TTTCAACCGG AGACGCGCGA AATTTTGTGT GAAAAGTGCA AACGCGGTAC TCACGCTTTC
CAGACACTAC GGATTGTTCA AAAGCCCAAA GCGCTTTTGC TGCATCTCAA ACGGTTTCTA
GTGGTGGAAA AGCCGCGACC GTTATCACCC AACCATGCAG AGGCCACTAC TGACGAAAAC
AGCCCACCTA ACAGTCAAAG TAGCTCTCCT ACAAAGACAG CGCCCCCGCC GCCCGAGTAC
GTTTTCCGAA AGAACAAGGC ACCCGTTTTG ATCCCTGCGA CTCTGTCGTT GGACTCCTAC
CAAACAAAGG AAACTGTGCA AGACGGCAAC CAGGTTGCCT CTACTTTTTC GCTGCAAAGT
GTAGTGCATC ATGTTGGCAA TCGGTCGTCG TCGGGACACT ATACAGCCGA TGCGCTGAGG
CTGGTGAACA AGAACGGCTC AGAAACGGAG AGTGCTAAGA CAATGCAGTG GATTAGTTTT
GATGACGGCT GCTCCGGTAG AACGAGTTTG GAAGATGTTA TCCTAGACCC GGTCAAGCAA
GCAACCGCTT ACATTCTACT CTATTCTTCA ACACCAATCT AA
 
Protein sequence
MSSIAGAAAS KTLWGKIKSV PIQHPFAFGF FLSGFKTSFS DLLVQKVVEQ RETIDWKRNA 
AFASFGFFYL GGVQYAIYVP LFSRMFPGAA GFAAKSIRDK LKDAKGMFQC GAQVVLDQCV
HHPLMYFPVF YCTRELVVHD KPDLKRCLNE YRGNMKEDLV ALWKVWVPST IINFAFMPMW
ARIPFVAATS LLWTSILSAM RGGDVSHGEE MAGGAVTGAT LTMVEAGFEL LFTSNVELDR
DSNHLVITAS GPDRVGWIAT LSRAIADQGG NITHSKQVRL GSSFICVLHT AVDPELQHAL
MKRLEKIPEL EGLSLQCNML TRRATGSFDP PVMGVRLHCA GEDKPGMLAS ITESLANHGL
SLENVTTSVR HNKKSGCDFV VDADCTLTRH LDQDQIKAMV DDLNHLKQEL DLSTVDIRVQ
RLAAERRVPY GPRWDRLEYD WFPMVETFPT APKAKIARGF KFHGGNAPTR PWFPARRTAR
AGIRVPRPRN VRTVWYELYT GSIRPDPWRT NTFGVAMPGI CGMGRDVTVS FASIRVQVHR
GALRETKPLL TDTDAVATLT GQTLLIGARG SWYMKAKDLT ALDVSDRVIT VSCASKTVEL
RDARNERDFR MFRQKLQHWY ETQRSETAFG DFLHTSAKAA LGSPRSRVST SRSTQRRRIR
GTYGSQATRS VSTAVAPTNR SWLPEVFSED EGEDTHLLTD NAVETPREKA VEKADDDALL
PAVSNGNSVG KKRPRLQKLK KALDDTQTAL EDDSDDDALF DDVHPWTTPA TQHIVSPGGA
LTERKSPGRK QKRLSSFFPR KPTQKSFDPA TAVTTPPRPV PRTPTRLSSP AATRLVKSAR
KSLTHDAAWL ERSPAAKSPH QSSAERLFGK HNFFHSSHRN IKSEPEPEDP IQEFGDPTSA
TPTFQLKLKP PLFSPCDTST PTKNLFPEVD CSPSLQPEET SKLTPPPLIP RYPCRGLRNL
GNTCYLNSSV QMLCTVPDFT SRLDQINDNA PLATSLVQVA HELRDTNAPL SVRPRAIKDA
MDEKTHKYQG FEQRDAHEFL SDLIDHVHEE LTEKSKAEPS KESEQTPPTD DFRLVVRVCL
KCTSCGYSRN KDEIYRHLSI DVVGDATSEE VSDVSQASVE QGLARFFQPE TREILCEKCK
RGTHAFQTLR IVQKPKALLL HLKRFLVVEK PRPLSPNHAE ATTDENSPPN SQSSSPTKTA
PPPPEYVFRK NKAPVLIPAT LSLDSYQTKE TVQDGNQVAS TFSLQSVVHH VGNRSSSGHY
TADALRLVNK NGSETESAKT MQWISFDDGC SGRTSLEDVI LDPVKQATAY ILLYSSTPI