Gene PHATRDRAFT_42753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42753 
Symbol 
ID7196379 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp969438 
End bp974693 
Gene Length5256 bp 
Protein Length1678 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176699 
Protein GI219109892 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.260531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGGAGTGTAA ACGAGCCAGA GCCGAGGCAA TCGGACCGTT TTCCTGGACA CTTACGTTGG 
ATATTTGCTT TGGTAAGAGC TCAAGAACTC GCCGGAAGAG TTGCAGTATC ATTGTCTCGC
AATTCTAAAA AAGGCTCAGC ATGTCGGCTC CGGCATCCCG CGCGGTACAT CCGTGTGTCT
TGTCCGTAAC GGAGGGCACC CGCGGAGTGG AATTCTCGAC TTGGGCGCAT ATCACTACCA
ACAGCGGCAG GAGCAATAAA GTATCGTCGT GCTTGCCTAA CTTGGTTACG GCCTCCGCCA
GTACCATTTC CGTGTATCGT ATTGATGAAG ATAACAACGG CAAGCTGTGG CTAGAGCACT
CCTTCGGGAA TCTAGCGGGG ACGGTTGTTT TTCTGGGAAC GTTGAAGGCA ACTCAGCAAG
CGGATGTTCC TGACGCGCTG CTCGTGGGAT TTGCGGGACA TGCCCGCTGG ACCGTCCTAC
AAGTCCAACA CGATCTCTTG CAGGCCACGT CTTTGTTGGA TCTCACACCG ATCCTGTCGG
ACTACAGTTA CGGACAGGCA TCGAATTGCT TGGCGGAACA AGACATGATC CTGACGTGCT
TGGAATCGCG TCCTGGGAGG ACGGTAGTAG GCTGCGTTTT GGGCGGTGGC GTCGCTGTAG
CGGTCGTGGA AGTCGGCTAC CAAAAAGCAG TGGCCGGATG GATCGCTGAC GAACCCTACG
TGTTACCTTT GGCAAATTTG TCCACTCAGC TCCCGCACCG AGCGAATCAC CTTGGTAACA
AAAGCTATAC AAGCAACAGT AACAACACCC ACCCAAATAG TGGAGAATCC ATTGCAACAG
GCTTTGGTGA CATATTGTCG GCTGCATTTT TGTCGGGCTA TCTTGAACCC GTTCTCGTTT
TGTTACATTC CGACGTGGAA GGGCCTGTAT GGAGTGGACG CTTGGGCCGG GAACGTGGTG
TGGCTGGGGC ACCACCCTTG TTTGTTACGG CGCTTTCAAT TAGTGTCGTG CACGGGCGGA
CAGCTGTCTT GTGGAGTCAA GTGGTGTCCG CTGACGCCAC CAAGATTCTT AGCTTTGGAA
AGACAGGGTG CTTAGTTGTT GGTGCGAACA CACTTGTGAT TTTGGAAATC GGAAAAGTCC
AGCAGGTGAT TGCCATGAAC GGATGGGCAC GTTCCACTTG TCCAGCAGCC TTACAGACCG
CCTTGCAAGC CAATCCTGTG GTCAAATTAG CAATTCAGCT AGATGGTTGT TGTGTGACCT
GGCTCTCGGA ACACTCTGCC ATAATGGCGC TACGCACTGG ACAGCTTTAC GTGCTACAAC
GGACCGACGA CCGTTGGGCA GTCATGCCTC TGGGTCAAAC GCTGGGAGCG GTAGGAGAAG
TGGCCCACTT GGCCTCGTTG CCTATTGGTG GTCTGCGTTG GCTGGAAAAA ATGAAAATGG
ATGAGAACAA GGCATCGGAG ATGCAAATGG GTGTTTTGTT TGCCGGTAGT CGCACCGGGG
ACTCTCTGTT TCTTGGATAC GCTTTGGAAA TCGTCACCAT GCCATGGGCT GCCATCAAGT
CGGAAGGGCA AACTTTTATC AATTTTGAAG GTAGCGAGCT TTCAAAAGTC GCAACGACGG
CGCCGATAGC AAATGGTTTA GATCGAATTC TGCAGCTAGA AGAAGAGGCA TTATATGGGA
CAGATAGAAG CACACCTTTA CACATCGTAC GGGATAGTGA GGAGGAGGAA ACTGCCGACA
TCCCGTCGGA CGCCAAACGA TTGCGGCCGG TCGCATTTAC TGTGGTTCGG ACTATCGTGC
CACTCGATGT CCTCGTCAAC CTAGGGCCTC TGGGGCCTTC CTGCGAAGGG CCCATTTGTG
CTCCGCCAAA TTTCATGGTA AGCCAAGAAA AGGTCACGAC CGCGATGGGG AAGAAAGAAC
CAGTATTCGG ATCTCCCGCG TGTATCTATC CGTGCGGTTA TGGCTCCAGT GGAGGGGTGG
CAGTAGTCAC CGTTCCTGGC CGAGACGACC GAATGATTCT AGCAGAAGAG GATTGCTTGA
ATGTCGATGC CATCTTTAGT CTTTCCACAG CAGGTATCAT TCTGCTAGGT ATGGGCGGCG
GCGGAATCAA GGTTCTACGC CTAAAGCATT CCAACGCATT GGAGGAAGTG GATGTCCAAC
AGTGGTGCCG AGGTTCGAAA GACAACACAT CTGGTCCCTT TCCAAGTTGT GTCCAATTGT
TCACCGCTAC TTTGCTACAA GCGACAGAGT TCAATGACCG CAGTTTTGCT TTACTCGTGT
CCTCTCCGCT AGAAGACGGC GTAAGCTATT CCGTCGTCAT ACTTAACGAG GAGAACGGAA
ATCTATGCAT CAAGTATCAG CATGTGATCG CATCGACTGA CGACAAGATG TTATCGAGCA
CTCCTTTTGT GCATTCACCA TCCGATGAAG CAGTAGCGTT TGGTTGTAAT TGGTTGTCAG
GGGATGCCTT CTCGTTTATT TTGGATGCAA ATGGAAGATT TCGAGCATGC CAATTTGCAG
GTCGCGCCGA GTCGCTTAAC GAAGAGATGG ACGTGGACAA TGATGAAGAA CGAGAATTCT
ACCGTTTTTA TAGCAAGAAA AGGATTGTCG CTGTTGATGT TTTCCAAGCA CCAGGAAATA
TTTTCTCCTC GGCGTTCTTT CCTGACGAGA AACACGATGA TGTGATTAAC GGTGACGTAC
AGCAGCGAGC CAATGTTGTT GATTCTGATG AAGATGAGCA GGAGTTGTAC TGTAACCCTG
GTGACCCAGG CAATTTTGTA GCATCTGGGC ACATCGCTTC TTCGCAGCAG GACCTTCAGA
ATGAGGGAGA CGGGGGAAAT CCCGCATTGT ATATTGCTGT CTGCAGACAA TCTGGGCAAC
TGGAGATTTA TCTTGTGCCT TTAATCGACA ATATACCTCG ATGCTGCTGG AAATCGAGCG
GCTGCGGACT CGGAGTATCC TCGCTGACTG GACAGAACGA ATCAAAAGTG CCTTTGCCAA
AGACCTACAA AGTTCATGCA CGAGAAATCC GGTTCTTCGA ATGTGGACCA ATACCCTCGA
AGACATTGGA TACCGCGAAA AAAAACAGAA GCCTTTGCCT TGCTGTTGAC TGTTCGTCTG
GCGATCTAAG TGTATATCGA CTAGCAATAT CTCAGGACCA TGGATTCCCG CCAAGATTTG
AAAAATTTCG AATGAAATCG GTTTTTCGGC GAAGCCAGGA GCAGGCGAGA CATCGAACTA
AGCTAATTCG GAAACGAATG GTGGTTGATG TTAACGATGG CACCGGTGGG TTTGTTTATA
ATCGACTATA TCGATTTTCT GGTATTTCTG GTCAGGCCGG GATGTTTGCA GCAGTTCCAA
GACCATTTTG GCTGTGTGCT GAAAGAGGAA AGCCTTCGAT GCTTTTTCAT AGAACCAGGC
ACGCCTCACC GGCGGGTGGA AAACTGAGAC CAGTGTCTGG ATTCTGCTCA GCTGTTATTA
ACGATAAAAG TGGAAATGGA GGCTTTATCA CCCTACATGA ACGTGTTGGC CGGATCGGGA
GTCAGCGACT AACCTTATTC CACGGCCTTG CCCCTGCGTT TGGTGCACAT GGTCTGCTTC
CCGGTGGAGG GATGTGCGTA GAAAAAATTT TATTTGGCAT GACCGTTCGT CACATTCAAT
TTATCAACGA TCCATTTGTC TCGACAAGTG AACATCCGTT GTACGCGTTG CTCGTTTCAA
AGAAACTAGA AGTTGACCAA AGTGATCTGA ACGACGATGG CCTCACAGCC CAAGAACGGA
AAGAAACAGA GGAGGAGAAA GAAAATGCTA AAATAAAGCG ACAGGTTGAG GCAGATTTGG
GGGGCTTTGA TTTAGAGAAC GAATGGGTCG AAGAAATCGA AAGGGACGAT TGTTTTGCAG
TTGAGATGCA GCTAGGCGGG GCGCCTCCTA TCCCAAAAGA AGCTTTTGCC GTTTGGATTG
TTGACGCAGC AAACAACTGG ATGGTGGTAG ATTCGTTCAA GCTTGACGAA TATGAACATG
GGATGACTCT GAGTATTATG GAGCTTACCG AATTCCCTGA AGAACCAGGT AGTAGCAATG
ATACTGATGT TTCCGGAGAT GAGCTTTCCA AACGTATGTT CGTAGCTGTC GGCACGGGTG
TTTTAGATCA CAATGGAGAG GATGTTGCTT CTCGGGGTCG GGCTATTCTA TTGGAACTCA
AGCGCACAAA CTCTTCAGCT AAGGCTGCAG GTAGACAGGT GGTTGAACTC TCCTTCTGCT
ATGAAAAGGA AATTTTCCAC GGAGCAGTAA CTAGTTTGGT TTGTTTGAGC TCGGAAGGAA
AGAATCGATT GTTAATCGGT GCTGGAGCGG ACAGTAAGTT GTGTCAGTTG CTGAGTAAAA
CGGTTCTGCA ATCTTCCTTC TCTCACACCT CTCGTTGAAT CTTGGTTAAC AGTCAACGTG
GAGCAATGGG GTAATGCGAA ACTGACTCAG GTTGGCTTTT TCCGCGCCAC TATGCAAGTG
TTGCATACAA TACCTTTCAA AAGCTTTCTT TTGTTGAGCG ACGCGTACGA TTCTCTTTAT
TTTTTAATCT GGAGGGAGTC TGATAAAAGT CTGACGCTGC TTGCCAAGGA CTACGACCCG
ATTCCGGTCT ATGCAGCTGG AGTTATGAGT CGAGGCCCGG CAATGACATT CCTTTGTCAT
GACGACCGTC AAAATCTGCA GTTTTTTCAA TATGCTCCAG GTGAGGCAGC GGCGCGTGGA
GGCAACAGGC TTGTATGTAG AGCCGACTAT CATCTCGGGA CACAGACCAC GTCTTTTGCG
TCGCATTTTT GTCGCTCTAG CTTAATGATC CATAGCGCTA CTCCTACAAG TACGTTGGCT
GCTCTAAAAC AACAGGATTC CTATTTTGGA CGGAGCGAAG AGGATCAGCG ACTAGGTGCA
TATTTCGGCA CAGCTGATGG TGGGATGGGA GCCGTGGTGC CCCTCAGTGA GCCAGTATAT
TGGAGACTGA CGGCACTGCA GTCCATCGTC GCGAATGCCT TGGAGAGCGA TTGCGCTCTA
GCCCCGCGAG CATGGAGACT GTATCGAAGA AGTACACGCC GCGGCGGCTG TCGCTCTAAT
GATCGAAAGA AAGGGGTGAT AGATGGTGAC CTTGTTCTGC AGTATGCTGA TCTTTCCATC
AGTAAACAAG AGGATATTGC AAGCGCTATA GGATCTACCG TGGATCTAAT TCTTGACAAT
CTTTTGGAGC TACAATGTGG AAGTTTGGTT TTGTGA
 
Protein sequence
MSAPASRAVH PCVLSVTEGT RGVEFSTWAH ITTNSGRSNK VSSCLPNLVT ASASTISVYR 
IDEDNNGKLW LEHSFGNLAG TVVFLGTLKA TQQADVPDAL LVGFAGHARW TVLQVQHDLL
QATSLLDLTP ILSDYSYGQA SNCLAEQDMI LTCLESRPGR TVVGCVLGGG VAVAVVEVGY
QKAVAGWIAD EPYVLPLANL STQLPHRANH LGNKSYTSNS NNTHPNSGES IATGFGDILS
AAFLSGYLEP VLVLLHSDVE GPVWSGRLGR ERGVAGAPPL FVTALSISVV HGRTAVLWSQ
VVSADATKIL SFGKTGCLVV GANTLVILEI GKVQQVIAMN GWARSTCPAA LQTALQANPV
VKLAIQLDGC CVTWLSEHSA IMALRTGQLY VLQRTDDRWA VMPLGQTLGA VGEVAHLASL
PIGGLRWLEK MKMDENKASE MQMGVLFAGS RTGDSLFLGY ALEIVTMPWA AIKSEGQTFI
NFEGSELSKV ATTAPIANGL DRILQLEEEA LYGTDRSTPL HIVRDSEEEE TADIPSDAKR
LRPVAFTVVR TIVPLDVLVN LGPLGPSCEG PICAPPNFMV SQEKVTTAMG KKEPVFGSPA
CIYPCGYGSS GGVAVVTVPG RDDRMILAEE DCLNVDAIFS LSTAGIILLG MGGGGIKVLR
LKHSNALEEV DVQQWCRGSK DNTSGPFPSC VQLFTATLLQ ATEFNDRSFA LLVSSPLEDG
VSYSVVILNE ENGNLCIKYQ HVIASTDDKM LSSTPFVHSP SDEAVAFGCN WLSGDAFSFI
LDANGRFRAC QFAGRAESLN EEMDVDNDEE REFYRFYSKK RIVAVDVFQA PGNIFSSAFF
PDEKHDDVIN GDVQQRANVV DSDEDEQELY CNPGDPGNFV ASGHIASSQQ DLQNEGDGGN
PALYIAVCRQ SGQLEIYLVP LIDNIPRCCW KSSGCGLGVS SLTGQNESKV PLPKTYKVHA
REIRFFECGP IPSKTLDTAK KNRSLCLAVD CSSGDLSVYR LAISQDHGFP PRFEKFRMKS
VFRRSQEQAR HRTKLIRKRM VVDVNDGTGG FVYNRLYRFS GISGQAGMFA AVPRPFWLCA
ERGKPSMLFH RTRHASPAGG KLRPVSGFCS AVINDKSGNG GFITLHERVG RIGSQRLTLF
HGLAPAFGAH GLLPGGGMCV EKILFGMTVR HIQFINDPFV STSEHPLYAL LVSKKLEVDQ
SDLNDDGLTA QERKETEEEK ENAKIKRQVE ADLGGFDLEN EWVEEIERDD CFAVEMQLGG
APPIPKEAFA VWIVDAANNW MVVDSFKLDE YEHGMTLSIM ELTEFPEEPG SSNDTDVSGD
ELSKRMFVAV GTGVLDHNGE DVASRGRAIL LELKRTNSSA KAAGRQVVEL SFCYEKEIFH
GAVTSLVCLS SEGKNRLLIG AGADINVEQW GNAKLTQVGF FRATMQVLHT IPFKSFLLLS
DAYDSLYFLI WRESDKSLTL LAKDYDPIPV YAAGVMSRGP AMTFLCHDDR QNLQFFQYAP
GEAAARGGNR LVCRADYHLG TQTTSFASHF CRSSLMIHSA TPTSTLAALK QQDSYFGRSE
EDQRLGAYFG TADGGMGAVV PLSEPVYWRL TALQSIVANA LESDCALAPR AWRLYRRSTR
RGGCRSNDRK KGVIDGDLVL QYADLSISKQ EDIASAIGST VDLILDNLLE LQCGSLVL