Gene PHATRDRAFT_49066 details


Gene Information

Locus tag: PHATRDRAFT_49066
Symbol:
ID: 7195429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 479828
End bp: 483159
Gene Length: 3332 bp
Protein Length: 1012 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002183615
Protein GI: 219126754
COG category:
COG ID:
TIGRFAM ID:
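
The gene length listed above is consistent with the start and end coordinates if they are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, and the function name below is illustrative only. A minimal Python sketch:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(479828, 483159))  # 3332
```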


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCGTATCTGT TTACTCGGTT CGTTCGCACC CCCACTCACA GTCAACCTCG GGTCTTCCTT 
TCCTACCTTG CCGTGTCTGG ATCGCTGGAA TACGTGACCA TCACAGTCAT TCTCTGCTGT
TCGCTCACTC TGCTGTTCAC TGTCCGCTGG ATCGATCGAC AACACAGCAT TGCGGTAGGT
ACACGCGTAG GCTGCATCGG TGTTCCTCGA CGGCTACCAA CAGTCTCCCA CGTTCCTTTC
CTTACAATCA ATCAATCAAT CAGCCGCCTT GGTAGTTCGT ACACATACAT ACAGCATGGG
CAATATGGTT TCCAACGAAA GAGGAGCTAC GGGCACCGGT ACCAGTACAG CCATCCCCAC
CCGGACCAAC CAGAAAAAAG CCTCGGCACA AGCCGTTCAA CCACCTCCCT TGTGGAACGA
ATCCTCCAAC AAACAGTCCT TCCACCTCGA TTCACCCGGC TCGGACTTGT CCAATCCTTC
GCAGTTCCGA CACCGCCACC GTCACACCGT CAGCAACAGC AATATTCATG GTCAAGCGCA
AACACCGCTT CGACTGCGGC GTCGGGCCTC CAAGGTTGTC AGCAAGTCAC ACATCGCCAC
CATGGAATCC TCGCTTCCGT ACGGTGATTA CTCCAAACGG ACCAGTTTGC CCCCGCCTTC
CCAGTTTTAT TTTGGCGCTG GCTTTGGCAA CGTCGATAAC GCCAGCGCCA GCTTTCTCGA
GGATCGCGAC GAGCAAGACT TGCTGGACCC AAGCGCTCCC GAAATGGCCG TCTTCCGCCG
ATTCCGGTTG GCGGCCAACA CTGCCGGGGT GGGACGAATG GAAGAGGACG CCGCTAGGCG
TCGGGAACAG ATTCTGACCT CGCGCCAGCA ACGCAAAATT TGGTTCAAAT CGCGTCGCAA
GGCTTTGCAG CAGAAGGCTC GTCGTGCCAA AACTATGCTT TCCGCTTGTT GGGAAGAGCG
ACGGGGTATG TGCATCGTCA TTCCCGACGT GCAGCACCCG CAAGATGGGC GTCCCGAGGC
GGCATCCTAC CCACCACACG TTGCGGTAGA CTCCATCACC ACGGACCCAC TACGCCGGGT
TTCGTCTACC TCTCGTCGCG TTTCCGAGGA ACCCCCCATC GTCGACGGTA CTGGTCCGAT
GCAGTTTGAT CCGCCGCAAG TCCGCGTGCT GGATGAGAAT CAGCAGTCTC GCAACTTTGA
TTCGTTTTAC GACGAACTGA CCGTCACGGC GGCGCCTTTT GCGCCCGTGT GGAAGGAGAC
AAAGCAGCCC AAACACGACG AACACTCACG GGTACCCGTC GCTGCCGAGG TTCCATTTTA
TTTGTACGAT CCCCAACGTA TTTTGGAAGG AATCGTCGAC TCCTCCGAGT CTCCGACGAC
CGCCACAACG TCGACGAAGC TCCAGCAATC GCGCCGTCGT ACAAGTGCGA GGTACGTGGA
TCGTTCGGCG TCCCCCGTGG TCGCACTCTT TTCCGACGAC GACGAAGACC AGTACGATCG
CATCGGTGAT GATGACTACA CCACCAAGAG CATCGAGACC AGTGACAAAT ACAAGACCGA
GAAAGACGAC GATAATGACA CCAACAAGAC TCCATCTTCA TCCAAAACCC TGTCAAATCG
GTCGTCACCC TTTGACGAAG ACTCGGTATC GCAAGAACCA GCGGAAACGC CCAACGACGA
AAACGCTTCC GTTGTGACTC CGTCGACGGT CCAGGACCAG CCGATAGCGG TGCGTCGATC
CTCTACACAG TCTTGTTTGG ACCGTGAAAG CATTGAAGCC AACGTCCAGC AACAAATAGA
CCAGCAACAG AAGGGCGTCG CCGTCACGCA GAAACACCAA ACGCGACACC GCCCGAGTCT
TGCAGAACGG GCCGAGGATC TGGGAAGTTA CGGCAGTGAT CCACCCCAAA CCAAGGCCCG
AGCCGTGACG CAGGCGACAA AAAAGGGAGA TGCTCCTCTA GATCAAGCCA CAACCCGAGT
AACCGCTCCG GAGTCCCGCA AGCCCGTGGT GGGGCATACC AGTACCGCCA GCATTCGAGT
GGCCCGCACG TCGGAATCAA CGGAACCGAT CCAAACAGCG CACGCCGTTG CAGCCAAGAA
ATCAGGCAAG GTCGAGTCAG TCCAACCGAA GCCAGCCAAG ACAGTTCAAC CGCAAGAATC
CCGTCAATCG TCTTCTTCCC AGAACTTGCC CATATTGCTC CAGCTGGCGG GGTGGAGAGC
ACCCAGCGCG ACAACCAACA AAGGGCAACG GAACAGCAAC GGCTCAAATG CATCTACTTC
GACTGCAGCA GGTGTTGCCA AACAGCCCAA CAACATTCCC ATCTTGTTGC AGCTTGCTGG
TTGGAAGACC GAAAATACAA AGGATAAACA GGGCATCCAG AGACTAACTC TTACGGAAAA
GCAGCTCTCC CAAGGGCCCC CGGGCCGACG TAGTAACGTC ATGGCGACGC GAACGGTTGC
ACCGACTCGG CGTCGTCAGA GTGCGCCACC ACTGTCGGCC GTTCCGAAAG ACGGCAATCG
GGCTTCCAAT CGAGAATCGA GCGCCTCTGA ACCAATCTTC CGGAAAGGAA CAACATCCCA
AATCCAAAAG GAGAATCAGG GTAAGAACAT TGCCACCATA CTGCAGCAAA CGTATCCAAT
TGCGACGGCC TTCTCTATGG ATGCGATCCA TGACCTTCAA CCGGCACAGG TGCGTTTTGA
TCCGAATCAC ATTAAGAAAA CTGCCAAGGC GGGATCGCGC CCGCTCTCCA CGGCTGTAGG
AGTTGTTTCA GTGAATGGAT CAAACGAGAT TAGTACGGGG CTGAAAAGTC CGGCGGCCTC
TTCAACGTGC ACTACGGAAA CACCTCTGAG TCAAGGATCT ATTCCCATAC TGTCAGAGGA
GGCTATGGGT AACGCGGCCT TCTTGTTCTC TCCCAGTTAT ATGGGGAATG ACAAGGCATC
CTCAGAGCAA ATGAGATTGG CACCACCAGC GGGAGTTATG CACCGTGATC ACGCTTCTTC
CTTTGCCTTG TCAACCTTCG ATTCCCGCTG TCGTGTTTCT GGATCTTCCC TCTCGACCAA
ACCAGTCGCG ATTCGTCCCC TTCACAATGT TGTCGACGAC AGTGGCAGCA ACCGTAGCTA
TCATACCAAC AGCAAGACTG TTTGGAGCAT TGATGAGGCT CCCAAGGAGG GCAGTGACGT
CGGTAAACGC AATAGCAGTA TAACAACGTA TGACGATACC ATGCGAGGAA AGTTTGCGAG
CAAGGAATCA CAAAAGTTTG AAATGCCACC AATTGTTCAT GCACTCTCGG ATCTCACTGA
TACGACTGGA CGGGAGAGTG GAATGGGCAC AA
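
The sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be checked against the fields in the record. The following Python sketch shows one way to do that. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, used purely as sample input.
example = "GCGTATCTGT TTACTCGGTT\nCGTTCGCACC"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two functions on the full gene sequence text would give values to compare with the Gene Length and GC content fields.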
 
Protein sequence
MGNMVSNERG ATGTGTSTAI PTRTNQKKAS AQAVQPPPLW NESSNKQSFH LDSPGSDLSN 
PSQFRHRHRH TVSNSNIHGQ AQTPLRLRRR ASKVVSKSHI ATMESSLPYG DYSKRTSLPP
PSQFYFGAGF GNVDNASASF LEDRDEQDLL DPSAPEMAVF RRFRLAANTA GVGRMEEDAA
RRREQILTSR QQRKIWFKSR RKALQQKARR AKTMLSACWE ERRGMCIVIP DVQHPQDGRP
EAASYPPHVA VDSITTDPLR RVSSTSRRVS EEPPIVDGTG PMQFDPPQVR VLDENQQSRN
FDSFYDELTV TAAPFAPVWK ETKQPKHDEH SRVPVAAEVP FYLYDPQRIL EGIVDSSESP
TTATTSTKLQ QSRRRTSARY VDRSASPVVA LFSDDDEDQY DRIGDDDYTT KSIETSDKYK
TEKDDDNDTN KTPSSSKTLS NRSSPFDEDS VSQEPAETPN DENASVVTPS TVQDQPIAVR
RSSTQSCLDR ESIEANVQQQ IDQQQKGVAV TQKHQTRHRP SLAERAEDLG SYGSDPPQTK
ARAVTQATKK GDAPLDQATT RVTAPESRKP VVGHTSTASI RVARTSESTE PIQTAHAVAA
KKSGKVESVQ PKPAKTVQPQ ESRQSSSSQN LPILLQLAGW RAPSATTNKG QRNSNGSNAS
TSTAAGVAKQ PNNIPILLQL AGWKTENTKD KQGIQRLTLT EKQLSQGPPG RRSNVMATRT
VAPTRRRQSA PPLSAVPKDG NRASNRESSA SEPIFRKGTT SQIQKENQGK NIATILQQTY
PIATAFSMDA IHDLQPAQVR FDPNHIKKTA KAGSRPLSTA VGVVSVNGSN EISTGLKSPA
ASSTCTTETP LSQGSIPILS EEAMGNAAFL FSPSYMGNDK ASSEQMRLAP PAGVMHRDHA
SSFALSTFDS RCRVSGSSLS TKPVAIRPLH NVVDDSGSNR SYHTNSKTVW SIDEAPKEGS
DVGKRNSSIT TYDDTMRGKF ASKESQKFEM PPIVHALSDL TDTTGRESGM GT