Gene PHATRDRAFT_49720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49720 
Symbol 
ID7198402 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp53656 
End bp57705 
Gene Length4050 bp 
Protein Length1218 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184561 
Protein GI219128734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.919834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGTCTGCT AGTGTGACAG TCGCGTGTGA GATTGATTGT TTGTCCGGTA CGCCTCGGAA 
TTCTGACTCG CTTCGATACC TTGACAGTGA GGGGACGGGT CGATCAAAAT CGAGAGCGTT
GTCTGCTAGC ACCGCACCTA CCTACCTCGA CGTGGATTAG CTTTCTGTGT AGCTTGTGTA
GAGGATTCGA TCGGAGTACA CATTCTTCGT TGTTGAGAAT TGTTGTTGAC AATTGTTGTT
GACAATTGTA GTTGACAATT GTAGTTATAG CTGTACAACA TCCACTCCAA CGACAAACGT
CAACACAATG GTGACCAGGC CGCCCTCTGC GAACGGGACC GCGAGTCCAT CCCGAGCGGG
AGCCGCCACC CGCGCCACTG CCAATCGCAC GGTCCCTCCT TTGGACTACG ACAACAACAA
CAACAACAAT AACAACAACA ACAACAAGAT CGAAGACTGT GCTCCACGTC TCTTGTCCGA
AGAAGAGAAA AAGGCCGCGG CACGGGCGTA CGCCTTGTCC GGAACGTCCT CGTCAAACCA
AAAGTCTCAC AAACGCAAAA GTCTGCTCTC TACGACAACA ACGACGGGTA CCAAAGCGAC
ACCCCGATTG CCACTGTCCT TGACGACATC CACCTGCGCG ACAACCACAA CCACCTACCC
CAGTAGAACC ACAAATACTA TCGCAACGGC CAGCACATCA GTCTCGACAC GGGTGCCATT
CGCCACCAAC CCGCACGGGT ATTCCACCCA GTCTTTGCCG CATTCCGAAC AAACATCCCG
AACCGTTCCT CCGGACGCAG GATCGGAACT CTTTCAGGAT ATGGACGAAT ACCAACGATT
CGTCAAGAGT CTCGAATTAC CCCTAGACAA CGATCTGTCC TTGCCCCTCT TTCCCTATTT
GCAAGACGAC GAAGACGACG AAGAAGAGGA ATTCCAATTG GATCTCGAAG ATGATGACGA
CGACGATGAA GAAGAAGAAG AAGAAGAAGA AGACGGAGGA GATGACGACG ACAACAATAA
CAACAAGGAA CAAGTACAGG AACAACACAA ATGGCAACAA CAACAACCAC ATCCAGACCC
TACCAGTCAA CCCCATTCCC GAGCTTGGCT TTCTCCGATA CACCCCGCCG TACCAACCCA
AGTCACGGCC ACAACAGCCA CGACTACGAA TGTCGATCCC CACACACTCG ACGACCTCCA
ACCATGGCCC GAGGATCCCG ACACTGCCGG TGACGCCTTG AATGGTTTTT ACCGAGAATT
GGAGGAAGAA TTGGGATGGT TGGAAGAAGA AGACATGGAA GCCGCCGTCG CCACCTTGTT
GGACCATCCC ATCCCCAAAG GACCGGCTGA CGACTTGCCC AAACTGGCCG GCCCCTATCC
CGACGATTCC GCCACGGACA ATTCCGTCGG TAACGGCAGT AGAGAAGGCG AACCCGGGTC
GTCTGCTCCA ACCCAACGGG GAACCGCCAA AGTCAAGTCT GCTGCAAGCG AACCCGCCCT
GGTACACACG CCACTCCGGG ACGCCGCCCG TGCTGCCCGT ACTGTGGTTA CCGACGCTCA
GCACGAGCAT TTGAAGCAAC TCATGAATCG GCACTATCAA GTCCTCACGC AACAGGCCGT
CTTGTCCGTG CGTGCCGCAC ACTACAATCG CTTGCACCGG GGAAGTCGGG AACGACACGA
AGTCGTCACT GGCGGTGAAT CCGGCGATGA TTTGGTGGAA ATCCTAGACG CCGCCGTCGG
TATGTTGCAA GATTTGGACC AGAATCGGAA GGACGCCATT CGACACTACG TACAATTTCG
TCAAACGGAG CAACGGGGGA AACATTCATC CCTCGATGCT AAACATACCC AACGACACGC
CCGACGATCT CTATTCTCCG CGATGGAACA AGAACAGGGG CTGATCGAAG GAAGCGCACC
ACGTGACAAC CCTTACCACC ACCCCAACGA AGAGCAAGCA ACGGTGGTGG CCCGGCGCTT
GACCCGGGCG CAGTTCACCA AGACATTACA GGAACAGAGC CAAGGGGAAA CGCGGACCGT
TTTCGACGTC CCCGGTTTGA GTAAATTGGG TGACACCTTC AAAGTGATTG ACAAGAGTGT
AGAAGGAGTA GGAAGGCGTA ATATTTTGGA ACTGGACTCG GTACGTTTTT GTAAAGGTTT
CGATGGACTT TATCGTGGGA TCGTTCGGAC TTTCTTCTCA CACATCGTTG TATTGAATGG
TGACAGCCTA CGGAAGCGTG TCAATTGGTT CTTGACGAAG CGGGAGCCGA GTACGATACC
AATGCCGTAC CGAGCGCCAG GGACGTATCC CTGAACTTTG TCGACGCCAA GGAATATTTT
GGTCCCAGTT TTCAACCGCC GAGCACTCCG CAACAAGAAA TGGTCCTACG ACGGAATCGC
AATCTGTTCA CGGCCGGAGA AGACAATTTG GTGCTGAGGG GAGTCAACCT GTACGGGGAA
AAGCAATGGA TTCTTATCGC TGATCGATTC TTGCCTGATC GATCAATCAA TATTATTTCG
CAACGCTATT CGAAGTTGTG CATGATGATT TATCGTGCCA ACGGTATCCG GATAGACCGT
AATGGAGACT TGGAGCAACC GCCTAAGCTA GAAAGTGTCG ACGATTTGGA CGAAGAGGCA
GTGGACAGAG TGAAGCCGGT ACCGCCACCG GCCGTGTTGA ATGTTCATCG TTGGAGTATG
GAGGAAGACT TGACGCTGCT CCGTGCTGTT CCGTTGTTTG GTCACATGTG GGCAGAACTC
GGCGCGCGGC TGATACCTCA TCGAGACCGC GGTCATCTGC GGAAGCGCTA TCAAGTATTG
GAACGTCGGG TAAAGGCTAC TGTCCTTCGC GCCAACAAGC ACGACAATTT GAAGGTGCCA
ACATGGACAG CCTCCCCAGC CAAGCCGGTC CGTTCGTCGG GGGATTTGCA GCCTTACAAT
AGCAGGGCAT CGGGAATTCC TCCCCCGCCA CCGGTACCTT CCCGCACAAC GGCACCGTCG
AAGAAACGCA CCGCCAATCA GTCAGTCAAT CATGCGGCTG CTATATTGGC CAGCGCGCGC
AGCGCACCGG CAGCTAATGT ACCCCAACAG GTGCACAGTC CTATAGAGGA AAACTCTCGA
CTCGCGTTTG AACAACTGGT CAACGGTTCC ACTGACGATT GGTCCCGAAT GACCGGTATA
CTCGAAACAG ACGAAAGTGA GGTTGCCAAT GCGATTGTCA ATCAACTCGC AAAGTCACCG
GCCAAGCCTT CAATTCACAA TAAGTTTGAA GAGGCGGCCC AATCTTCCCA GCTCAACTCA
GAGGCAGAAT CGAGAGCTCT GTCGCTTCTG GCCGACACCT TTTCTCCGAG GAAACGGACC
AAAGCTAGCC AGGATGATGA ACAAGCGACA CAGTCCGTGA GTTTTTTGGC CGGCGTTTTG
GAACACGCTC AGCATTCTGA GGTATCACAA GGGAACAACA CGAACGATGC CAAGATGAAA
CCTCCCCTGC CTGTGTATTC TCCGTCGAAG GGGTCCGTTG AAAGACGGTC GCTTAGTACT
CCAATTCGAA AAGAGGGGCG GTCCACGATT TACTCTACTT CAGGAACACC GGTTGGTCTT
TCTCCCGGCT TTCGGTCGCC TACGGGGAAT TCTAATTGGA AATTAGGCAG CCCGGTGCTG
GAGTCGATTA CGATGGAAAG CTTTCCGGCA GGGATGGCAT CTCGTATGAC CCATGAAGAG
GGACATGGTC ACAGTATGGA TGGCTACGAT TTAAACAAAA TGTTTGAGCA TTCCATAGAA
GCAGGAGGAG AGGCCATGCA CGTTGCCGCA CATGATGTCA ACTCAAATTT GTCGGGAATG
AAGGCGATTG AAACGGAAGC TTTGGAAGCT ATTTCGGCAT TGAACTCATT GAGTAATTCG
CCGGCAAAGA CTTTTCTGCG CAGAGCCAAT AGTCAAGAGT CCAGCAATGA AACGGGCAGA
AACAGCAACG GGGGACTACG GAGAAGTCTG TTTGCCAACG TTGTGGGGGA TGCCAAGGCG
TCAGCGAAGC AGCGAAAACT AAATCTATAG
 
Protein sequence
MVTRPPSANG TASPSRAGAA TRATANRTVP PLDYDNNNNN NNNNNNKIED CAPRLLSEEE 
KKAAARAYAL SGTSSSNQKS HKRKSLLSTT TTTGTKATPR LPLSLTTSTC ATTTTTYPSR
TTNTIATAST SVSTRVPFAT NPHGYSTQSL PHSEQTSRTV PPDAGSELFQ DMDEYQRFVK
SLELPLDNDL SLPLFPYLQD DEDDEEEEFQ LDLEDDDDDD EEEEEEEEDG GDDDDNNNNK
EQVQEQHKWQ QQQPHPDPTS QPHSRAWLSP IHPAVPTQVT ATTATTTNVD PHTLDDLQPW
PEDPDTAGDA LNGFYRELEE ELGWLEEEDM EAAVATLLDH PIPKGPADDL PKLAGPYPDD
SATDNSVGNG SREGEPGSSA PTQRGTAKVK SAASEPALVH TPLRDAARAA RTVVTDAQHE
HLKQLMNRHY QVLTQQAVLS VRAAHYNRLH RGSRERHEVV TGGESGDDLV EILDAAVGML
QDLDQNRKDA IRHYVQFRQT EQRGKHSSLD AKHTQRHARR SLFSAMEQEQ GLIEGSAPRD
NPYHHPNEEQ ATVVARRLTR AQFTKTLQEQ SQGETRTVFD VPGLSKLGDT FKVIDKSVEG
VGRRNILELD SPTEACQLVL DEAGAEYDTN AVPSARDVSL NFVDAKEYFG PSFQPPSTPQ
QEMVLRRNRN LFTAGEDNLV LRGVNLYGEK QWILIADRFL PDRSINIISQ RYSKLCMMIY
RANGIRIDRN GDLEQPPKLE SVDDLDEEAV DRVKPVPPPA VLNVHRWSME EDLTLLRAVP
LFGHMWAELG ARLIPHRDRG HLRKRYQVLE RRVKATVLRA NKHDNLKVPT WTASPAKPVR
SSGDLQPYNS RASGIPPPPP VPSRTTAPSK KRTANQSVNH AAAILASARS APAANVPQQV
HSPIEENSRL AFEQLVNGST DDWSRMTGIL ETDESEVANA IVNQLAKSPA KPSIHNKFEE
AAQSSQLNSE AESRALSLLA DTFSPRKRTK ASQDDEQATQ SVSFLAGVLE HAQHSEVSQG
NNTNDAKMKP PLPVYSPSKG SVERRSLSTP IRKEGRSTIY STSGTPVGLS PGFRSPTGNS
NWKLGSPVLE SITMESFPAG MASRMTHEEG HGHSMDGYDL NKMFEHSIEA GGEAMHVAAH
DVNSNLSGMK AIETEALEAI SALNSLSNSP AKTFLRRANS QESSNETGRN SNGGLRRSLF
ANVVGDAKAS AKQRKLNL