Gene PHATRDRAFT_47779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47779 
Symbol 
ID7202941 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp7827 
End bp9963 
Gene Length2137 bp 
Protein Length622 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182142 
Protein GI219123667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGCTTTTTC GAAAAGTCAC GACCGCGATA AACACCACCA CCACCATGAA GACTTCCTTT 
GTCTTTCTTA CTCTCTTTTC CCCTTTGCTC GTGCTTGCGG CTGTAGAGGC GCAGTCTGCT
GACGTCTCGG AACAGCCTGA CAAGTTGGTG AGTATATCTG TACTGTAGGC ACTCAATCCA
AGCCTATATT TGTCCCCTTT TCCGAAGCTT GGCAACAATT TCCTCTTGGT AAAACTCTCA
TCCTATCTCT CTCTCTTGTG CCTCTTCCAT TTCAATAGAA CAATGGTGTC CCATCCATCC
TTGAGGTCAC TGGTCGCCGC CTCAAGAAGG GAGGCTCCAG CTCGTATGGC AAAGGCGGCA
CAGCTTCGTC CAGCGCGGAC AGCAAAGGCA AGGGCGGCAA AGGTGCGTCC GGCTATACGA
AAAGCACCAA GTCCTCCAAG TCTCGCAAAT CGGCAAAGGA TCCCAAATCA GAAAAGCACC
CCAAGGCTAC CAACGCACCT GTACCCGTCG CAGCTCCCGT AGTCGCCCCT GTTCCACCTC
CCACCGTCGC TCCCGTCCCA TCTCCCATTG TTCCTTTTAT GGAGGTTGTG GATGTGGGAT
ACAGCATCTC GGTGGAAGCC AACATGGGGA CTGATATGAT GGAGTTCTTT ACAGAGCTTG
AAACAGGACT TGTTGCCGCC ATGGATATCT TGGCGCCCGA AATTGTCGCG GAGATTCTTG
CCCCCGGAGG ACGATACTTG AAGGCTAGTG TCCTGGCCAT TCACAATCAG CGCCGTCTGA
CTGCCGTGGT GCAGTTGCCG ACCTCCATTG AATCCTTAAT GATGATGGGT AAGTGGAGGT
TTGGATGAAG CATGAGATGT GCCGTTGCTA GACGTGCTAT GGTTCTAATG GTTGTTCTGG
CTTTTCATTT TGTTTGTCAG AGTGCGGCAA TTCTGTGATT GGCGATGATT TACTTTGCTT
TGACATTGTT CATAGCATTG CGCTAGAAAT CAGCGGTGAC ACTGTCCCAG AGGAATTTAC
GGGGCTCTTT GAAGAAGACA TGGACACTGC AATTAAAGAA GATCTGCTAC GAGACAAATT
GCTTGAGACC AATCCTAACT CTACAGTGAT TGTCTTTCCC CCTCCTGTCC AGCCAACTCC
CGAAGTGTCC AGCCCCCCGA CTTTGGCTCC GACTACGAAG CCCGCCGGCG CTCCGCCAGG
AATGCCAGTC AATGACCCCA ATTTGGTGCC GGCTCCATCT CCAACTGCAG CCCCAACTTT
GATGCCGTCT ACGATGCCCT TGGCAGCACC CCCCACTGGA ACCGACTGGA CAATCAGACC
CATTACGGCC GACGATTATT GGCAAAGCGT CACGTACGCG AATAACATGT TTGTCGCAGT
CTCATTCGAC CGGGTCATGA CCAGTCCCGA CGGCATAAAT TGGACGCCTC GTACCGCTGC
GGAAACCGGC GAATGGCGGG ACGTCACTTT TGGAAAAGGA ATGTTTGTTG CAGTGGGATG
GAGTATCGTC ATGTACAGTT CGAATGGCGT GGATTGGACC AGTGCAACCA AGGTGCCCGC
CCAAGAATGG CTTTCCGTCA CCTACGGAAA CGGATTGTTT GTCGCCGTGC CCTATGGTGG
CAATCGTATC ATGACCAGTT CGGACGGCAT GGCTTGGACG AGTCGTACCA GTGCGGCCGA
CGCTTTCTGG CAGGGTGTCG CGTACGGGAA TAATATGTTT GTGGCAGTTT CTCAGAATGA
TGGAACGGTC ATGACCAGTC CAAACGGCAT CAACTGGACG GCTGGTACCG CTGCGGCAGC
GAATAATTGG ATCAGCGTCA CGTACGGCAA AGGGAAGTTT GTCGCAGTGT CTTGTTTTGG
CAGCGGCAAA AGTGTCATGA CAAGTGCAGA TGGCATAACT TGGACTGGCC ACACCGGTGT
GCCAAACACT TGTTGGTGGC GTGTGACGTT TGGTGGCAAC ACGTTCGTTG CCGTGGCGTC
TCGTGGCGAC GGCAGTCGGA TCATGACTAG TTCGGATGGC GAAACCTGGT CCTCGCAGAT
CACTCCCGAA GCGAACACCT GGAGGGGCGT GACTTTTGGC GCAAATACAT TTGTGGCGGT
AGCGGATAGT GGAACCAATC GAGTCATGAC CGGGTAG
 
Protein sequence
MKTSFVFLTL FSPLLVLAAV EAQSADVSEQ PDKLNNGVPS ILEVTGRRLK KGGSSSYGKG 
GTASSSADSK GKGGKGASGY TKSTKSSKSR KSAKDPKSEK HPKATNAPVP VAAPVVAPVP
PPTVAPVPSP IVPFMEVVDV GYSISVEANM GTDMMEFFTE LETGLVAAMD ILAPEIVAEI
LAPGGRYLKA SVLAIHNQRR LTAVVQLPTS IESLMMMECG NSVIGDDLLC FDIVHSIALE
ISGDTVPEEF TGLFEEDMDT AIKEDLLRDK LLETNPNSTV IVFPPPVQPT PEVSSPPTLA
PTTKPAGAPP GMPVNDPNLV PAPSPTAAPT LMPSTMPLAA PPTGTDWTIR PITADDYWQS
VTYANNMFVA VSFDRVMTSP DGINWTPRTA AETGEWRDVT FGKGMFVAVG WSIVMYSSNG
VDWTSATKVP AQEWLSVTYG NGLFVAVPYG GNRIMTSSDG MAWTSRTSAA DAFWQGVAYG
NNMFVAVSQN DGTVMTSPNG INWTAGTAAA ANNWISVTYG KGKFVAVSCF GSGKSVMTSA
DGITWTGHTG VPNTCWWRVT FGGNTFVAVA SRGDGSRIMT SSDGETWSSQ ITPEANTWRG
VTFGANTFVA VADSGTNRVM TG