Gene PHATRDRAFT_39341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39341 
Symbol 
ID7195058 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp245285 
End bp248973 
Gene Length3689 bp 
Protein Length1167 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183327 
Protein GI219126151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.338277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGGAA ACCCCCCCAA AGCTATGATG GCAGAGTCCA TTTACTTGCC TCCAAAGCCA 
ACCTTGGGTG GTCTTACTTG CCAATCGGCC CGTGAAATGG TTGCCTACAC TGGCGGGAAG
CCCAACGCAA CGTGGACCAA TCTTGCTGAA CCTACCAAAT GGGAAACGCC TAACGTAATT
TGCCCCAACG GAGTCAGCGA CAAATGTAAG AACTGGAACT TTCTTACGCA GGCCCCAACA
AAGTTGTTCT CAAAGTCGGA TACGTCCTAC CTACAAACCA CCTGGATTGC TGACCAGGTC
AAGCATTTCC GCGAGGGCGG CCTCAACTCA GTCATGTACC GCCCTAGTAT TGCGGACCCG
ACCGATATGG TAAATGTCAT GGAACACCAT GAGCAGGTGA CCTTGACCCA TGTTCTTGAA
GTCGAAGAAG ATGTTTGCAG CAAGTGGGAC GCCTACAGTT GTTACAACAA CAAGGCAAGC
TCCAACCGCA TCCTCAACTC AATCAGTGCT GAGCTTAGTG AAGAATTGAG GCTTCGGATG
CTCCCGACGG ACACAGCGGC CGTCTTGTGG ATGAGAGTTA TGAAACTGGT GGTGGACGGC
TCTATTGAGC ATTACAACCG CCAGAAGGAT GCCCTTTGTG CCCTCAGCCC GTTGTCCGAG
CCTGGACAGA ATGTACACAC GTACTCTGGT AAGGTCCGCC TCATTTGCAA CGAGCTCTGG
CATGCCCGCC AGTGGGAATG GCCTTTAATG CTTGTGATTG TCCGCCAGTT GTGTCTTGTT
ACGGTCAAAG CTTTCCAGTC AATGTTCTTA CCAATCAAGA TGGCAATGGA CCGTACCTTG
ACAGAGATCT CTTATCTTGA CCGGACTTTG GCAACTGCGG TGATGGTAAA GAAAGGGTAC
CACTATACCC AATTTCTAAC CCTTGTTGAG GACACTTACA AGTCTCTTCT TGACAACAAG
GACTGGCCGC CAGCAAGCAA TCAAAAGGAG ACCCAAGGTG CCCCAACTGC TTTCTTGGCT
GGGATGTTGG AGGTCCAGCT CAATGCGCTG GTCCAATTGC ATGTCAACAA GACTTTGAAT
GCTGAAAAGA AAGTATTCAA ATGTTTTAAG TGCGGCCAAA CTGGCCACTT CTCTCGCGAT
TGTAAGCAGC CGGCCTCAGA GGGAGACAAG ACTCCGTCCG ACAGCAAGAA GCCTAAGGCA
AAGGAAACCG GATGGCGTGT TGAAGCCCCT GCTGCTGGAG CCACTCAGAC TAAGGTGGTC
AACAGTAAAA CATACCATTG GTGCGCTACT TGCAACAATA AAGACGGCCG CTGGAACCTC
ACTCACGTTA TGGCCTCGCA TACCACTGGA GCCGGTCGCC GGGGAAAATT TGATTCACCG
GCCCCTTTGG GCCTCATGGC TGAAGTCTCC TCTCCTGATG GTGTCCTTCC CAGTGACGGT
CGCTTCTGGT GACTGGAGGG TCGGGGGGAC CCCGATCTTT CTTTCCGTTT GCCGCCAAGT
ATTGCTCTGT CATCTCTGAC GGCCTATGCA CTCCTCACTT TGTTTGATGA GCCCATTCCG
CCGGGAGAAC AGCAGTGCCT CTCAGTTACC GCATTGTGCC GCTCTCCAGC TGTGGCCGGT
GTTGCTGTCT ATGCCGGTAC CAGTGTGGAT GCCAAACCGG ACAGTTTCCC AGTCATTTGG
GACACTGGTG CTTCGCTCTC AATCTCCCAT GAAGCCGGAG ACTTTGTTTC TGAGGTCCGT
CCTCCACCAA CGCCGCTGGT TTTGAAAGGC TTGGCCAAAG GCCTTAACAT TGTGGGTGTG
GGCACTGTTG AATGGTTGGT ATTGTCTCAG GACGGGATGC CTTGTCTGCT GTGTTTGGAT
GCCTATCTTG TCCCATTGGC TGGGCAGCGT CTCTTGTGTC CGCAGTCGTA CATCCAGCAG
CAGCAGCATC TCCTACCCAG TGACCCTGGC AAATTTGTTG TTGATTGTGA TGGTATGTCT
TTGGTTGGTG CCGGAGACAA CACTGTCCGT GTTCCCTTCC AGACCTCCAA TAACTTGCCG
TTGTGCATGG CCTGGTTGCC CTCTGGCTCT CCTTCCCTTG TGGCAGAACT CAATTTGTGT
GTGACCAATG CCCAAAATCA GAATTTGAGT TTGGCCCAAA AGGAGTTGCT TTGTTGGCAT
TATCGCCTTG GGCACTTGCA CTTTGAGTCC ATTTGCCGTC TTCACCGGAC GGGCGCTTTG
TCCCAGAGTG CCAAAGTCCG TGCTCTCCAC CGTTTGGCTG CCAATTGTGA CCTTCCAAAG
TGCGCATCCT GCCAATTCGG CAAGGCTAAA CGTTGTCCCT CTCCTGGTAA GGCCCAGACC
ATTGTCCCGG CACATGATGG CTCCATCAAA AAGGAGCATC TATCTCCTGG ACAACAGGTC
TCTGTTGACC ATTTCATTTG CTCTGCTAAA GGCCGGCTCT CCTCTTCCAA GGGGAAGACA
ACCGATGATC GGATGTTTTC GGGCGGCTGT TTGTTTGTTG ACCATGCTTC TAGCCTGGTT
CATGTTGAGC ATCAAGTTTC TCTCACTTCC CATGAGACCT TGCAGGCTAA GCACCGCTTT
GAAACCATGA CCCGGGACCG TGGCGTGACG CCACAGTCCT ACTTGTCTGA CAATTCAACG
GCCTTTACCA ATGCTGAATT CACTGTTGAG CTCCGCATTT TCCGCCAAGT CCAGCGTTTT
GCCGGTGTTG GCGCTCATCA CCACAATGGT GTGGCTGAGC GCAACATTCA GACCATCATG
GCAATGGCCC GCACCATGAT GTTGCATGCT GCTATTTGTT GGCCTGAAGT TGCTGATCCG
TCTCTCTGGC CCATGGCTGT TGACTATGCC ATATACCTGC ACAACCACCT CCCCACCGTC
TCCGCTGGTC TGGCTCCTAT TGATGTCTTT ACTGGGAGTA AGTGGCCCAT GCACAAGTGT
AATGATCTCC ATGTCTGGGG GTCTCCAACA TAAGTGCTCG ATCCGACGCT TCAGGACAGG
AAGAAGCTGC CTAGATGGAA GCCTTGCTCG AGACGTGCTG TTTTTGTGGG TTTCTCTCCG
AAACATTCCA CAACCATTCC CCTTGTTGTC AATCTTGTCT TGGGGGCCAT TAGCCCGCAG
TTTCATTGTG TTTTTGATGA CTGGTTTTTG ACGGTTTTTT CGGATCCGGA CCGGATTCCT
GATTTTGAAC AGTCTCCATG GACCAATTTG TTCAGCAAGA GCCGTTTCCA GTACCCGTTT
GACGCCGATG ATGGTTCTCC TCCTCCATTG GAAGAGCAAT GGCATGACGA GTTTGCCGAG
CGTACGGCCT CTGCAGCTTG TGAGCTCATT GTTCGCGATG CCCAGGACGA AGCCTTGTCC
GGAGCCAATG CCGCTCCACC GCTTGAACCT GTACCATCCG GCCCCTCTCT GGCCGGAGTG
CCCACTTCTC AGCCACCAGA GCCCCTCAGC GCTCCTCTGC CGACTGTTCC GCGTGTTCGT
TTCAGTCCCG ACGTTGTTGC TCCACTGGCT CAGAGGGAGC CGGACCCCGC TCCTTTGGCT
GTCCCCTTGG TTCCTGCTTC GGAGCCTCCG TCGCCTCAGC AGAGGGAGCA GCCCCCTGCT
CCTCCTATTG TCCGTCGCTC GAGTCGTTCT ACTCTGGGAA AGCCGAGCAA ATGCTTTGCC
AATCTGGAAT GGCACCTGGC CCAGACTGA
 
Protein sequence
MSGNPPKAMM AESIYLPPKP TLGGLTCQSA REMVAYTGGK PNATWTNLAE PTKWETPNVI 
CPNGVSDKCK NWNFLTQAPT KLFSKSDTSY LQTTWIADQV KHFREGGLNS VMYRPSIADP
TDMVNVMEHH EQVTLTHVLE VEEDVCSKWD AYSCYNNKAS SNRILNSISA ELSEELRLRM
LPTDTAAVLW MRVMKLVVDG SIEHYNRQKD ALCALSPLSE PGQNVHTYSG KVRLICNELW
HARQWEWPLM LVIVRQLCLV TVKAFQSMFL PIKMAMDRTL TEISYLDRTL ATAVMVKKGY
HYTQFLTLVE DTYKSLLDNK DWPPASNQKE TQGAPTAFLA GMLEVQLNAL VQLHVNKTLN
AEKKVFKCFK CGQTGHFSRD CKQPASEGDK TPSDSKKPKA KETGWRVEAP AAGATQTKVV
NSKTYHWCAT CNNKDGRWNL THVMASHTTG AGRRGKFDSP APLGLMAEGR GDPDLSFRLP
PSIALSSLTA YALLTLFDEP IPPGEQQCLS VTALCRSPAV AGVAVYAGTS VDAKPDSFPV
IWDTGASLSI SHEAGDFVSE VRPPPTPLVL KGLAKGLNIV GVGTVEWLVL SQDGMPCLLC
LDAYLVPLAG QRLLCPQSYI QQQQHLLPSD PGKFVVDCDG MSLVGAGDNT VRVPFQTSNN
LPLCMAWLPS GSPSLVAELN LCVTNAQNQN LSLAQKELLC WHYRLGHLHF ESICRLHRTG
ALSQSAKVRA LHRLAANCDL PKCASCQFGK AKRCPSPGKA QTIVPAHDGS IKKEHLSPGQ
QVSVDHFICS AKGRLSSSKG KTTDDRMFSG GCLFVDHASS LVHVEHQVSL TSHETLQAKH
RFETMTRDRG VTPQSYLSDN STAFTNAEFT VELRIFRQVQ RFAGVGAHHH NGVAERNIQT
IMAMARTMML HAAICWPEVA DPSLWPMAVD YAIYLHNHLP TVSAGLAPID VFTGMLDPTL
QDRKKLPRWK PCSRRAVFVG FSPKHSTTIP LVVNLVLGAI SPQFHCVFDD WFLTVFSDPD
RIPDFEQSPW TNLFSKSRFQ YPFDADDGSP PPLEEQWHDE FAERTASAAC ELIVRDAQDE
ALSGANAAPP LEPVPSGPSL AGVPTSQPPE PLSAPLPTVP RVRFSPDVVA PLAQREPDPA
PLAVPLSFYS GKAEQMLCQS GMAPGPD