Gene PHATRDRAFT_45342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45342 
Symbol 
ID7200033 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp897792 
End bp902600 
Gene Length4809 bp 
Protein Length1366 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179532 
Protein GI219117475 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTTTC CCTTAATTTC ATGGGCGAAG TACAACGCGT GGACAAAACC ACAATCAAAC 
GCCAGTTAAA TGTTTGTAAA CACTGAGTTC GCAGTCAGCC CTGGTGTTTT GCATCCGAAC
TTATCGAGAC GAAGAAACAG TCTCACGACT TGCAAACAAC TAGAGTTCTG GTTGCCAAAT
ATAGCGCTCT TGGAAGGACT CGAAATCCTA CCGCTTTCCG GCATGCCGTT CTTACTGGTG
CTAAGAAAGA CTTGGATCGC ATTTCGCCAT CTCGTACAGT CCAAACGTCG TCGACGCCAG
CGCTCGTTTT TCTTTTCGGT GGCAATCCTG GGATCACTGG TCCTGTTGCA GACGAAACAT
ATGCACACAG TTTCGACAAC ACCGGAAACA GTTTATTCCC TCAATCATGG ACAGGCGATT
GAAGATTCCG CGCCGTCCTT GACGTGGGAA TTTCACGGCT ATTCACAGTC AAACATACCA
GCACGATCCA CCAGCCCCTC AACTCCACGT CTATTGATTG CCCAGTACGC CTCCGCTTTC
TACACGGTAG TATTGAACGA GACACAACGA GTCAACCAGG CATATGCGGA GCGATTCAAT
CACGATTTTA TCGTTTGTCG AGGCATTTAC CTGACCGACA GTCCTTGGTG GAGACTCGTC
TCACCACCCC TGCACACTAT TGCCGGTTCG CGCTCGACCT ACAACAAAAT TGCCGTCTTG
GCCTACGCGA TGCAACATGA TTACGATCGT GTGCTGATAC TGGATTCCGA CGCCATGGTT
CGCGATTTTA GCATCAATTT GGCGACGTAC TCCCTGACAG ACGACAAGGG AAAGGACGTT
GTGGTGGTAG CCCAGCAAGC CAAAATGGGT GAAGTTCACC CTCCAAATAC TTGGAATGTC
AATATTGGTG TCACACTCTG GAATCTCCGT CACGAACAAG TCGTTACAGT GTGGCAGCAA
TGGCACGATC GCTCCATTGC CCGAATACGA AGTGGCCAAG CGGACGACGA TCAACAACCG
TTGCAGCGCT GTTTTCGCGC TTTTCCCGAC ACTACACGTC CCGTCCTTGC CGTGAAGGAG
TTCGGCTATG GTGGCGGATC CATCGTACAA CACTTCATTC GCGAAAGTTC GTCGTCCTGG
AGTGAACCGA CGGAGGCTCG AACGGAAGGA ATTCGGACCG CAGCCCGAAA AATAGTACCA
AAGTAAAAGC TGACAATTGA TATTGAACAA AATGAAATGC CATGTAGTGC TAGGTAGACT
AAATCAGCGA CTGTGTAGCT TTTCCCAACA CTGTGTTTGA TCTAATATCC AAAGGGGACT
GACAAGAATG ACAGAAACTT CACTGTGAGT GCCATACATT TCGAACGCCG CCTTTCACAT
CAATTCGTAA AAATTGTGAA GTTGTGTATG TGGTGGTTGT GCTTTTCTAT AGGGATTTCC
GGCGATTGGG TCTTCCTTGA AATCCGAACC CGCAAAGGTT GAAAGTTCAA ACAAAACCAT
CTACCGTTCG GATTGTCGTG AACCTCGGCC GTTGGAACGG ATTCGCTCTC GTCTGTGAGA
TTCTTTTGCC TTGCGGCAAA GAACAAATCC CTGACTATTT TCTAAAACAG TCATTTGCTC
CACCAAATGA TGCCTATTGC CCAAGCGATA CTGTAACACC GGAGATAGTC CCTGTTGCAG
ATGAGCCTCG GGACGAGTCC TCGTCCAGTG GGAAACGCAG CTGATCTATG CGTACGTGCG
CATGTTGCTC ATACCGTCCC AGAAACGCAT CCATATTGTC AGAACGCCCA GAAACAGGAT
GCGCTAGGCA GTCATGCCAT TTGGAAAGCC AATGAGACAA AGAAAGATTC TTACACTGTG
GGAAGCCGGC GTACTGCTCC AATGATTCCT ACGACAACCT CTCCCGTATT GATCTATACG
GGCGTTGTGC TTCGCACCCT CTTGCTGGAT GTTCCCCTAC TATTAGCCTT GATTCTATAT
TCTACGACAT CTTGGTTAGA ATATGTAAAG ACCAACTACA TGCTACCGCA GCTCGAACTC
CAGCGCTGGA CCCCGGAACG GGCCGAACAA GAAGTGACCT ATTTTCATCG TCGGTGTGAC
GAATCCGATC AGTCCGCGCA CGATACGGAA CCTCTCGTCA TAGACTACAG CAGCATGTCC
AAGCGAGACA AAATGGAACA CATGCTGACG CACGGTGTTT CAGTCTATCC TAACCTTTTG
TCGGCGGAAA CGGCCAACGA AGTGCGCGAT TTCATTCTCG CGCAGAATCT TAAAAACGAA
GACATGATTG ACGTGATTGA AAACACCAAT CGCTGGAGCT TTGGTGTAAG AGTCGACCAA
CATCCCAGTG TATCCAAGGC TCTGAAGGAA GTTCTCAATA AGCCCGAGCT GGTTGAAGGT
CTCGAAGCCA TTCTTGGTAA GAATCCCGCC ATTATTGAGT TTACCGGGAT TACCAGTGCC
TATGGCGCTA CCGCACAACG CTGGCACCAG GACGTGGTAC CCGAAGGAAG CGCCGCCAAA
TACAGTCGTA GTTTTGTCCC GTCCTACAGT CTCTTTATCC CACTGCAAAA CACTACCAAG
GCGATTGGTG CCACCGATAT TTGCCCGGGT ACACACATGT GCGCTGCCGG ACCGATCCAC
TTTTGTGAGT ACTCGGGATT TCCCGTATCC GGGGCGGCCG ATAACTGGCC ATTGGGATGG
GGAGCACTGG TCAATCAACA AACGACCCAC CGTGGAGCCC CCCATGTCGA CCCGCACGGG
CCCAGCCGCG TTTTGTTTAT TCTCACCTTT GCACCACGGC CCCAGTTCAC ACCGTCCAAG
CTGGAAACGC GAATGATTTC GACCGGCGGT TCGTACTCCT TACACTGGTC ACAATGGGGA
CACACGTTGA GGGATTTCCA GGACCCGGAC ACTCGCATGA AACATCCCTG GCGAGCTCTC
CGTGCGCTGG GTTTGTACAA ACCGCGAGAT GCACAGTGGG GATGGGACTA TTTATCCCAA
GCTTCGGGAC GCGTGGCCAA CGATGAAGAG GGTTTTCATC GTGAGAGTCT GGACGGGCAG
CTATCCAAGG GCGGACTCAC GTTTCTGCCG GACTGGCTGC AGGGGCACGC GTCAACTGAC
GAGGAAACCA GTTATGCCTG GGTGGAATTC ATGGAAGATA CGCTACGCTT ATGCAGTCAC
ACCACTCAGA AATTCTATCT TGCCGTTGTA TTCGGGTACG GTTCATTTGT AGTTATCTGG
AACGGATTTT TGTTCGCCGC GGGACGTAGA CATTTTCGAG TGAAGGCTAT TGGTCGAAGC
ATGCTGCGGG TCCTATTGCT ACACGCTGTA ATATTGTCGA TCGAGGAGTT TGCGCGACGG
CGACTTGCTG TTACGGATTG GGCCAAAAGC ATTCGTGGCA GTCGTCTCTA TCGGTTGCCC
AGTCCAGATC AAAATCTTCC TCTTCCAGGG ACGCTTCTTC TCTTGGAAGA TGTTTTGATC
CTTGACCAAT TTCAGTCGGA ACACTTGGGG TCGTATGATC GAATACTGGA CTTCGCGCAT
CCTGGAAATC GACGGTTCAA CACCATGATC TTGCAGCACT CCAAAGGATA TACCGCTCTA
CCATTTTCGC TTAAACGGAG TTTGCGTGCG GATGTTCTGC TATGGAACAA GCAAGACGGA
AGCCGTATTC TAGCAAAGAA CGTAGATGGA GCTTGGGCCG AGGTTGCCCA AGAAACTGCC
GAAAAAGCAT GTCATAAGAA ATTGACAAGA GCATCAAATT CCGGTGTTGA ACATGTATCC
CGACAGCTGG ACTACTTGAA GGCGGAAAAT ATATATGGCT TTTGGCGGCA CACATCTATG
TACCTACGTC ACAATCCTGT TCTGCTTGAC CGCCTCGAAC GAAAGCTGCT AGGATGGAAC
GAGACTAGCA AGAATTCGTC AGCCGGTCTA TCGTCCTCAA ACGGCTCTCT CCTCGTACGA
CCATTCTTTC GCGGACATTC GATTCCATTA CTCACTCAAA AATCTCTTCG ATCAGTTCGG
CAAGTTCTTC CTCCCAGGCC ATCTACTTCT GAGCCGTACG CTGGAGCGTG GATGCAAGAG
GGGGATGTCG TGGAAGGTCG CTATCATGGT AATTTTCCAG GTACGTCGTA ATTGTGAGAA
GGAAGCACAC GTCCAGACTG GTAAGTAAAA CTCACAGCCA GCATTTTCAT TGTTCTCCTA
GAATGGTATC GTGGACGCAT AGTGTCCACG AGTGCTGACA AAGATGTATG GGACGTTGAA
TACGACGATG GCGACGAAGA TGTTGGACTC TGTCGTAACT GCGTACGACC GTTTGTTCCG
TACGCCCTGA ACGACGACGT AGAGTGGAGA GACGAAGAAG ACATATTTCA TCGTGCTCGT
GTAGTCAAAA TTCAGTCGGG TGATGTGTAC GATCTCAAGT TTGAAGACGG CAGCATACGT
AGTAACGCGT CAGCTACCGA TCTACGCCGC GTCCCGTTGT TGGGGGAAAT AGAGGTAGGA
TCTCGCGTTG AATTTCTGGT TGATGAAGGC TACAATACGG GCACAATATT GCATGTGAAT
GTGGATGGGT CCTACAACAT TGAGTTTGAC GACGGCGACT TCGCCACCAA CGTTGCACCA
AAACACGTAA TTCCCGAATA GGAGAGGAGA GATACCCGAG CACAAAAGGT AATGAACTAC
GAAAGCTGGA AAGCAGTAAA GAATTGAAAG CAAAGTTAAC ACGAAACGAC GTGAGACGCG
TTGCGATAGT CTATCAAGTT TTCAAGGCTA AATACTGTTA ATTTCAAAAA AGTATCGTTC
CGACACTTC
 
Protein sequence
MSFPLISWAK YNAWTKPQSN AISPGVLHPN LSRRRNSLTT CKQLEFWLPN IALLEGLEIL 
PLSGMPFLLV LRKTWIAFRH LVQSKRRRRQ RSFFFSVAIL GSLVLLQTKH MHTVSTTPET
VYSLNHGQAI EDSAPSLTWE FHGYSQSNIP ARSTSPSTPR LLIAQYASAF YTVVLNETQR
VNQAYAERFN HDFIVCRGIY LTDSPWWRLV SPPLHTIAGS RSTYNKIAVL AYAMQHDYDR
VLILDSDAMV RDFSINLATY SLTDDKGKDV VVVAQQAKMG EVHPPNTWNV NIGVTLWNLR
HEQVVTVWQQ WHDRSIARIR SGQADDDQQP LQRCFRAFPD TTRPVLAVKE FGYGGGSIVQ
HFIRESSSSW SEPTEARTEG IRTAARKIGF PAIGSSLKSE PAKSLLQMSL GTSPRPVGNA
ADLCVRAHVA HTVPETHPYC QNAQKQDALG SHAIWKANET KKDSYTVGSR RTAPMIPTTT
SPVLIYTGVV LRTLLLDVPL LLALILYSTT SWLEYVKTNY MLPQLELQRW TPERAEQEVT
YFHRRCDESD QSAHDTEPLV IDYSSMSKRD KMEHMLTHGV SVYPNLLSAE TANEVRDFIL
AQNLKNEDMI DVIENTNRWS FGVRVDQHPS VSKALKEVLN KPELVEGLEA ILGKNPAIIE
FTGITSAYGA TAQRWHQDVV PEGSAAKYSR SFVPSYSLFI PLQNTTKAIG ATDICPGTHM
CAAGPIHFCE YSGFPVSGAA DNWPLGWGAL VNQQTTHRGA PHVDPHGPSR VLFILTFAPR
PQFTPSKLET RMISTGGSYS LHWSQWGHTL RDFQDPDTRM KHPWRALRAL GLYKPRDAQW
GWDYLSQASG RVANDEEGFH RESLDGQLSK GGLTFLPDWL QGHASTDEET SYAWVEFMED
TLRLCSHTTQ KFYLAVVFGY GSFVVIWNGF LFAAGRRHFR VKAIGRSMLR VLLLHAVILS
IEEFARRRLA VTDWAKSIRG SRLYRLPSPD QNLPLPGTLL LLEDVLILDQ FQSEHLGSYD
RILDFAHPGN RRFNTMILQH SKGYTALPFS LKRSLRADVL LWNKQDGSRI LAKNVDGAWA
EVAQETAEKA CHKKLTRASN SGVEHVSRQL DYLKAENIYG FWRHTSMYLR HNPVLLDRLE
RKLLGWNETS KNSSAGLSSS NGSLLVRPFF RGHSIPLLTQ KSLRSVRQVL PPRPSTSEPY
AGAWMQEGDV VEGRYHGNFP EWYRGRIVST SADKDVWDVE YDDGDEDVGL CRNCVRPFVP
YALNDDVEWR DEEDIFHRAR VVKIQSGDVY DLKFEDGSIR SNASATDLRR VPLLGEIEVG
SRVEFLVDEG YNTGTILHVN VDGSYNIEFD DGDFATNVAP KHVIPE