Gene PHATRDRAFT_43552 details


Gene Information

Locus tag: PHATRDRAFT_43552
Symbol:
ID: 7197582
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 795123
End bp: 798302
Gene Length: 3180 bp
Protein Length: 971 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002178005
Protein GI: 219112507
COG category:
COG ID:
TIGRFAM ID:
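
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, although the listed gene length is consistent with that reading), and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: three per residue, plus a stop codon."""
    return protein_length_aa * 3 + (3 if include_stop else 0)


# Values copied from the record above.
length = gene_length(795123, 798302)
coding = coding_nucleotides(971)

print(length)           # 3180, matching the "Gene Length" field
print(coding)           # 2916
print(length - coding)  # 264 bp not accounted for by the coding sequence
```

Since the record marks the gene as spliced, a gene span longer than the coding sequence is expected; the record itself does not say how the remaining bases are divided between introns and any other non-coding sequence.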


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.774573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGAGCTGCTC TTGGTGAAAT ACGCAGCCAG ATTCGTAGAG CTGGATTTGA TTCTAGTCTC 
TTGAGCGACC TCACTGTACG CAAGCTACGC GCAGGAAGCA GCAGATCCGT TAACGGCCAC
AGATAGATAC ATCGTTCGCG CAGAGAAATA CGGAACAACT TGTACTCACA TTCTGGACGA
CTGTAACTTG AAATGCTGCA CACCCCGGCG TCTCCATCTG GAAGTGTTGG ATCGTTGAAA
CGAGATAAGG AGGAAACTGA TTTTGATTTT GATCACGACG AGGGCTTCAT CATACCGTTA
CAGATGTATT CTAACACAGG TTCACGCATG CGGAACACCG CTCTGACTCC ACCGTCCCGG
CGCAAGTTTC CCTGGATGTG GACAATGCTT GTGTTCCTAA CATTGATTAG CATAGCCTTT
GCTTTCTGGA GCGAGCTCCT GGAAGCCTTC CCGACTATTA CGATACATTC AAATATCGCC
AATCACGAAA GACAAAGTGG AGTTTTTCCG ATCCAACCCG TTGCGTCCGC TTCACGACGA
GGCTCTGGCG CGCCGGTTGC GGTCGTTGTC GATACGCGGG AATCGGCGGG AGAACAACAA
CAAACAATAA TGACGACTGA AGCGGCTGCA TTTCCACCTC CGAAATTGTC GGGAATTCCT
TTGCACGCCG CCCAGGCACT AGCTTGTCGC CAGTCCGTAA TTAACTTCGT CATCAACGCG
ACGGACGGCA AGGATGAATG CGAAGGATTG AAAAAAGCCT TCGAGCGTAC GTGCAGCAAT
GACGGTTACG AAGACTCCGA CTCTAAAGAT AAAGCAGGGA GACGCAAGCA CCGTAACATG
AGTGAGAAAC TATGGGACAA ATCAATACCC AAAGTAGATC GATGGCACGA GTCGGTGTTC
TACCTTTCGC AAACGCTTCG CCGGATAGGT GACTGGCTTA TGTGGAAAGA ACCTGCCCCC
TTCTTTTTGG CCGAAGACGA GGTTGCCACG GACGAAACTT GGCAAACTGC TCGCTTTCTC
GTAAAAGAAG ATTTAGACCG CGTTGTATAC CGAGATATTC TGCATTCTTG GGCAACGCTG
GATTGTCAGA TCAGGCCTGA GCTTTGCCAA GCAAGAGTAC GCCGAAAACT CGATGAAAGC
ATCGAGTCCT CCTTTGCCAA TGATGGAAAC AAGGAGGTAC AACACTCGAA CCACACACAT
TCTGGTGGTG GATTGTCTCT AGATCTTCCT TTCGCCAGCG GCCATGTTTC GGAAAAGGTC
ATGGGTGAAG CTCTCATGCT TCAACAAGGA GATAAGCTCA TTGAAAAAGC CACCAATCAC
ACGTCGACAA ATGCAGCTAA ATCGGAAGCG GCAGCCTCTT CCAAGGCTGT ATCGGATGCG
TCTGCTGCAG TATCGGCCGT CCTGAACGAT CCTTCGTCTA TTGAAGCCAG GACGTGTTGT
GCGTCCATTC TGAACGTCTA TCATGACAAC TGCAGTACCG ATGTAGACGA TCAAGTCTCA
GACAGCCGGC TTTTTTTTGT CGTGTTTGTC ATGGCTTTAT GTGGAATGGT GAAAAGCCTG
ATTCGTCATT TCAAAATTCT GTGGTTGCCC GAAGCTGCAG GCTGCATCAT TGTCGGAGGT
ATGTTGGCAA GTTTATTTAC GTTTTCCTAT TCATCACGAG CTCACCACAA ATTAACTTCC
TTTTGAACAG TATTGAGTGG ATACGGTATG TTGCTGTTGC CACACCACGA CATTAGCTTT
GATGGAAACT GGTTTTTGCG CATATTGGTA CCACCAATCA TCTTTGAAGC TGCAATCAGC
ATTGACAAGC GAGCTTTCAA CCGCCACATT GTGCCAATTC TGATTTACGC AGTTGCTGGT
ACACTGGTGG CGACTGTTTT GACAGCATCA ATTCTTCATC GAGGCACGAC GATGCTGTCA
GACTGGTGTT ATCCTATTCC TTACGTTGAG GCCCTTGCTT TTGGTGCGCT GATTTCATCT
ATTGATCCAA TTGCTGTCTT GAGTGTGTTA AGTAACATGG GAATGACAGA TACAGATACA
ATATATGTCG TGATTTTTGG GGAGTCGTTA TTGAATGATG GCGTCGCAAT TGTTCTCTTT
CATACGCTTG TGCATTTTCT TGACGAAACA CTTGTGATTG ATCGGGCAGC CGTGATAGCT
GCCGTCATTC ATTTTGTGGT GGTAGCGTTT GGCTCATTTT TAATCGGTGT CGCATCAGGT
ATGCTCTGTA CCGTCTACTA CTGGATTTTC CACGGATGTC AGACTCCGTT GGTGGAGGTG
CTAATGTTTT TTTGTTGGGC ACTCTTGCCC TATTATGTTT GCGACGGTAT TGGCTGGTCC
GGCATTGTCT CTGTAGTAGC TGCCGGGTTC GTGATGGATT TGTACATCGT CGGTGACGAG
CATGGCGAGT CTGAGATCGG AGACACGAGA GAACCTTCGC CGAAAGTCGA GTCGGCTCGA
AAGCGTGGCC AGATCTTCTC GCCTATGGGA CAACTATCGA ATGAAGCCAA GACACATATT
GGCTTTGTCA CGGAAATCAT TTCGACGATG ATGGAGACTG CTATTTTTGC TTATCTGGGC
CTTTTCCTTT TCAGCCATCG TTATCACTGG AACATATGGC ACACCTTGAT CTCGATTACG
GCGTGTTGTC TTAGTCGCGG CATTATGATT CCGTGTCTGA GCTGGGTTGC CAATTTTATT
TTACGTATGC AACAAAATCG GCCGTCTTGT CGAATGCAGC AATCGGCAGG ACGAAAAAGC
CCGCAGTCGG CTGGTGTTGT TATAGATAAA AAGATGCAAC TGGTCTTGTG GTTTGCTGGG
CTACGTGGGG CAATGTCTTT TGCATTAGTC GAGCACATTC CGTTGTACGA TGAAGTCAGT
GGTATCGGAA CACGTCTCAA ACCGGAACTC AAGGCCATGA CTTCTGCGTG CATTATGTTT
ACGGTATTTG TTTTGGGGGG TCGTACCTAT CACATGATGG AATACTTGGG TATTGCACCC
TCCGCCAGCG CACGAAAACA ACAGCAGAAT CCGTCACCAC TTGAGTTGAC GGCACTTATG
ACGTCCAAGA GCTACGAAGA CTCAATGGAA ATTGAGGACG ACTCGAGTCG AACACCGTCA
AGGCCCGGTC ATGTCTTTCG AAGACAACGG CACAAAGAGC CGATGCCCGA AGGTGAATGA
 
Protein sequence
MLHTPASPSG SVGSLKRDKE ETDFDFDHDE GFIIPLQMYS NTGSRMRNTA LTPPSRRKFP 
WMWTMLVFLT LISIAFAFWS ELLEAFPTIT IHSNIANHER QSGVFPIQPV ASASRRGSGA
PVAVVVDTRE SAGEQQQTIM TTEAAAFPPP KLSGIPLHAA QALACRQSVI NFVINATDGK
DECEGLKKAF ERTCSNDGYE DSDSKDKAGR RKHRNMSEKL WDKSIPKVDR WHESVFYLSQ
TLRRIGDWLM WKEPAPFFLA EDEVATDETW QTARFLVKED LDRVVYRDIL HSWATLDCQI
RPELCQARVR RKLDESIESS FANDGNKEVQ HSNHTHSGGG LSLDLPFASG HVSEKVMGEA
LMLQQGDKLI EKATNHTSTN AAKSEAAASS KAVSDASAAV SAVLNDPSSI EARTCCASIL
NVYHDNCSTD VDDQVSDSRL FFVVFVMALC GMVKSLIRHF KILWLPEAAG CIIVGVLSGY
GMLLLPHHDI SFDGNWFLRI LVPPIIFEAA ISIDKRAFNR HIVPILIYAV AGTLVATVLT
ASILHRGTTM LSDWCYPIPY VEALAFGALI SSIDPIAVLS VLSNMGMTDT DTIYVVIFGE
SLLNDGVAIV LFHTLVHFLD ETLVIDRAAV IAAVIHFVVV AFGSFLIGVA SGMLCTVYYW
IFHGCQTPLV EVLMFFCWAL LPYYVCDGIG WSGIVSVVAA GFVMDLYIVG DEHGESEIGD
TREPSPKVES ARKRGQIFSP MGQLSNEAKT HIGFVTEIIS TMMETAIFAY LGLFLFSHRY
HWNIWHTLIS ITACCLSRGI MIPCLSWVAN FILRMQQNRP SCRMQQSAGR KSPQSAGVVI
DKKMQLVLWF AGLRGAMSFA LVEHIPLYDE VSGIGTRLKP ELKAMTSACI MFTVFVLGGR
TYHMMEYLGI APSASARKQQ QNPSPLELTA LMTSKSYEDS MEIEDDSSRT PSRPGHVFRR
QRHKEPMPEG E
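
The sequences above are printed in space-separated groups of ten characters, six groups per line. To use them in other tools, the grouping has to be removed first. The following is a minimal sketch of one way to do that; the helper names `join_sequence` and `gc_fraction` are hypothetical, and the sample input is just the first and last lines of the protein sequence shown above.

```python
def join_sequence(lines):
    """Concatenate grouped sequence lines, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the protein sequence block, as printed in the record.
sample_lines = [
    "MLHTPASPSG SVGSLKRDKE ETDFDFDHDE GFIIPLQMYS NTGSRMRNTA LTPPSRRKFP ",
    "QRHKEPMPEG E",
]

protein_fragment = join_sequence(sample_lines)
print(len(protein_fragment))  # 71 residues: 60 from the first line, 11 from the last
```

Applying `join_sequence` to every line of the gene sequence block and passing the result to `gc_fraction` would give a value to compare against the GC content field in the record.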