Gene PHATRDRAFT_47039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47039 
Symbol 
ID7202122 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp250613 
End bp252596 
Gene Length1984 bp 
Protein Length413 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181337 
Protein GI219121987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.915264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACCACTGA CTGACTGTGA ACTCGCAGCA ACAGGCACGG CCCGCAGAAT CTGTGGGAAC 
CTCACGATCG TATCAGTCAC AAACAAACGC GCACCACCGA TAAAAACGTT CATCGCCTTG
CAGTAGACGC TCATTCTTAT CTTGAAACAA ACCACTTTGC GAAATCAAGA AACGACATTC
AGAACCATCA TCGAGACACA CTGTTCATGG TTGATACCTG TTTCGAAAGC TTTCTTAAGA
ATTCGCGTAC CGCCCGACTC CACGGATTTC ATTCGCAGGC GCCTATAACA GTCTTGATAT
CGAAAGACTG CCATTCAGAC TCTCAAATAT CGTCTCAAGA TTCTTCCTTG CCGACACAGA
TATCGTGTCG GAGACGTGCA AGATTCCATC TTCAACCAGA AAACAACGCG CTATTTGGCA
TTTGGCCTTT GTCGATACCA GACATATAAC AAGAGTTTAG GTATAGCTTC ATTTCACGAC
TTTAACTACA ACCATGGTGG TCCGTGGAGG AACTATAATT AACATTTCCG GTGCTTCGCC
GGTAGATGAT CCCGAGTATC GCTACAAGAT GCCCGCGGTT TTTGGAAAGA TTGAAGGGTC
TGGGAATGGT ATCAAGACAG TGATTCCCAA CATCACTGAA GTGGCTCTTT CTTTGCACCG
GCAGCCCGGC GAGGTCAACA AGTTTTTCGG AACGGAACTT GGCGCGCAGA CGCGTTACAG
TGCCGAAACG GATCGCGCTG TTGTAAATGG CGCGCATATG GACGCCGTTC TTCAGGATTT
GATGCATCGT TACATTGAAC GGTTTGTGCT ATGCCCCAAC TGCAACCTGC CTGAGACGGA
CTACAAAATT AAGAATGATG CTATTTGGCA CAAGTGCGCA GCTTGTGGAG CTAAAGAAAT
GGTGGACATG AGCCACAAGC TCTGCAACTA CATTCTTGCG GAGGACAAGA AAGCCAAGAA
AGACAGCAAG AAAAGTAAAA AGGGTGATAA GGTACGTGCA GTGGAACAGG AAATAGCACC
GTTGAAAATG TCCGTTGTGT TGTTTAACCC TTTTCTCTTA AAATTTACAG GATGACAAGA
AGAAGAAAGA CAAAAAGAAG GACAAAGACA GCGATGACGA GAAAAAGAAG AAAAAGGACA
AAAAGAAGAG CAAGGATAAG AAAAGCAAAG ACAAGAAAGA GAAGAACGGC GACGACGAAG
ACGAAAAGGA TTACATCAAG GAGGCTCTCG AGGGTGGGAA AACGGAAAAT AATGGTCTTT
TGAACAGCGA CGACGAAGAC AGCGTGTCTC TTGCCTCTGA AGCTGGCGTC GATGATCAGG
GTGCCTTGCT GCTTGCTGTC GAAGCTACAA AAAAGTATAT TGCGGAGAAT TCCGATGTTA
GTGATAAAGA GCTTTCTGAG GTCGTGACTA ATCAACAAAT GGCTTCAGCC CTCAAGTCGC
ACGACAAGGT CCATATTATC GTGCGTGCGG CGCTCTCCGC TCAATTCTTC AAAAACAAGG
AAATCGAGAA GTATTCTTCG GCCATCTATA GCATCACGAG TGGCAACAAG ATCATGGAAC
GTCATTTGAT TGCGTCGCTC GAGGCCCTGT GCATCGATAA GCCCAAGAAC TTTCCCGTCA
TGATCAAACA GTTTTATGAC GAGGATGCCC TTGCTGAGGA AACAATTCTG GAATGGGCCG
ACGAAGGTCG CTCAGAGTTT ACCCTACCAG AAGTGGACGA GGATGTTCGA GCTACACTTC
GTGGAGAAGC TGAGCCTGTG GTTGTCTGGT TGCAGGAAGC CGATAGCGAA GACGATTCCA
GTGACGAGGA TTAGGTCTTG CGCTCAGCTT TACGAAGTTG ACGAGTAGTC ATACTTTATT
CTTACCGAAT TTTATCGATG TTGACAGGCA AGACAAGTAG ACCTCATTAC CCAGTGAATT
GCTGCAACGT ATCGCCAGTG AATAATCTAC GAAGACTGAT AAAAATCTAA GAAATTTACT
ATAT
 
Protein sequence
MVVRGGTIIN ISGASPVDDP EYRYKMPAVF GKIEGSGNGI KTVIPNITEV ALSLHRQPGE 
VNKFFGTELG AQTRYSAETD RAVVNGAHMD AVLQDLMHRY IERFVLCPNC NLPETDYKIK
NDAIWHKCAA CGAKEMVDMS HKLCNYILAE DKKAKKDSKK SKKGDKDDKK KKDKKKDKDS
DDEKKKKKDK KKSKDKKSKD KKEKNGDDED EKDYIKEALE GGKTENNGLL NSDDEDSVSL
ASEAGVDDQG ALLLAVEATK KYIAENSDVS DKELSEVVTN QQMASALKSH DKVHIIVRAA
LSAQFFKNKE IEKYSSAIYS ITSGNKIMER HLIASLEALC IDKPKNFPVM IKQFYDEDAL
AEETILEWAD EGRSEFTLPE VDEDVRATLR GEAEPVVVWL QEADSEDDSS DED