Gene PHATRDRAFT_41302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41302 
Symbol 
ID7199142 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp68044 
End bp70507 
Gene Length2464 bp 
Protein Length747 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185319 
Protein GI219130327 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT TCCCAAAGGT TACTGTTACC ATAGTAACAT TTGTAATTGC GCTAGCCTGG 
GGTTTCAATG ACGCCGTGGC TCAGACTGTA GCATGCCCTC AAGCAGAGGG TTTGACCGGC
TACACAACAA TTGCATCGAT CAACAATGAC ATGAGGGCCG AGTTAGCACG AATCAGTGAC
GGCGAAAAAC TTCCTGAAGA CAGTTACACC TATACGCTTT GTCCGCAGAC AATATTTGAT
GTCTCAAATG AGCCTTTGCA GCCGATTTTA AGTGATGTTT CTTTCGCGTG TGGCTCAAAC
GGGGAGGCGA ATGACAGTTG CGTTCTTTTC GGAGGTAGCC AGCAAGTCCT CATTGTAGAC
TCACTTTTGA ATTCATATCC TCTAGAACGA ATTACTTTCT CTGGGTTGAC ATTCTCTGGT
TTCAACCAGA ATTCTGAAAG TAGGGGTACT GCGATTGCTG CTTTTGCTTC GAAGCCCACT
TCTGCGATTT TCCGTGACGC TTTGTTTCGT GTACGTTTTT TCCGTTTTGA ATGCCAACTA
CAAATAAGAT GTCCAATCGC TTCTCACAAA TTCAAACACA AATCACAGGA CTTTATGAGC
GATTTCGTCA TACTTCAAAG TACAGGGGAG CCGGGGTCGG AGCCAATGAT GATCGAAATC
AATGACAGTA TCGTCAATGG CGGCACAACA GGAGTCTTCT TTGACAACGA CGGCGGCTTT
CTAAATATCA GAAATATTCA GGTTGAAGGA TTAAATGCTG CCTCTTTCAT TGCCACGGCC
AATGGAGGCG TTTCGCGGCT TAGGGAGTCC TCAATTTCCA AAGGATCTTT GGATTCGATA
ACATACACAA CTAACTCGGC GGAGCAGCAA GTTGCAGATG TGAACATTTT TTCAATGAGC
CGCCTTGCAG ACGCATTTTA TGCGGAACAA GAAGGAAGCG GTTTAGTTGT GAGAAATGTC
AGCTTATTTT CCAACGATCT CAGTCCTATG GAGTGGACTG CAATATCAGC ACAATCCGGA
GCAATCGTGG AAGTAGTAGG TTCAACAATC TCAGGGAATT CAGGTCTATT GTTTGCTTTA
CAAGCTGGTA TTGGCTCCGC TGTCTCTATA ACTGATTCGT CCATAAACCA AAACACGGCA
GCGGTAAGAA TACAGACGAC TCTCGTTTTG TTCACGTCAC TTCTTGCCTT AACCAGTTTC
CTTTCGTAGA GCTCGACAAG CGCTTCCATC TTCGTAATTG GTGGCTCGGC AATTGTCGAA
CGATCTGAGT TCACCCAAAA CTTTGGTTTT TCGGTAAGTT GTTCACAATC AGTGACGAAA
CCTCAAATCA CTATTACGAT GCAACTGATC TTTTGCTTCG TCCATTTAGG GAGAAATTTT
AGCTTTTCTG GGTGGCTCTG TTGAATTGAG TCAATCATGT ATCCAAGACA GTAGGTCGGA
TTTTGTAGCG TTCGCAGATA GCCAATCTAC CGTTTCAGGT GAAAACGCGA TGAACTTTGT
TGGTAGCTAC GAATCCTCGT TTTGTACATC TAGCGGGCCC CGACTCTTTC GCGAAGATGT
CGGGGCTGGC TGCTTCGTCG GCGGCCTATG CACGGGAACT TGTCGGAATA TTGCCGATGC
TTCTGAATGT ATGGCCCGCG CAGCAACCCC AACCTCAAGT CCGACTAACT TCCCCAATTC
GGTGTTACCG ACAGTCACCT CAACAGGTCA ACCCGATACG AGTATTCCCT CGTTTACACC
GGGATCCTTT GTGCCAAATA CCCAAATACC AATTCCAACT GTTGACCTTA CTTTTTCGCC
AACTACAGAT GATACTCTGC TTCCGACAAG GAATAGTGTT CCTGAAAGCA CAAACCTGCC
AACACTAGAA GTCCAACGTC CGACTTTGGC TTCAATTACA ACATCAGTAC CGATTACTCC
GCCGAGCGAA CCAACAGTAA TTCCGACCTC TGCTCCACCG GGTACAACCG TCACGCCAAC
AATGCTGCCC ACAAATGTGC ATGGCCAGAA CGGAACTCCG TCTCCAACAC AGAGATGCCG
ACCAGCAGGA AGCGGGACTA TGGCCAATTC TATCGGAAAG GGCAAGGGGG ACAAAAGCTT
CAAATCGTCC CGAGATTCTT CAAAGAATAT CCGAACACCA GAGTTCGGGA TTGAACAGTG
GGTGGACAAC AAATTATTGG ACGGACGTCA CACATGGTTC AGTACCCAAT CGAAGGAAGA
ACCTTTGTTC AACGAAAGCT TGCCCATTTG CCCTCCGGAA GAGTCTGCAA GGCCAACCGT
TGGTGCCGTA TCCACAAAGT CCGGTAAAGG AAAAAGAGGT GCTTCAGAGA AATCTTCGAA
GAAGGGTAGT ATGAGCGCTA AATCTGGAAA GGCAAAAAGC AGCAGTACCA AGAGTAAAAA
AAGCAGCGAC AGGAGTAGCT TCAGCAGCAA GAAGAAAAAG GGTGGTGTAC GTCGTGAAAT
TTGA
 
Protein sequence
MALFPKVTVT IVTFVIALAW GFNDAVAQTV ACPQAEGLTG YTTIASINND MRAELARISD 
GEKLPEDSYT YTLCPQTIFD VSNEPLQPIL SDVSFACGSN GEANDSCVLF GGSQQVLIVD
SLLNSYPLER ITFSGLTFSG FNQNSESRGT AIAAFASKPT SAIFRDALFR DFMSDFVILQ
STGEPGSEPM MIEINDSIVN GGTTGVFFDN DGGFLNIRNI QVEGLNAASF IATANGGVSR
LRESSISKGS LDSITYTTNS AEQQVADVNI FSMSRLADAF YAEQEGSGLV VRNVSLFSND
LSPMEWTAIS AQSGAIVEVV GSTISGNSGL LFALQAGIGS AVSITDSSIN QNTAASSTSA
SIFVIGGSAI VERSEFTQNF GFSGEILAFL GGSVELSQSC IQDSRSDFVA FADSQSTVSG
ENAMNFVGSY ESSFCTSSGP RLFREDVGAG CFVGGLCTGT CRNIADASEC MARAATPTSS
PTNFPNSVLP TVTSTGQPDT SIPSFTPGSF VPNTQIPIPT VDLTFSPTTD DTLLPTRNSV
PESTNLPTLE VQRPTLASIT TSVPITPPSE PTVIPTSAPP GTTVTPTMLP TNVHGQNGTP
SPTQRCRPAG SGTMANSIGK GKGDKSFKSS RDSSKNIRTP EFGIEQWVDN KLLDGRHTWF
STQSKEEPLF NESLPICPPE ESARPTVGAV STKSGKGKRG ASEKSSKKGS MSAKSGKAKS
SSTKSKKSSD RSSFSSKKKK GGVRREI