Gene PHATRDRAFT_47802 details

Gene Information

Locus tag: PHATRDRAFT_47802
Symbol:
ID: 7203045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 87630
End bp: 89502
Gene Length: 1873 bp
Protein Length: 600 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002182154
Protein GI: 219123693
COG category:
COG ID:
TIGRFAM ID:
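
The coordinate and length fields in this record are consistent with 1-based, inclusive positions. The sketch below reproduces that arithmetic. The variable names are illustrative only. It assumes the listed gene sequence ends with the stop codon, which the Sequence section supports (it ends in TAA) but the record does not state.

```python
# Values copied from the record.
start_bp, end_bp = 87630, 89502
protein_length = 600  # amino acids

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Coding region = one codon per residue plus one stop codon.
cds_length = 3 * (protein_length + 1)

# Assumption: the gene sequence ends with the stop codon, so any
# remaining bases lie upstream of the start codon.
upstream = gene_length - cds_length

print(gene_length, cds_length, upstream)  # 1873 1803 70
```

Under these assumptions the 1873 bp gene sequence holds 70 bases ahead of the start codon, which is why the gene length is not a multiple of three.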


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCGCACACA ATTGACGGCA ACGGCTAGTC TCGACAACGG CATTTCCTCG CCTTCCAATG 
CGCACGCAGA ATGTCGGTGG TTCCTGTCAA CGAAGGCAAG CTTGTGGGAG GCTTGACGGA
AGATCATTTG CGACATCGAC ACGCAATGAA TGACGATCAT GACCGTACCA AGGCAACCGA
AGGACCCGTG CCGTATGGGT TTAAAGTCAA CGACATTGGC ATTGACGACA AGTTAACGTC
GACCGAGCTC GAAATGCAGC CACTGCATTT AAAGGAAAGT GATGGAAACT TGGAAGAGGA
TGAACGAATA CTGTATCCGA TCACCGTCGG TACTACTAGT ATGTACGAGG GCTGGAGAAA
AACCTTGGAC GATTTTCTCT TTCCTCCACA TCTACCCAGA AGTTGTCAAT TACTGCGACC
GGAAAATATT GCCGTGCCAG CGTGCTACCT GCTAGTGGGG CTTCTGCAAG GTCTTTCTTC
ACCACTGATT AATGTGTTTC CGCTGGATCT GGGTGCCACA GAAGCTCAGC AGACGACAAT
TTCGTCGATC CGATCTCTAC CCGCATCCTT CAAGCTCGTC TTTGGGTTTA TGAGCGACAA
TATCCCTATC GCAGGTTACC GAAGAAAACC GTACATGCTG ATGGGATGGC TTCTGGCTAG
TCTATCACTT TTTTCGCTCA TTCTTGGTTC CAATCTGAAC ATTACCCCCC GCAATGCCGG
TTGCTTCGAG TCCCAAGCCA GCGACAGCGA TTCGCCGACG ACACTGCCAG CGGACGCACC
TTCTATACCC TTTTTCTCCG TCGCCCTCTT GGCCTTCGGC ACCGGCTTTT GGCTCGCCGA
TGTCATGGGT GACAGCATTG TCGCCGAAAA AGCCAAACTG GAACCACCGG AAAGCCGCGG
ATCCGTACAG TCCAGCTGCT ACTCGTACCG ATTTTTCGGA ATCATGGTGG CGGCGCCCTT
GTCCACGTAC CTGTACGCCA CGTACGGGCC CCGGGCGGTG CTCCTGCTCC TCGCCACACT
GCCCTTGTGT ATCTTGCCTT TGGTCTACCT GCTCTTTGAA GTGGAGAACG CTCCGGTCAG
CTCGACGGCC GACCAGTGTC GTGAAATTTG GCGGACCGTC TGCAGTCGAG CTGTTTGGCA
GCCCATGGGA TTCGTTTACG TGTACAATCT TATGCAAGTG AGCAACGCTG CGTGGCGAGA
GTTTCTCGTC ACCTCCTTGC GGTTCACATC GTGTCAACTC AATCTGATCC TCATTGTGGC
CTACGTGCTG TTGTACCTTG GGATTCTGGC CTACAAGTAC TACATGATGG ACTGGTCCTG
GCGCAAAGTC TACTTCGTTA CCACTCTACT GAACGGATTC TTCAGTCTAC TCCAAGTCTT
GTTGATTTAC AACATTACCT TGGGTTTGTC CAGTTTTTGG TTCGCCCTCG GCGACGACGC
CTTTGCCGAA TTTATTGGTG GCATTCAGTT CTTACCGACC ACGATTATGA TGGTCCATCT
CTGCCCCACC GGCAGCGAGG GTGCTTCGTA CGCCATGTTT ACGACCGTCA ACAATAGCGC
TCTGACCTTG TCCAGTGCCA TTTCCACCCA ACTGTTGCGC ATTTGGGACG TGTCCCGCAC
GGCCTTGGCG GCGGGGGACT TGTCCGGCAT GGTCCGACTG ACCTACCTCA CGACCGTGGT
CCAAGTGGCA GCGATTGCCT TTGTTTCGTG GCTACCCCAC ACCAAGGAGG ATCTGGTGCA
ATTGAACGAG CAGTCGTCCC GGAGTCGCGT GGGGGGTACC GTATTTTTGG TGGTCACGTT
CGGCTCGATT CTGTACGCCG TGGGAGTGGG TCTGTTGAAC ATTGTGGCAC CAGGATGGAT
GGGAGAATCG TAA
 
Protein sequence
MSVVPVNEGK LVGGLTEDHL RHRHAMNDDH DRTKATEGPV PYGFKVNDIG IDDKLTSTEL 
EMQPLHLKES DGNLEEDERI LYPITVGTTS MYEGWRKTLD DFLFPPHLPR SCQLLRPENI
AVPACYLLVG LLQGLSSPLI NVFPLDLGAT EAQQTTISSI RSLPASFKLV FGFMSDNIPI
AGYRRKPYML MGWLLASLSL FSLILGSNLN ITPRNAGCFE SQASDSDSPT TLPADAPSIP
FFSVALLAFG TGFWLADVMG DSIVAEKAKL EPPESRGSVQ SSCYSYRFFG IMVAAPLSTY
LYATYGPRAV LLLLATLPLC ILPLVYLLFE VENAPVSSTA DQCREIWRTV CSRAVWQPMG
FVYVYNLMQV SNAAWREFLV TSLRFTSCQL NLILIVAYVL LYLGILAYKY YMMDWSWRKV
YFVTTLLNGF FSLLQVLLIY NITLGLSSFW FALGDDAFAE FIGGIQFLPT TIMMVHLCPT
GSEGASYAMF TTVNNSALTL SSAISTQLLR IWDVSRTALA AGDLSGMVRL TYLTTVVQVA
AIAFVSWLPH TKEDLVQLNE QSSRSRVGGT VFLVVTFGSI LYAVGVGLLN IVAPGWMGES
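
The gene and protein sequences above can be cross-checked by translating the start of the coding region. The following is a minimal sketch, not the database's own method. It assumes the standard genetic code, since the record leaves "Translation table" blank, and it assumes the coding region occupies the last 3 × (600 + 1) bases of the 1873 bp gene sequence. `translate` and the other names are hypothetical helpers.

```python
# First 120 nt of the gene sequence listed above (spaces removed).
gene_prefix = (
    "CCCGCACACAATTGACGGCAACGGCTAGTCTCGACAACGGCATTTCCTCGCCTTCCAATG"
    "CGCACGCAGAATGTCGGTGGTTCCTGTCAACGAAGGCAAGCTTGTGGGAGGCTTGACGGA"
)

# Assumption: standard genetic code, built in the usual TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate complete codons of `dna`, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Offset of the start codon, derived from the record's lengths
# (gene length 1873 bp, protein length 600 aa, plus one stop codon).
cds_offset = 1873 - 3 * (600 + 1)

peptide = translate(gene_prefix[cds_offset:])
print(peptide)  # MSVVPVNEGKLVGGLT
```

The offset comes from the recorded lengths instead of a search for the first ATG, because an out-of-frame ATG occurs at bases 58 to 60, ahead of the start codon at base 71.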