Gene PHATRDRAFT_47888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47888 
Symbol 
ID7203102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp356867 
End bp360200 
Gene Length3334 bp 
Protein Length854 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182206 
Protein GI219123801 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.554333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCGCG CGACAGTAGA TATTGCTTTG CTCTCCGCAC AGTTTCTCTC TAACGTTCAG 
GACGGCCGTC TCAACATGAA GGAACAAGCA AGCGCTTGTG ACGCTTTCAC CATTAGAAGC
CGGGTGCCCG TTTCCTTGCA TGGGTGGCAG CGTTGCAGCG GTTGGATTCT TTGTGTTTTG
TTGGCCATTA TTGCACCCGC CGCTGGTGTA CCAGACGATG CTCTATCAGT TCACTCGGAA
GTATCCTCTA GTCGGAGCAA ATTAGATGTT GCGAACTCTT TTATCGACAA CATCCCTTCT
TCAGCACCGG CAGTTGTGGC ATCATCGACA CCCAGCTCAT CCTTTGAGCC TACCACGTCA
ACGCAACCCT CATTCACGCC GACAGTGTCC TCCCAACCTA CATTAACGGC ACGACCGACC
CAAATTTCAA CACTCAGTCC CACTGCACTC ACCTCAATGA ACGGACCTAT CTTCGCAGAT
CCCGCCGGAA CTCCGACTGC CGCTATTGCA ACACCAGTGT CAATGAGCCC GTTCATAGAG
ATTCCGCAAG AGGAAGAAAT GCCAACCGGA AAGCCGTCCC CGATTCTGCA ACTGAATCCG
ACAAGTACAC CCGCTGCGCC TACAATTTCT CCAACCAAAG GACCAACTAC CGCACCAACT
CTTGATACAA CGACTATGCC TACTTTGTTT CCACCTGCGA CAGAAAACTT AACGGTAATA
ACAACGAGCT CTCCGTCAGC AGTTCCATCC ATTAATCCCT CATTGGTACC ACCGCCTACT
GCAACATTGA ATGCGTCTGA CGCTCCTTCT GTCGCTGCTA GCACACGGCC TCCTTCCAAC
CCTACGAATG GTGCTTCGGG CGCACCCACG ATGACCCTCG CCCCTGCAGT AGTTCCTTCC
GTATCGCCTA CCCAATCGCC ACTGTCAACG ACAAGCGCAA TGCCCTCGGC GATCCCAAAT
TCCATGGTGT CAGCGACTCC AAGCACCACG CCTACTTCCA CAACAAGCTC GACGCCCACT
TCGACTCCAA ATTCCATGCC TTCGGACATC CCCTCATTCT CACCCACTCC AAATCCCACA
GTGGGTCCTA GTGATCCACC AACCTCCAAA CCTTCTCCGA TGCCATCGCC AATCAGTTCG
AATGGTCCAA GTCTCAGTAC TTCGGCCTCG CCGTCTACTC GTCCTTCAAG CACTCCTACG
GAGCAACCCA CTCATGTTGC TTCAGATGAG CCCTCACGAG TAGCTTCGGA AACGCCATCA
CTGAGCCCAA GCCATATTCC CACAGCCATT CCTTCACTAG GTCCAAGCAA TGCGCCTAGT
TCTAACCCCT CCCTGGCTCC GTCTGTTTCC ATGCAGCCCA GTACTAGTTC AGCGCCATCG
ATGGCACCGA CACCGGTGTG GGAGTTCGAA AAGTTCGTTG GCGTTCTGTC TGGAGTTCAG
CTTATCTACG GTATGAACGG TCGTCGTATG GATGAAAGCC AGCCAGAATT GACCGATGAA
GTCTGCATTG AATTGTGGCG GGATTACGTC GAGTCAAGTA TAGCGCGAGA AGTTGAGAAT
TTGGTTCAGA CCATTGAATT TCTCGAGGTT ACGGTGTCGA ACGAAACACA AGAAGTGCTT
ATTGATTCAC AGCGTGTTGC GTATGTCTTT GATACTACCG TAGAGCTACG ATCACCAATC
AGTGAACACA ATTTGAATCG TTTTGTGGCG GGTGCTTTCA ACACTGAAGA AGAACGGCTT
ACCATGGTAG AGTATCTACG GAACACGACG TGCCCTCAAT TTGCGTACAT AACGACAGCC
GACTTAGTCA TGCCTCCGAA CAAGGCGTTA CTTCCCGAAG ATGGCAGCGA TGAAAGCTCC
GGGGCTGGAT TGATCGCTGG TCTGGCTATT GCAACAGTGG CGGCCGTTAT TTTGACAAGT
CTGTTTCTCT TTCTTCGCCA TAAGAAGAAA AATAATCCGT CGGTAGCTGA AGAAGAAGAA
GTCATTCTTC CAATACCTGA GCACCCGACT AGTCACAACC CTGACGAGTA TGCTTCTGAG
ATTGATGTAG AAGGTGGAAC CGATATTAGT ACTTTGGGAG ATCCAATGCC GCAAGCCATG
TACCCAATGC TATCAGGCGA CATGTCTATA ACTGATTCGG CAACAATGGA GTATGACTTT
GACAAGGCTT ACCATAGCCC ACCTTCTGTG TCAGAAGCTT CGGAGACAGC CTCGAGATTT
GATACTCTAC AGTCGAGTAA TCCTGTATTG AGCAACGATG GCATTTCGAA TACACGAAAT
GAGTTCGAGA CGGTAGTAGC AGTACTGGCT CCTGCCGGGC GCCTAGGGCT CATCCTTGAA
TCCAGCAAAG ATGGGACTCC TATTGTGAAT AGTGTCAATC CGGGAAGTGC TTTGGAGGAT
GAGGTCGAGC CGGGCGATCG TCTCTTGTCC GTGGATGGTC TTGACGTGAC GGTACTCTTG
CCCAGTGAAG TGTCAAAACT GATTGCCGAG AAACGAGATG AAGACGTTCG TAACCTTGTT
TTTGCCCGCT CTATGGCAAG ACCGAACAAC TTAGTGGATG GTTAAAGTCT TTTTCAGGTT
GCAGATTTAG ATTGTTCGTG TTTATGCGCG GTATACTTGT TGGAAGTAAA GCCGAGTATT
AATTCTAGAG AAATCGATAT TAATGAAAAT GCCTGTAGCT ATGCTCAAGT CCCACCATTG
AATATGGGTT GTCATCCTAG GGAGCAACCT GATGATCTAG CAGTGGTTTG TGCTTCATAC
GGGGCTTATG CTTTTTCTGT GGGCGACCCT CTACGGGCTT TGAATCAATA CCCACTGCAG
GCTTGTCAGG TTGTTCTTTC TTCGCCTCTT GTTTTGATCT CAGATTCTCG GGAGGGCCAT
CAAGAATCTT GGCTCCGATG ATATGTACAG GCTCCTCATA GAACCATTGC CACTCAGGAT
CGTCGCGCAC AGTAGGAGAT GCAATGACCT CTTGTAAAAC ATCAAAACCG TCCACGACCT
TGGCAAAACA CGGATCAGCG AATTCGTTCA GGTCATGCTG ATTTTGACCA CCAGGTCCAT
GCGATTTAGT ATTATCCACC TTGTTAATAT ACCAATCGGG TCCGCCTGGT CGTCCCGTAT
ATCCCAACGT CCACGGGACG TGCGGAAAAT CTTCCGAATA TTCCGGAAAC GCCAAGCTTT
CGAGATTCAA CGATCGAAAC TTTCGAAGCG CGAACTCTCG TTCGTCTTCC GCCTCGTCGT
ACCCTTCTTC ATCGTCTTCC GCCTGCGGAC CACCTTGGAG GACGTGCGGT CCATTGAGAT
AAAACCATGC CGAGCTCCAT AGACCGTGTG CAAC
 
Protein sequence
MVRATVDIAL LSAQFLSNVQ DGRLNMKEQA SACDAFTIRS RVPVSLHGWQ RCSGWILCVL 
LAIIAPAAGV PDDALSVHSE VSSSRSKLDV ANSFIDNIPS SAPAVVASST PSSSFEPTTS
TQPSFTPTVS SQPTLTARPT QISTLSPTAL TSMNGPIFAD PAGTPTAAIA TPVSMSPFIE
IPQEEEMPTG KPSPILQLNP TSTPAAPTIS PTKGPTTAPT LDTTTMPTLF PPATENLTVI
TTSSPSAVPS INPSLVPPPT ATLNASDAPS VAASTRPPSN PTNGASGAPT MTLAPAVVPS
VSPTQSPLST TSAMPSAIPN SMVSATPSTT PTSTTSSTPT STPNSMPSDI PSFSPTPNPT
VGPSDPPTSK PSPMPSPISS NGPSLSTSAS PSTRPSSTPT EQPTHVASDE PSRVASETPS
LSPSHIPTAI PSLGPSNAPS SNPSLAPSVS MQPSTSSAPS MAPTPVWEFE KFVGVLSGVQ
LIYGMNGRRM DESQPELTDE VCIELWRDYV ESSIAREVEN LVQTIEFLEV TVSNETQEVL
IDSQRVAYVF DTTVELRSPI SEHNLNRFVA GAFNTEEERL TMVEYLRNTT CPQFAYITTA
DLVMPPNKAL LPEDGSDESS GAGLIAGLAI ATVAAVILTS LFLFLRHKKK NNPSVAEEEE
VILPIPEHPT SHNPDEYASE IDVEGGTDIS TLGDPMPQAM YPMLSGDMSI TDSATMEYDF
DKAYHSPPSV SEASETASRF DTLQSSNPVL SNDGISNTRN EFETVVAVLA PAGRLGLILE
SSKDGTPIVN SVNPGSALED EVEPGDRLLS VDGLDVTVLL PSEVSKLIAE KRDEDVRNLV
FARSMARPNN LVDG