Gene PHATRDRAFT_48479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48479 
Symbol 
ID7203764 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp619377 
End bp622349 
Gene Length2973 bp 
Protein Length976 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182995 
Protein GI219125451 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGAGA GTGGTACGAT GATCGACAGT GCGGGCAACC CTCCCCCCTC TTTCGTGGCA 
AAACCCTTCC CCTATAAAAC ACTTCCGGGT ATCCCGAACG CACAATCCCC ACCGGAACGC
GATCCCGTCA CGAAACCATG CGATTGTTCC GTTACCGTGA GTCGCACGGG GCAGGGGGCA
CCCAACTCAT TCCGCCGGTG GAGTAGAGGA CCCGCTACTC GTGACCATCA CATTTGTTTA
TTGATGGATG GATGGATAAG CGGAGCGAGT GCGACAACAA TGACGACGAG CGAAAAGCAA
CGGCGGGAGG AGCCGCCAGT GTCCCGACGT CTCCGACGGG ACACCAGCAA CCATAGAATC
AAATCCACAA TTTACAATCT CGACGCACAA GACGATGAAC TGGAGCAAGC GTTGGAAGCC
TACAATGACG ACACTACACG AACGAGTACT AACGATCGCT GTCTCCAAAC CAAAGGAGTC
AAACGAACAA TGCCGGAACG CCAAAGTGGA CCGACGGTCC AGAGCGCATA CGAAGAGGAA
GTGGAGCATA GTAGTACCAA CAATAGGCCA AACAAATCCA TGGGGAGGCG ACCACGACAA
CCATCAACCA AGCGCAAGAA ACGAACGCCG ACAAAGGGCA TCACAAAAAG AATCCCTGTC
GACGATCAAT CGGGAGATGA GGAAGAGACC GATCAAAAGA CACCAGCTCC GAAACGGCGA
CAACCTCGAC GTTCAACCCG AACAATTTCC ACTATCCGAC CGAAACCATC CTTATCCGAA
GTAGAATCGA GTAACGAAAG CCACCAAGCC GAGGACGAGG ATGAGGACGA TAATGAGTCA
GTTATTATTG CACCCCTTTC CGCCATTACC AAAACGCTGA AGTGTCCTCA TTGCCCGAAA
ACATTTGGTA CCGATGGTGG TCTACGTTAT CACGTCGCCA ACTTTGTTTG CCAACCTGAT
TCACGTCCAG GAGGTCCCGT CGTTAGGGGT CGCCGTGGCA AGAGTGCCTC AGGCGGGATG
GATGGTTCAT CCAAGCGTAA ATTTCGTAGA ATTCGGGGTG CCGCGAAAGA TCGTACCTGC
CCAGACTGCC ATCGAGTCTT TACCAGCGTT TTGGGCATGA CCTATCACCG CGAAAAGGCT
GTTTGTCACC GTAAAGGCCA AAAGGACGCG GGAACAGAGT CATCGGCCCT GCCTTTCGGT
ACTTTGGAAG CAGGTTCAAA GTTTGTTACC AACTGGGGTG TCGTGCAAGT CATTCGGGAT
GATCGTGCTA CACCCGTAGC TGAGCCTTTG CAGAAACCCA AGGACCTAGC CCGTTCCTTT
CAGGCGCATA AAAGTAGTCG CGAAAATCAA CTGGAAAAAC AATACGCCAC CCTTGCAGTC
TTGTCCCTGA CACGAAGAAA GCAGTTGCTG GAGGAGTACA AGACGAAGGC CGATTCGAAT
ATTACCCCAC AGTCCGTATG GATGGCCTAC TTTGGTACAC GGGAAGACCC GCGGGAGATC
CCAAAATCTC GACAAGCGGC TCCTTTTAGA CTGGGACAAT TGCGAGACGA CCCCTTGGCT
CCGGAGAATG CCTTCGCGGA CCGCATTGTG GAATGCATTG CTATCGCCGA TGACCGGAGA
CGATTCGTAG GTCTCTATGA CGATTCGGAA ACGACAGGGT CGTCATCTAT GGGGAGATAC
CCGACCAAAC TCTTTTTAAG TCGTCGGCTT CTCACCGAAT CATACAACCC AAGTGGTTCC
ATACACATGT GTCCGTCCTG TGGGCGGTCG TTTGGTTCCA AACCTGGTTG TAAGTACCAT
TTGGTCTCCA AAGTATGTAC TAGTAAATCA GATGCGCAGG GAGAGCTGAG ACAGAAACGA
CTTGGTGATA TTGAAGACAG GTCTCTCCGA CTTTTGGCAA AAGGGGCCGG ACCAGAGCGT
CGTAAGTATC GTCCGCCACA GCCTGTGCAC CGTGACATCC CCGCAATGGA CAGTGTAACG
CCACACAAGC GACTTGACAG CGTTTCCGAC GATGATGACA CCAATAAATT GGCCGCACAA
TTACAAGAGA AGGAGCAAAA GACCTACGAC ACCAGGAAAG AAGAAAATGT ACCTTCACCT
GATGAATGCA TCAAAGAGCT ATTTCAACAG CTTCGCTTTG AGCAGTCCAA GCAACTTGGC
CCCATGTACA CTGATGTCTT TCGAGTGCTC AAGTTTAAGC GTTATGTTTC CAGACCTGCG
AAGAAACGAA AGAAACGAAA GATTGTTGTA AAAAAAATCA AGGTAGTCAA GAAAGTAAAA
AGATCAAAGG CCTCGAAAGA GAGCACCACC TTGGACGGGA CTTCAAAGAA ATCAAAGAAG
ACCGAGGTGG TATCGACAAG AACAACATAC CCTTCACTCG TTCTACCGCC ACCACCTCTG
CCAACACAAT TCATTCAAAC TCAAAGCCAT ATACCGATCC CTCCTATTAT CGATACACGA
GTTCTGGTTG GCGAAGTGGA CGCCGGAAGG TACCCGAGTA TAAAACGGGA CCCCGCTCGC
ACAAATCAAG ATATTTGTTC CATCTGCAAG AGAGGCAACC GCTTAGTAGC ATGCGACTTT
TGTCCTTTAT CAGTCCACTT TCGTTGTGTA CGCACAAAAT ACTTGCTTAA AGATCCTGAA
CCGGAAGACG ACTTCATGTG CAATACCTGT ATCCAGTACA TTTGGCACCG TCGTGCTCGG
GCTGAGAAGC GAAGAATTCA GAAACTGGGT GAAGACAAAG TGCAAACTGA CCAAACGGCT
GCGGAATCGG TGGCTCGACT TACAAAAGGC GCAGTGGAAG GCGAAGAATA CGAGTGTGTC
GCATCCCAGG CCCGCCGTCT AGCCGATCTT TCGGAGCTAC TGATGGAAGC CAAGGTCCGT
CTAAAGCAAA ACATGGCGAT GGCTAAAGTC AATGACATGC GGAGAGCTAT GATAAGTGGT
CAAGTAGTTT CCAAAGGTTC CACTTCAATA TGA
 
Protein sequence
MRESGTMIDS AGNPPPSFVA KPFPYKTLPG IPNAQSPPER DPVTKPCDCS VTVSRTGQGA 
PNSFRRWSRG PATRDHHICL LMDGWISGAS ATTMTTSEKQ RREEPPVSRR LRRDTSNHRI
KSTIYNLDAQ DDELEQALEA YNDDTTRTST NDRCLQTKGV KRTMPERQSG PTVQSAYEEE
VEHSSTNNRP NKSMGRRPRQ PSTKRKKRTP TKGITKRIPV DDQSGDEEET DQKTPAPKRR
QPRRSTRTIS TIRPKPSLSE VESSNESHQA EDEDEDDNES VIIAPLSAIT KTLKCPHCPK
TFGTDGGLRY HVANFVCQPD SRPGGPVVRG RRGKSASGGM DGSSKRKFRR IRGAAKDRTC
PDCHRVFTSV LGMTYHREKA VCHRKGQKDA GTESSALPFG TLEAGSKFVT NWGVVQVIRD
DRATPVAEPL QKPKDLARSF QAHKSSRENQ LEKQYATLAV LSLTRRKQLL EEYKTKADSN
ITPQSVWMAY FGTREDPREI PKSRQAAPFR LGQLRDDPLA PENAFADRIV ECIAIADDRR
RFVGLYDDSE TTGSSSMGRY PTKLFLSRRL LTESYNPSGS IHMCPSCGRS FGSKPGYAQG
ELRQKRLGDI EDRSLRLLAK GAGPERRKYR PPQPVHRDIP AMDSVTPHKR LDSVSDDDDT
NKLAAQLQEK EQKTYDTRKE ENVPSPDECI KELFQQLRFE QSKQLGPMYT DVFRVLKFKR
YVSRPAKKRK KRKIVVKKIK VVKKVKRSKA SKESTTLDGT SKKSKKTEVV STRTTYPSLV
LPPPPLPTQF IQTQSHIPIP PIIDTRVLVG EVDAGRYPSI KRDPARTNQD ICSICKRGNR
LVACDFCPLS VHFRCVRTKY LLKDPEPEDD FMCNTCIQYI WHRRARAEKR RIQKLGEDKV
QTDQTAAESV ARLTKGAVEG EEYECVASQA RRLADLSELL MEAKVRLKQN MAMAKVNDMR
RAMISGQVVS KGSTSI