Gene PHATRDRAFT_48710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48710 
Symbol 
ID7194987 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp15245 
End bp18477 
Gene Length3233 bp 
Protein Length969 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183410 
Protein GI219126324 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.873734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT TCCGTCCAGC AACATGGTTT CTTTACGGGG CTGCTCTTGT TGCCAATTTG 
GTTTCCGTCA AAGGTAAATC ATTTCTTTTG TGTTTCTTCA TCATTGTTTG GATTACTGTT
AACAAAAACA CTACTGACAA TTTCTGCTGC GCAATGACTG TTTCTGTATT GAAAATTCGA
AGGCCAAGCA CCGGTTGATC TCGTTTTTAT TATTGACGAA TCGGGAAGCA TGGGCGACGA
CCAGGCCCAG ATTGCCAACC GTGCCAACCA GATTACGGCC GCGCTTGACT CGGCAACTGC
TGGAAACTTC CGCGTTGGTC TAGTTGGATA CGGCGCGAGT GCTTTCGGAG GCTTCCCTCG
TAAAGTCGGG ACGCTTACAG ACGACGCTTC GATGTTCGGA GCTGCCGTAG CTTCTCTCGA
GACTTCTGGG GGGACCGAAC CCGGTTTTGT GGCAACCGAA CTCACTGCGG AAGACAGCCT
TTTGTACACG ACGACCTCGG ACTCCACAAA GCTCGATGGA TCGTCTGCAG GGACCTCTTT
TCCCGGTCCT GCCGGCTTCT GTGCCGTGTA AGTGAATCCA CCTGCCCTTG TTGAACAGTA
TTGACAACTT CTGACACGCT TATTCTGCCA TACTTCGATT ATTTAGCTTG ATTACGGATG
AACCTTCCAA CGGTGATGGA GCCACAACGC TTGCGGACGC AAAGACTGCT TTGGACAATG
CCAGCAACAG CGTGTTTTTT GGTGTCGTCC CGAGCAACCA ACTCTCCAGT ACAGGAGGTA
CTTACGGAGA TCTTGCTGCG CAGACAGGAG GTAGTGTTGT CGACATCAGC CTCTTCCGTA
GCAGCGATGT GGCTGCTGAA GCCATCATTG CCGATATTTT ACTCGGATGT GTGGGTGTGA
TTTCGACCAA TGCCGGATGC AGCACTAATG GTCCCTACCA AACTACAACC CCGGAGATTG
TGCTCTCTGG GTCTCCCTCA TGCACGGATG GAAGCATGCC GATTTCCGCT CTGTGGTCGA
GTCCAGACTC CGAAGTGGTC TTTTCCGACA CGACTAATCC TGCGATGGCG ACGATTTCAA
CCGCTGGTAA TTATTCCGTT TGCCATACAG TTGAATGCAC CGACAATACG TCTCCGTCCA
TGAACGTCAC CAATCAGTGT TGCACAGCCG TAGAATACAA TCCCCCAATC CCTATTTGCG
CGAACGAGAT TATTGGATTC ACCTTGATTG ACCCGGATAC GGATGCTGAG ATTGGTCCTC
TGGGCGACTA TGATGAGAGT GCTTACCCGT CGGGAGTGAA CATTCGTGCG GACTATTCTC
CTTGCTCGAC GGCTGACTTC ATTGACAGTG TTCGGGTGAC ATTCGACGAT CCGAGTGTTT
CGTTCTGCGA ATTGGATACG CCGTACTCGG TCTTCCGCGA TTCCCCGGAG GGAGACTACC
GTGCCGTTGT CATTCCGGTT GGAGTCCACA CTGTGAGCGC GACGCCCTAC CTGAGTGCTG
ACTGTTCCTC GGACGCTGGA GCGACCTTCA GTCAGACGTT TGAAGTGACT CCTACGCCCC
CGGCTTGCTT GAACGAGATT ATTGGATTCA CCTTGATTGA CCCGGATACA GATGCTGAGA
TTGGTCCTCT GGGCGACTAT GATGAGAGTG CTTACCCGTC GGGAGTGAAC ATTCGTGCGG
ACTATTCTCC TTGCTCGACG GCTGACTTCA TTGACAGTGT TCGGGTGACA TTCGACGATC
CGAGTGTTTC GTTCTGCGAA TTGGATACGC CGTACTCGGT CTTCCGCGAT TCCCCGGAGG
GAGACTACCG TGCCGTTGTC ATTCCGGTTG GAGTCCACAC TGTGAGCGCG ACGCCCTACC
TGAGTGCTGA CTGTTCCTCG GACGCTGGAG CGACCTTCAG TCAGACGTTT GAAGTGACTC
CTACGCCCCC GGCTTGCTTG AACGAGATTA TTGGATTCAC CTTGATTGAC CCGGATACAG
ATGCTGAGAT TGGTCCTCTG GGCGACTATG ATGAGAGTGC TTACCCGTCG GGAGTGAACA
TTCGTGCGGA CTATTCTCCT TGCTCGACGG CTGACTTCAT TGACAGTGTT CGGGTGACAT
TCGACGATCC GAGTGTTTCG TTCTGCGAAT TGGATACGCC GTACTCGGTC TTCCGCGATT
CCCCGGAGGG AGACTACCGT GCCGTTGTCA TTCCGGTTGG AGTCCACACT GTGAGCGCGA
CGCCCTACCT GAGTGCTGAC TGTTCCTCGG ACGCTGGAGC GACCTTCAGT CAGACGTTTG
AAGTGACTCC TACGCCCCCG GCTTGCTTGA ACGAGATTAT TGGATTCACC TTGATTGACC
CGGATACGGA TGCTGAGATT GGTCCTCTGG GCGACTATGA TGAGAGTGCT TACCCGTCGG
GAGTGAACAT TCGTGCGGAC TATTCTCCTT GCTCGACGGC TGACTTCATT GACAGTGTTC
GGGTGACATT CGACGATCCG AGTGTTTCGT TCTGCGAATT GGATACGCCG TACTCGGTCT
TCCGCGATTC CCCGGAGGGA GACTACCGTG CCGTTGTCAT TCCGGTTGGA GTCCACACTG
TGAGCGCGAC GCCCTACCTG AGTGCTGACT GTTCCTCGGA CGCTGGAGCG ACCTTCAGTC
AGACGTTTGA AGTGACGGTT ATTGCAGGAC CCTGTGTAAC TGGCTTTATG CTCTATGATT
CTGTCATGGA TTCGGTTGTG GCTGATAGCA TCCTTATGGG TGGTGAAATT ATGGAAGGGT
CCGTGGTGCC AGCCGGTCGC CCATGTAAAC TGAACATTGA AGCTGTTGCC GACGGATGTC
CCGGCTTTGA TATTGTCTCG GTGCGATTGC AGTTGCGCGA TGCAACGACT AATGCTGGCA
TCAAAGGCAG ACGCGAGATC GACGCACCGT ACATGCTGTA TGGTGATAAG GATGGAGACA
TTCGGAACGG TTCCGTGCCT GCTGGTAAAT ACCGCATTAG GGCCGCTGCT ATACTCGACG
GAGAAAGCAG TTACCAGGAT TATTACGAGA TCGATTTCGA ATTTGATGTC TGTGCTGGTG
GTACACGATC GCTTCGTGGC AATTCTGATG CATACTAAAG CTCATCCAGA GGGCACTGCC
TTTCATTCTT ACATGTCCTC CTTCACAAGA AAAGAAACAT TATCTTCTGG AAAGCAATAC
GAACAGTCAA GCAACATATA AGGAAACAAA AGATATAGTA CATATTAGGC TTC
 
Protein sequence
MKVFRPATWF LYGAALVANL VSVKGQAPVD LVFIIDESGS MGDDQAQIAN RANQITAALD 
SATAGNFRVG LVGYGASAFG GFPRKVGTLT DDASMFGAAV ASLETSGGTE PGFVATELTA
EDSLLYTTTS DSTKLDGSSA GTSFPGPAGF CAVLITDEPS NGDGATTLAD AKTALDNASN
SVFFGVVPSN QLSSTGGTYG DLAAQTGGSV VDISLFRSSD VAAEAIIADI LLGCVGVIST
NAGCSTNGPY QTTTPEIVLS GSPSCTDGSM PISALWSSPD SEVVFSDTTN PAMATISTAG
NYSVCHTVEC TDNTSPSMNV TNQCCTAVEY NPPIPICANE IIGFTLIDPD TDAEIGPLGD
YDESAYPSGV NIRADYSPCS TADFIDSVRV TFDDPSVSFC ELDTPYSVFR DSPEGDYRAV
VIPVGVHTVS ATPYLSADCS SDAGATFSQT FEVTPTPPAC LNEIIGFTLI DPDTDAEIGP
LGDYDESAYP SGVNIRADYS PCSTADFIDS VRVTFDDPSV SFCELDTPYS VFRDSPEGDY
RAVVIPVGVH TVSATPYLSA DCSSDAGATF SQTFEVTPTP PACLNEIIGF TLIDPDTDAE
IGPLGDYDES AYPSGVNIRA DYSPCSTADF IDSVRVTFDD PSVSFCELDT PYSVFRDSPE
GDYRAVVIPV GVHTVSATPY LSADCSSDAG ATFSQTFEVT PTPPACLNEI IGFTLIDPDT
DAEIGPLGDY DESAYPSGVN IRADYSPCST ADFIDSVRVT FDDPSVSFCE LDTPYSVFRD
SPEGDYRAVV IPVGVHTVSA TPYLSADCSS DAGATFSQTF EVTVIAGPCV TGFMLYDSVM
DSVVADSILM GGEIMEGSVV PAGRPCKLNI EAVADGCPGF DIVSVRLQLR DATTNAGIKG
RREIDAPYML YGDKDGDIRN GSVPAGKYRI RAAAILDGES SYQDYYEIDF EFDVCAGGTR
SLRGNSDAY