Gene PHATRDRAFT_39092 details


Gene Information

Locus tag: PHATRDRAFT_39092
Symbol:
ID: 7194749
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 311146
End bp: 314324
Gene Length: 3179 bp
Protein Length: 1037 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002183071
Protein GI: 219125614
COG category:
COG ID:
TIGRFAM ID:
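
The Gene Length field follows from the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That convention is an assumption, since the record does not state it. A minimal Python sketch with hypothetical helper names checks the arithmetic and compares the gene span with the number of bases needed to encode the protein:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode a protein, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(311146, 314324)  # Start bp and End bp from the record
cds_length = coding_bases(1037)            # Protein Length from the record
non_coding = gene_length - cds_length      # bases in the span not used by codons

print(gene_length, cds_length, non_coding)  # prints: 3179 3114 65
```

The 65 bases left over are consistent with the record listing the gene as spliced, although the record itself gives no exon boundaries.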


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.129915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGTGC CCGTGGTGTC CGTACGCGGC GTGTCTCGGA TCCCCGCACG CCGTCAATCC 
GTCCGTTCTC TCCCTGGTTG GTGCTTCTAC TGTTACTGTT CTTTGCGACC GGTGTAGTGG
TGCTGGTACT GATGCTGGGG GACGGCTTTG GGACGATACG AACGACCGCC GCCACGCAGG
AACCATCTCG TGGTGCATCA CACCGTGTGG AAGTCGATGG TGTCCCGTGG ATGCCGTCAC
GAACGAAACA ACAACGATTC CGAATTATTA GTCATGATGC ACCCTCGTCA ATGCCCTCTC
CTCCACCACC ACCGCTGCCC ACGTGGACCG ACACGGCGTT CGACACCCCT CCTCCGACAC
TGTACCGGCC AAAATTCATT CCACCTCCTC CTCCTCGGCC TCCACCGCCA CCACGAAATC
CTCCCACTCT TCCACTTACC TCCAACACTA CAGCAACAGT ATCAACACCA CCAATACTAT
TACCACCACC ACCACCAATA ATACCACCAC CAACAATGCC GCAATACCCC CTACGCACCG
CTACATCGAC TCCACCGTTA TCAACTACAA CAACCAGTCC GATTGCCATT CGTGCCGCTC
GTTTGTCCGT AACTGCGGCG ACCATTCGTC CACCGAGACC CCCGTCGATT ACCATCACAG
CCGTCCATCC CTGGCAGTCC CCGTGGTCGG ACCCACCACC ACCGCCCCCG CCGCGGCCCA
CGCCACCCGC ACCTCCTGTA CCGTCGTCGC CCACGACGAA CGTATACTTG TCCTTGTTCT
GGATGCTGCG GAACGTTGCG ATTCGACAAT CGAGCCTGGT TGCTCCCGAA CGCAGGCTGG
TAGACCCACA ACTCCCCACC CACGCCATCT TACCGAACAC GACACACGTT GGTTCCATCC
CGACCGCGGT GGACCATCCA TTGCCATTCA CCAACGGCGT TGACGATACA TCGGACGAAG
ATAGCTTGGA CGAGGATACC TTGAACGACC GGGATTGGGA GATTCTGCAT CAAGCCCAAG
CTTTCCCCGT GGACGTCTAC GACGACGAAT ACGATGGGAT TCCCACGCCA CCCGCACCTC
CTTTAACGAC GAACGTATAC GCACTCTTGT TCTGGATGCT GCGGAACGTT GCGATTCGAC
AATCGACCCA GGTTGCTCCC GAACGCAAGC TGGTAGACCC AAAAGTCCCC ACACGCGCCG
CCCTACCGAA CACGACACAC GTTGGTCTTA TCCCGACCGC AGTGGACCAT CCATTATCAT
TCACTGACGT CGTTGACGAT ACATCCGACG AAGATAGCTT GGACGAGGAT ACCTTGAACG
ACCGGGATTG GGAGATTCTG CGACAAGCCC AAGCTTTCCC CGTGGACGTC TACGACGACG
AATACGATGC GGTTGAAATG GAAAGCGTCG ACCAGGACCC CCCTACACAT CATGCCCTGG
AGTTCAATGC TACCCTTTCT CGATACGCTG TGGTACCACT GTGGGCCACA CTCTCATCAC
TCCTGTTTCG ACCAAACTCG GAACAAGAAT CTCTCAGACT ATCGCGGACG GCGCAGACTC
TTCATAGCAA TGCCCTTATC AAAACCAATA CGGACGTGTT GGGCTTGATT GTACCCCTAT
CACAAGCCTG GGGAACGCAA GTGGATCGTA CGGTGCGATT GTTGGTCACA TGGGCACGCT
GCCTGGACAC GGTGCGGCGA AGGCCACGGC GAGCCACCCA CGTGGTCCGC CGACGGATCG
TCCGTGTCCG CAAAGTACCC CGACGCATCA AAATCTCCAC TTTGCCGGAA GACTACGATT
TGGACGACGC GAATTTTGAA GAATTGCTCG AGCAAGTACG ATTACCGTCG TACATGAAGG
AAGAGTATTC CGACTACGAA GATACGGAAG AAGACTACTG GATCCGCAAC GACTACGAGC
AAACCCCATC TCTGCATTCC TTGCTGCGAT CCGCGCCCGA CGTGAATGCG GTAGCATCAC
GGTCTGTTGA ATCGAGTTTC TTTTCCGATT CCACCATTGG TTGGGATGAT GAGTGGGACT
TTGACGACGA CAGCTTGTGC TCACAAGATT TCGAGATTCT GCAGCAAGCC AACGAAATTG
TCTGGGAAGA TTTTGATTCT ATAGACATTT GCAACTCAGA CGATGAATAT GAGAATATGT
GGGATTCGAA TCGGTGGAAC GGCGAGGACG ACTCGGAATT AGATGAGTCG GACCATGCCA
ACCAGTTCCC CGGCGAGGAC GCAAACCGGA TGGAGGAGAG ACTGGAGGAG CATTGGGGAC
GCCATTCAAA GACTGAACGG CGCAAAAGCC GTGAAAGCTT TTCTAGCCAA TGGCAATCCA
AAAATAGCTC AATGCACTAC AAACATTTAC ATGCGGAACA GGAGGAGACC TTGTTGCAGT
ATTTCAAACG GCCACTCACA GTCAAGCGGA GTCTGTTTTC CTTGAGGAGA TATGTGCCTT
TCTCAGGAAC TAATAGCGCC GTAAAGCCAG TTGACGCAAC TGTTGTTGCT AGGGAAGGCA
CCGCCTGCGA CACAAAAAGT TTTTCAACTG AGACTCCAGT AGGTTTCTTG GGCAAAGCAT
TGGACGAGCA TTCGTCCCCT CTTACACAAC TTCCTTGTCG ACACGAGTCA AACTCGGATC
CCTATACCAA TCAAAGAGCT CCGATTCATA GCTCCTGTTC TCGCACCCAT TTTGGTTGGA
TGCGCCCATG GTCTTGGTCA AAGTTTCCTC ATCGACATTC ATTGCTAACT GATATTCAAA
CATCTAAGGA AAACGATGTA ACGAATATTA TCACGGGACC CAAGAGCTCA GTAACTGGCG
CTTTGACGGA AGCGGCGGTA GTAGAGCCAA AGTTCCTTCA GGAAGTTGAC TGTAAGCTTT
TGAGAAAGCC ACGTAAGCCC TGGCGTATAT CGCAGCTACT TGGTACTTGG GAGTGGCCTG
CGCTCTTCCG ACGGAGCAAT AAAACCACTC CGGATTTCGC CGCAGCCAAG AAGACTTACG
AGGAGGAATC ATCATCGATC AATTTGCAAG ATGGTTTCCT TAAGAGAGAA AACGAATTAG
ACGAGATGAA TCTGGGTGAG AGTGCCATTC GAGAAGTTAA TCATGCGTCA CCATTGGGAC
CTCCACCGCT ACCCTCGAAG CCTTCGCATT CCTCTGTTCT TCGCATGGAA GCCTTTTAA
 
Protein sequence
MPVPVVSVRG VSRIPARLVL VLMLGDGFGT IRTTAATQEP SRGASHRVEV DGVPWMPSRT 
KQQRFRIISH DAPSSMPSPP PPPLPTWTDT AFDTPPPTLY RPKFIPPPPP RPPPPPRNPP
TLPLTSNTTA TVSTPPILLP PPPPIIPPPT MPQYPLRTAT STPPLSTTTT SPIAIRAARL
SVTAATIRPP RPPSITITAV HPWQSPWSDP PPPPPPRPTP PAPPVPSSPT TNVYLSLFWM
LRNVAIRQSS LVAPERRLVD PQLPTHAILP NTTHVGSIPT AVDHPLPFTN GVDDTSDEDS
LDEDTLNDRD WEILHQAQAF PVDVYDDEYD GIPTPPAPPL TTNVYALLFW MLRNVAIRQS
TQVAPERKLV DPKVPTRAAL PNTTHVGLIP TAVDHPLSFT DVVDDTSDED SLDEDTLNDR
DWEILRQAQA FPVDVYDDEY DAVEMESVDQ DPPTHHALEF NATLSRYAVV PLWATLSSLL
FRPNSEQESL RLSRTAQTLH SNALIKTNTD VLGLIVPLSQ AWGTQVDRTV RLLVTWARCL
DTVRRRPRRA THVVRRRIVR VRKVPRRIKI STLPEDYDLD DANFEELLEQ VRLPSYMKEE
YSDYEDTEED YWIRNDYEQT PSLHSLLRSA PDVNAVASRS VESSFFSDST IGWDDEWDFD
DDSLCSQDFE ILQQANEIVW EDFDSIDICN SDDEYENMWD SNRWNGEDDS ELDESDHANQ
FPGEDANRME ERLEEHWGRH SKTERRKSRE SFSSQWQSKN SSMHYKHLHA EQEETLLQYF
KRPLTVKRSL FSLRRYVPFS GTNSAVKPVD ATVVAREGTA CDTKSFSTET PVGFLGKALD
EHSSPLTQLP CRHESNSDPY TNQRAPIHSS CSRTHFGWMR PWSWSKFPHR HSLLTDIQTS
KENDVTNIIT GPKSSVTGAL TEAAVVEPKF LQEVDCKLLR KPRKPWRISQ LLGTWEWPAL
FRRSNKTTPD FAAAKKTYEE ESSSINLQDG FLKRENELDE MNLGESAIRE VNHASPLGPP
PLPSKPSHSS VLRMEAF
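
The sequences above are laid out in space-separated blocks of 10 residues. The following is a small Python sketch, with hypothetical helper names, of how such a block layout could be joined into a single string and how a GC fraction comparable to the GC content field could be recomputed from it. The short string used here is a made-up fragment for illustration, not the gene itself:

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


example = "ATGC GGCC\nAATT"  # illustrative fragment only, not taken from the record
seq = join_blocks(example)
print(len(seq), gc_fraction(seq))  # prints: 12 0.5
```

Applied to the full gene sequence, the length of the joined string can be compared with the Gene Length field, and the GC fraction with the GC content field.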