Gene Information |
Locus tag | PHATRDRAFT_39092 |
Symbol | |
ID | 7194749 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 311146 |
End bp | 314324 |
Gene Length | 3179 bp |
Protein Length | 1037 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183071 |
Protein GI | 219125614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
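The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that happens to reproduce the listed 3179 bp) and that the protein is followed by a single stop codon. The function names `gene_length` and `coding_length` are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
start_bp, end_bp = 311146, 314324
protein_aa = 1037

span = gene_length(start_bp, end_bp)
cds = coding_length(protein_aa)

print(span)        # 3179
print(cds)         # 3114
print(span - cds)  # 65
```

Under these assumptions, the 65 bp difference between the genomic span and the coding length is consistent with the gene being marked as spliced, since the span would include non-coding intronic bases.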
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.129915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTGC CCGTGGTGTC CGTACGCGGC GTGTCTCGGA TCCCCGCACG CCGTCAATCC GTCCGTTCTC TCCCTGGTTG GTGCTTCTAC TGTTACTGTT CTTTGCGACC GGTGTAGTGG TGCTGGTACT GATGCTGGGG GACGGCTTTG GGACGATACG AACGACCGCC GCCACGCAGG AACCATCTCG TGGTGCATCA CACCGTGTGG AAGTCGATGG TGTCCCGTGG ATGCCGTCAC GAACGAAACA ACAACGATTC CGAATTATTA GTCATGATGC ACCCTCGTCA ATGCCCTCTC CTCCACCACC ACCGCTGCCC ACGTGGACCG ACACGGCGTT CGACACCCCT CCTCCGACAC TGTACCGGCC AAAATTCATT CCACCTCCTC CTCCTCGGCC TCCACCGCCA CCACGAAATC CTCCCACTCT TCCACTTACC TCCAACACTA CAGCAACAGT ATCAACACCA CCAATACTAT TACCACCACC ACCACCAATA ATACCACCAC CAACAATGCC GCAATACCCC CTACGCACCG CTACATCGAC TCCACCGTTA TCAACTACAA CAACCAGTCC GATTGCCATT CGTGCCGCTC GTTTGTCCGT AACTGCGGCG ACCATTCGTC CACCGAGACC CCCGTCGATT ACCATCACAG CCGTCCATCC CTGGCAGTCC CCGTGGTCGG ACCCACCACC ACCGCCCCCG CCGCGGCCCA CGCCACCCGC ACCTCCTGTA CCGTCGTCGC CCACGACGAA CGTATACTTG TCCTTGTTCT GGATGCTGCG GAACGTTGCG ATTCGACAAT CGAGCCTGGT TGCTCCCGAA CGCAGGCTGG TAGACCCACA ACTCCCCACC CACGCCATCT TACCGAACAC GACACACGTT GGTTCCATCC CGACCGCGGT GGACCATCCA TTGCCATTCA CCAACGGCGT TGACGATACA TCGGACGAAG ATAGCTTGGA CGAGGATACC TTGAACGACC GGGATTGGGA GATTCTGCAT CAAGCCCAAG CTTTCCCCGT GGACGTCTAC GACGACGAAT ACGATGGGAT TCCCACGCCA CCCGCACCTC CTTTAACGAC GAACGTATAC GCACTCTTGT TCTGGATGCT GCGGAACGTT GCGATTCGAC AATCGACCCA GGTTGCTCCC GAACGCAAGC TGGTAGACCC AAAAGTCCCC ACACGCGCCG CCCTACCGAA CACGACACAC GTTGGTCTTA TCCCGACCGC AGTGGACCAT CCATTATCAT TCACTGACGT CGTTGACGAT ACATCCGACG AAGATAGCTT GGACGAGGAT ACCTTGAACG ACCGGGATTG GGAGATTCTG CGACAAGCCC AAGCTTTCCC CGTGGACGTC TACGACGACG AATACGATGC GGTTGAAATG GAAAGCGTCG ACCAGGACCC CCCTACACAT CATGCCCTGG AGTTCAATGC TACCCTTTCT CGATACGCTG TGGTACCACT GTGGGCCACA CTCTCATCAC TCCTGTTTCG ACCAAACTCG GAACAAGAAT CTCTCAGACT ATCGCGGACG GCGCAGACTC TTCATAGCAA TGCCCTTATC AAAACCAATA CGGACGTGTT GGGCTTGATT GTACCCCTAT CACAAGCCTG GGGAACGCAA GTGGATCGTA CGGTGCGATT GTTGGTCACA TGGGCACGCT GCCTGGACAC GGTGCGGCGA AGGCCACGGC GAGCCACCCA CGTGGTCCGC CGACGGATCG TCCGTGTCCG CAAAGTACCC CGACGCATCA AAATCTCCAC TTTGCCGGAA GACTACGATT 
TGGACGACGC GAATTTTGAA GAATTGCTCG AGCAAGTACG ATTACCGTCG TACATGAAGG AAGAGTATTC CGACTACGAA GATACGGAAG AAGACTACTG GATCCGCAAC GACTACGAGC AAACCCCATC TCTGCATTCC TTGCTGCGAT CCGCGCCCGA CGTGAATGCG GTAGCATCAC GGTCTGTTGA ATCGAGTTTC TTTTCCGATT CCACCATTGG TTGGGATGAT GAGTGGGACT TTGACGACGA CAGCTTGTGC TCACAAGATT TCGAGATTCT GCAGCAAGCC AACGAAATTG TCTGGGAAGA TTTTGATTCT ATAGACATTT GCAACTCAGA CGATGAATAT GAGAATATGT GGGATTCGAA TCGGTGGAAC GGCGAGGACG ACTCGGAATT AGATGAGTCG GACCATGCCA ACCAGTTCCC CGGCGAGGAC GCAAACCGGA TGGAGGAGAG ACTGGAGGAG CATTGGGGAC GCCATTCAAA GACTGAACGG CGCAAAAGCC GTGAAAGCTT TTCTAGCCAA TGGCAATCCA AAAATAGCTC AATGCACTAC AAACATTTAC ATGCGGAACA GGAGGAGACC TTGTTGCAGT ATTTCAAACG GCCACTCACA GTCAAGCGGA GTCTGTTTTC CTTGAGGAGA TATGTGCCTT TCTCAGGAAC TAATAGCGCC GTAAAGCCAG TTGACGCAAC TGTTGTTGCT AGGGAAGGCA CCGCCTGCGA CACAAAAAGT TTTTCAACTG AGACTCCAGT AGGTTTCTTG GGCAAAGCAT TGGACGAGCA TTCGTCCCCT CTTACACAAC TTCCTTGTCG ACACGAGTCA AACTCGGATC CCTATACCAA TCAAAGAGCT CCGATTCATA GCTCCTGTTC TCGCACCCAT TTTGGTTGGA TGCGCCCATG GTCTTGGTCA AAGTTTCCTC ATCGACATTC ATTGCTAACT GATATTCAAA CATCTAAGGA AAACGATGTA ACGAATATTA TCACGGGACC CAAGAGCTCA GTAACTGGCG CTTTGACGGA AGCGGCGGTA GTAGAGCCAA AGTTCCTTCA GGAAGTTGAC TGTAAGCTTT TGAGAAAGCC ACGTAAGCCC TGGCGTATAT CGCAGCTACT TGGTACTTGG GAGTGGCCTG CGCTCTTCCG ACGGAGCAAT AAAACCACTC CGGATTTCGC CGCAGCCAAG AAGACTTACG AGGAGGAATC ATCATCGATC AATTTGCAAG ATGGTTTCCT TAAGAGAGAA AACGAATTAG ACGAGATGAA TCTGGGTGAG AGTGCCATTC GAGAAGTTAA TCATGCGTCA CCATTGGGAC CTCCACCGCT ACCCTCGAAG CCTTCGCATT CCTCTGTTCT TCGCATGGAA GCCTTTTAA
|
Protein sequence | MPVPVVSVRG VSRIPARLVL VLMLGDGFGT IRTTAATQEP SRGASHRVEV DGVPWMPSRT KQQRFRIISH DAPSSMPSPP PPPLPTWTDT AFDTPPPTLY RPKFIPPPPP RPPPPPRNPP TLPLTSNTTA TVSTPPILLP PPPPIIPPPT MPQYPLRTAT STPPLSTTTT SPIAIRAARL SVTAATIRPP RPPSITITAV HPWQSPWSDP PPPPPPRPTP PAPPVPSSPT TNVYLSLFWM LRNVAIRQSS LVAPERRLVD PQLPTHAILP NTTHVGSIPT AVDHPLPFTN GVDDTSDEDS LDEDTLNDRD WEILHQAQAF PVDVYDDEYD GIPTPPAPPL TTNVYALLFW MLRNVAIRQS TQVAPERKLV DPKVPTRAAL PNTTHVGLIP TAVDHPLSFT DVVDDTSDED SLDEDTLNDR DWEILRQAQA FPVDVYDDEY DAVEMESVDQ DPPTHHALEF NATLSRYAVV PLWATLSSLL FRPNSEQESL RLSRTAQTLH SNALIKTNTD VLGLIVPLSQ AWGTQVDRTV RLLVTWARCL DTVRRRPRRA THVVRRRIVR VRKVPRRIKI STLPEDYDLD DANFEELLEQ VRLPSYMKEE YSDYEDTEED YWIRNDYEQT PSLHSLLRSA PDVNAVASRS VESSFFSDST IGWDDEWDFD DDSLCSQDFE ILQQANEIVW EDFDSIDICN SDDEYENMWD SNRWNGEDDS ELDESDHANQ FPGEDANRME ERLEEHWGRH SKTERRKSRE SFSSQWQSKN SSMHYKHLHA EQEETLLQYF KRPLTVKRSL FSLRRYVPFS GTNSAVKPVD ATVVAREGTA CDTKSFSTET PVGFLGKALD EHSSPLTQLP CRHESNSDPY TNQRAPIHSS CSRTHFGWMR PWSWSKFPHR HSLLTDIQTS KENDVTNIIT GPKSSVTGAL TEAAVVEPKF LQEVDCKLLR KPRKPWRISQ LLGTWEWPAL FRRSNKTTPD FAAAKKTYEE ESSSINLQDG FLKRENELDE MNLGESAIRE VNHASPLGPP PLPSKPSHSS VLRMEAF
|
| |
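The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. A minimal sketch of how the blocks could be joined and a GC percentage computed is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short example uses only the first two blocks of the gene sequence, so its result is not expected to match the 54% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
example = "ATGCCGGTGC CCGTGGTGTC"
cleaned = clean_sequence(example)

print(cleaned)              # ATGCCGGTGCCCGTGGTGTC
print(len(cleaned))         # 20
print(gc_percent(cleaned))  # 70.0
```

To apply this to the whole record, the full Gene sequence field would be passed to `clean_sequence` first, which also removes the line break that splits the sequence in this listing.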