Gene PHATRDRAFT_47778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47778 
Symbol 
ID7202752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp809316 
End bp811691 
Gene Length2376 bp 
Protein Length777 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181981 
Protein GI219123333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.950731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACTT CGGTCGTTGA CGTCGATGCG GCAAATCCGT ACCGGTACAC TTGCCGCACC 
AGCCTTTCCA AAGAGCAATA TCAATCTTTG ACCGCTGTGC ACAAGGAACT CGCGAATAGC
AAGAATGCTA CTAACTCAAA GGATCGGCCT TGGCAGAACA CTATTTTCCC AGTAGTATCC
ACACAGGACC CCCTGAAGCA GCCTAGCCGA GACTCTTCAA CAAGCGGCGC GCTCCGGAAA
AGGACCCACC AGCATCGCAG CGAGAAAAGG GATAATACGG ATCCTTCCGA CACTATCGTA
TCCACTACTG CGGTCAATAA ATCTTCCAAA CGTCGTCGTC GCCAAACGAA GGATCCGTCG
GCCCCGAAAC GACCCATGGC ACCATTCTTG CGTTTCCTGC ATCAGCATAT TGGACAAGTC
AAAACTCTCA GGTCGATTTC ATACCGAGAA GCCATATCAC TGTTGGGAGA GATCTGGTCG
AGCGAGACTC CTGAAGCTCG ACGGCCCTAC TTGGAAGCGC ACGAGGCGGA CCTGAACGCC
TACAAAATCG CCTTGACTTC TTGGAACGCA TTGCAAGCGG CGACCACTAC TCCCACGCAT
ACATCGGAGC GCCGATTTGA CAAAACCCGA CGGATGGACA CAGCCCAAAC TAGTAGAGCG
GCCCTTGATA ATGGAACTAC CACGCTAACT GAGAAGTCTT TGGCCGAGTA CGAGTCGAAA
AACGAGGCGG CGACAACGAA AGCCATTGAA ATACTGCCGC CTGTGAATCC ACGCGACTTT
GTGCCTCCCA AGACGCGAGT CCAACTGGCG GACGGCTACT ATAAGCGACC CCGAGGAGCT
CCGCCAAAGG GATGTTCCTG GGACAAACGC CAGGGAGTTT GGATCCAAAA AATTCCATCG
ACAGAAGTCG ACGCCTATCT AGATACTGCC TCATCCAATT GTCGGACGTC ATTGAAGAAG
GTCAAAGCTC GTGGCAATCA CTCCAAAACT TTGCAGCCCA ATCAACTTGT GCGCTTAGCT
CCACAAGCAC GACCTGAACG TTTAAGCGAC GGAAGTTTTC GGCGACCTCT AGGAAGACCC
CCCAAAGGAT ATTCATGGCA TTCGAAGAGC GGTGTGTGGC TATTGAATCT AGCTGTATCA
AGGGGGGGCA AAGGTGCACA CTGTTTGTCA GCTCCAATTC AGAGCTTCAG GAGTGCCAAA
ATCTCTCCGG GTGCGCAGCA GATCAGTAGA AAAGCATCGC AACCGGATCC GCATGTGCGC
GAAACAGAAT TGAGCTCTCC ATTATTTTTT GGGACGGATG GCTGTTACGG CGGACCTTCT
CTGGCTTCTC CTTTGGAACC ACCTGCTCCA TACATGAAAA AGCCTCTCAC AACACTCCCT
AGTTGCACAA GACAGAGTTG GCAGGAATAC GAACAGCAGG ATATGATGGG CGCGACCAAA
TCTCCTTTGC TCTGTAGTTC AAAAATCACC CAGGAAGAGA TTTATCTAAA AAGATGTATC
GCCCCGAAAA GCAAACCCTT CGCCTGTGCA GATGGAAGTT TCCGGCGACC AAGGGGGAAA
GCACCAGCAG GGTACAGCTG GGACCGTCAC CAGGGGGCTT GGCAAAGATC TGATTCTGTT
GCCACCACCA TCGAAGCCAA GGAAGACGAC GCAGTGTCAG ACATATCAGA AATTCCAATC
AAGGAGGTTT TCATTAAAAG CGGGTGGGAC ATCCCTCGAC GAGTAAGTAC TCTCTCGCTT
TCTTGCGAAA ATGCAGATGT GCCGGAATTC AATTCACCGT CAGTTCAGTA TGAAAGACCC
CAAAACCCAT CCAAACCCTT AGAGACTCTT CGAAGCAGCG AACTGGCAAA AAAGCCAGGA
TTACATCGCA AGCCTGTGTA CATGCATAGC AATAGTGCGT TATTGCAAAC AAGATACTCT
GCATGCGGAA GTTGCCGGGC CTGCTGCGAA CCCGTCGCCT GTGGGACTTG TCTGAACTGT
ATGCAGCAGC TTGACGATCA TGTTCCGACA TTGTTTGTCC CAACGTGCGC GCGTTGCATT
TGTATTGCTC CCATCCTGCG TGTTCCATCG GCACAATTAG AGACAATGTC CGTGTGTAGC
TTGTCAAATC CCAACTCCTT GGTTGAAGAT GAACTTAGTG ACCTTGATTC GAAGGTCAAC
GATCATGGTA CTTCGGCATT CTCATTCTCA CAATCAGCGT CTCAGTTCTT GAAGCCGAAA
AGTGAGTCGA ATTTGAGTAT TTCTGGGGCT GATTTAGCAA GAAAGTTGGA TGGAATCAAT
CGTGCCCATG ACGACGAATG CGCAGGAAGC ACCGACGATT CTGTTGTCGT ATAGCTCAGC
TCAGGGAAAT TTCTATTAAC ATTATAGACC CAATAT
 
Protein sequence
MSTSVVDVDA ANPYRYTCRT SLSKEQYQSL TAVHKELANS KNATNSKDRP WQNTIFPVVS 
TQDPLKQPSR DSSTSGALRK RTHQHRSEKR DNTDPSDTIV STTAVNKSSK RRRRQTKDPS
APKRPMAPFL RFLHQHIGQV KTLRSISYRE AISLLGEIWS SETPEARRPY LEAHEADLNA
YKIALTSWNA LQAATTTPTH TSERRFDKTR RMDTAQTSRA ALDNGTTTLT EKSLAEYESK
NEAATTKAIE ILPPVNPRDF VPPKTRVQLA DGYYKRPRGA PPKGCSWDKR QGVWIQKIPS
TEVDAYLDTA SSNCRTSLKK VKARGNHSKT LQPNQLVRLA PQARPERLSD GSFRRPLGRP
PKGYSWHSKS GVWLLNLAVS RGGKGAHCLS APIQSFRSAK ISPGAQQISR KASQPDPHVR
ETELSSPLFF GTDGCYGGPS LASPLEPPAP YMKKPLTTLP SCTRQSWQEY EQQDMMGATK
SPLLCSSKIT QEEIYLKRCI APKSKPFACA DGSFRRPRGK APAGYSWDRH QGAWQRSDSV
ATTIEAKEDD AVSDISEIPI KEVFIKSGWD IPRRVSTLSL SCENADVPEF NSPSVQYERP
QNPSKPLETL RSSELAKKPG LHRKPVYMHS NSALLQTRYS ACGSCRACCE PVACGTCLNC
MQQLDDHVPT LFVPTCARCI CIAPILRVPS AQLETMSVCS LSNPNSLVED ELSDLDSKVN
DHGTSAFSFS QSASQFLKPK SESNLSISGA DLARKLDGIN RAHDDECAGS TDDSVVV