Gene PHATRDRAFT_20188 details


Gene Information

Locus tag: PHATRDRAFT_20188
Symbol:
ID: 7200900
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 142282
End bp: 145668
Gene Length: 3387 bp
Protein Length: 693 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002179982
Protein GI: 219118417
COG category:
COG ID:
TIGRFAM ID:
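
The gene length listed above follows from the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, although it matches the numbers in this record. The short Python sketch below shows the arithmetic; the function name `gene_length` is hypothetical and is not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
start_bp, end_bp = 142282, 145668
length = gene_length(start_bp, end_bp)
print(length)  # 3387, matching the listed gene length

# A 693 aa protein plus a stop codon needs (693 + 1) * 3 = 2082 coding bases,
# which is fewer than 3387; this is consistent with the gene being spliced.
coding_bases = (693 + 1) * 3
print(coding_bases)  # 2082
```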


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.248743
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCTA CGAACAGATC CTCTTCGTCG GTGTTTCTGC CGCCGTCCCA AACGCGCTTG 
GTCACGATTG CCGCGCACGT GGATCACGGC AAAACAACCT TGGCGGACAA TCTTATCGAG
GCCAACGGGC TTATTTCGGA ACGTCTCGCC GGGACTCTCC GCTACCTCGA CTCGGATCCG
GAAGAACAAC GCCGTGGCAT TACTATGCGC AGCTCCGCGA TTGGGCTCCA GCACGTGTAC
CAGAAACACC ACAAACCGAA CCACACGGGC GGCGATCACA CTCCAGCCAA CCAAGGCCAA
AAGCACGTGA TTCATCTCTA CGATTCTCCC GGACACACGG ATTTTTCTCG CGAAGTATCG
TCCGCCATGT CCTGCTGCGA TACGGCCTTG CTCGTCGTGG ATGCCGTCGA GGGTATGGGA
CCCCGTACAC ATCAAGTCTT TCGGGAAGCC TACGCGCAAC AGCTGGTTCC CATTCTCGTG
CTCAATAAAA TCGATCGATT GTGTTTGGAT CTGCGCCTCA CACCCACCGA GGCGTATCTG
CGTTTGCGGA ATCTACTCGA AACGGTCAAC GCCGCCGCTT CCACCTTGTT GACCAGCTCG
CGGCACGCGG ACCACGCGTC AGGAAGCAAT GGCGATCCGT CAACCGAGAT AACCACGGAA
TTGGAAACAC AGTGGACGTT TGATCCGGCC CGTAATAACG TGGTGTTTGC GTCCGCCCTG
TTTGGATTTG GATTTACGGC ACAAAATTTG GCGAGAGCCT TGTATCAGAC CAAGGCTATT
CCATCGTCCC TTAAACCACC CGTATTTCGT TCCCTGGTTT TTGCCGACGC CAAACTCAAA
GGTGATAAGG TACTGAAGTG GAAGGCACGG GACCAGACAG ACGATGCTCC CATTTTTGCC
ATCTATGGTT TGCAGCCACT GTGGGATGTT TTGGAAGGCG TTGCGACGGC GGCCGCGGCA
GCGGGACTCG GATCGTCACA ACTGTTTCAC CATGGAAGCT CCAACACGGT AGATCATCAC
CACAATGGCA CCCCGCCTTC GGTACCAACA ACGACTACTG TGGACGTGAA AATTAAAGCC
GACACGACTG GTATGAATCA AACTCTACGA GCCCTGAGTA TCGGACCGAC CGGCAGCGAT
GTACCGTCAA CCGTAGAAGC ATTGCAGACA ATCTTGACCC GGACGGGTGC CAATACGGAA
GAAGCCATTG TGCGATCCCT GTTACGGCGC TTTCGACCTC TCTCACGGAC ATTGTTAGAC
GTCCTTGTCG AATACGCTCC GTCGCCAATC CAGGCTGCAG CGTCGATGCG ACATCGGGCT
CTGAGTTTAC AAATGCCTGA GAGAACGGCA ATTACAAATG CTGCTCAGGA GGAATATTCT
CGAATTGCGG AGGCGGTTCA AAATTGCAGC GTTGCTCCGA ACGCCCCCAC CGTAGCGCAT
GTGTACAAGT TTATGGCCGC GGAACGTTCC CAAATTCGGG ATCCATGTTT GCCTACGAAT
CTGGAGAGTC ATGATGAGGA CCACACAAGC TTGATTCTGG GCGTAGCAAG GGTGTTGAGT
GGGAGCTTGA AAACGGGAAA GTCTTACTAC GCAATGGGCC CAAAGCATTT GCACACCGAC
TCCAATATTG TACCAAAACG AGCTATACGG CTGTACTTAC TCATGGGTAG TTCGTTTGTA
CTCGTGGACG AGGTACCGGC TGGACATTTG TGTGGGGTCT ACAATTTGGA AGACACGCAG
CTTAAAACAA TCACGCTATC TGACTCGCCC CACGGCATGC CTCTGACTGC CATGGAACAG
GGTATCCGAC CCCTCGTGAA GGTCAACGTG GAAGCACAGG AAGCTTCTGA TACCATTGCC
TTAGAACGCG GATTACGAAA ACTGGCCTTG GCTGATGCGG CCGTTGAAGT CACAGCGACG
GCCAAAGGAG AACGGCTTTT GGCTTGTTTA GGAGAAATTC ATTTGGAACA ATCTATTCTG
GATCTTCGGA ATGTTTATTG CGGTAGAGAA ATAAAATTGC GCATTTCTGA TCCCATTGTA
GACTTTGGCG AAACCACCGA CTGGTTTGAA CACGAAATCG ACTACGCCAC ATTTTGGGAG
GACCCAGCTC CGAGGCTGCG ACAAGTCTCG ATTCCACCAT ACAATGAGGA ATATGGCATA
TCCCTTAGCA GACATGGTAG GATGAGATCG TTGGTATCGG GCCGCTCAGC TGCGATTCAT
GTACGTGTAG TACCCTTGGC TTCGTCGATC TATCAATCTC TTTCGGACGA CAAGGTCGTG
GAGAACACAG AAGAAGATCT GCTGAACCTG GCCAAAGCAC TCGGATATCA CTGTCTAAAT
GCGGATGATG TACTGGAGAC ACTCAAGAGC GCGTTGTGCT CTTTGGGTAC GAATGGAAAT
GCACTTATAC TAGGACCAGG ATTGTGCAAT GAATCCTGTG TGGTCGGTGT CGTTTCGGAC
ACCGGCGAGG TTCACCTCCC ATCAATAGCA GCGGAAAAAT CGGGAAATTC TGACATCGCT
CCGGTGGAGC CGGAATCCAC TTCGGCCGAT GTCTGGGACA AAGACGGAGT GGGAATGAAA
GAGTTTCGAT CCATGCTAAG AAAGCTTCGA ACTGTTGGAG ATCAGAATGG ACACTCAAAT
CTAGAAATGT CAGAAGTGGA TGTTGCTGCA CGAAAAATAT GGAGCGAAGA TATGCGCGGA
TCAATGGTAG CTGGGTTTCA ACTTGCGGTT CGGGCCGGTC CAATTTGCGA AGAGCCCGTC
CGAAATGTAT TAGTGGTCTT AGAGGGTGCC GAAGTTGGAC TAGCTAGGCG GGGAGATTCT
TACGAAGCTG CAAAATCACT ATCTGGAGGA ATGCTGGTAG CCGCTCTTCG TTCAGGTATT
CGTTGTGCGC TCTTAAGCAG ACCCGCTAGG TTAATGGAAG GCCACTTGAG ACTTACGCTC
CACTCATCCA TGGCTGGACT CGGTCCTCTA TATTCGGTAC TTAACAAGCG TCGCGGCAAA
GTCCTAGATG ATTCCATGGT TGATGGTGCT GACTTGCTCA TGATCACTGC GCTTATTCCT
CAAGCGGAAG CATTTGGACT CGCACCGGAA CTTTACAGCA ATACCAGTGG GGAGGTCACC
GCGCCAGAAC TAAATTTTAG CCACTGGGAT CGACTTGACG TGGACCCGTT TTGGATCCCA
ACAAGTTTAG AGGAACGGGA GGATTTTGGC GAGTTACAGA TGGCTGGAGA TATGTCTACT
GGTCTGGACA ATACCGCTCT CAAATATATT CGCAAAGTTC GAGAACAAAA AGGCCTGACT
ACTGACTCGG CCCGTACAGT TTTAAATGCC GAAAAGCAGC GAACACTTAA GCGATAGAAA
TAATGAAAGG AGTACGTAAC TTGTAAC
 
Protein sequence
MSATNRSSSS VFLPPSQTRL VTIAAHVDHG KTTLADNLIE ANGLISERLA GTLRYLDSDP 
EEQRRGITMR SSAIGLQHHV IHLYDSPGHT DFSREVSSAM SCCDTALLVV DAVEGMGPRT
HQVFREAYAQ QLVPILVLNK IDRLCLDLRL TPTEAYLRLR NLLETVNAAA STLLTSSRHA
DHASGSNGDP STEITTELET QWTFDPARNN VVFASALFGF GFTAQNLARA LYQTKAIPSS
LKPPVFRSLV FADAKLKGDK VLKWKARDQT DDAPIFAIYG LQPLWDVLEG VATAAAAAGL
GSSQLTAITN AAQEEYSRIA EAVQNCSVAP NAPTVAHVYK FMAAERSQIR DPCLPTNLES
HDEDHTSLIL GVARVLSGSL KTGKSYYAMG PKHLHTDSNI VPKRAIRLYL LMGSSFVLVD
EVPAGHLCGV YNLEDTQLKT ITLSDSPHGM PLTAMEQGIR PLVKVNVEAQ EASDTIALER
GLRKLALADA AVEVTATAKG ERLLACLGEI HLEQSILDLR NVYCGREIKL RISDPIVDFG
ETTDWLMEGH LRLTLHSSMA GLGPLYSVLN KRRGKVLDDS MVDGADLLMI TALIPQAEAF
GLAPELYSNT SGEVTAPELN FSHWDRLDVD PFWIPTSLEE REDFGELQMA GDMSTGLDNT
ALKYIRKVRE QKGLTTDSAR TVLNAEKQRT LKR
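
Both sequences above are laid out in space-separated groups of 10 characters, 60 per line. Before checking a sequence against the listed lengths or GC content, the whitespace has to be removed. The following Python sketch shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence from the record above.
first_row = "ATGAGTGCTA CGAACAGATC CTCTTCGTCG GTGTTTCTGC CGCCGTCCCA AACGCGCTTG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq) * 100))  # GC percentage of this row only
```

Applying the same two functions to the full gene sequence block gives values that can be compared with the Gene Length and GC content fields.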