Gene PHATRDRAFT_19901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19901 
Symbol 
ID7200552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp174994 
End bp177906 
Gene Length2913 bp 
Protein Length900 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179808 
Protein GI219118050 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCTCGCGCA CTCACCGCCG CACATTATCG AGTGCGCCGC CGCCGAAGCC GGGGCCTCGC 
AAGATGCCTG TCATCTCCCG CCGGCCACGC GGTTCGCCGC CGTCGGACAA AAGGGCGCCA
CGCTCTGGAT GACGGGATGT TCCGGTGCCG GCAAAACCAC CATTGCCACC GCACTCGAAG
ATCAACTCGT CAAGAGTTAC GGGAAACACG TCTACCGTCT GGACGGGGAT AACCTCCGCA
CCGGACTCAA CCGTGATTTG GGATTCTCCG AAGCCGATCG CGCCGAGTCG GTCCGACGGA
CCGGGGAACT CGCCACACTC TTTGCCGACG CCGGTGTCGT CACGCTCGTC GGACTCATCT
CGCCCTACCG CAAGGATCGC GACGCCGTAC GCAAACGTCA CGTCGACCAA GGCATTCCCT
TTTACGAAGT ATTCCTCGAC GTGCCCGTGG ATGAACTCAA AAAACGCGAT CCCAAGGGAC
AGTACGCTCG TGTCGAGTCC GGAGAACTCA AACACTTTAC CTGCATCGAC GACCCCTATG
ATGAACCCTT GCAACCAGAA ATTACCCTCA AAACGCACGA ACTCACCATT GAACAGTCGG
TGCAGATTCT CTTTCGACGA CTCGAACGAG ACGGAATTCT GGTCGGGGCG CCCAAACTTA
GTCCGCCCGG TCTGCCCAAC CCCGACGGGG ACGTCTTGGT GGACTTGCAC GTTCCCGACG
AATCCAAAGA AGCCCGTCGC GCCGAGGCGG CGACCCTCCC CAAGGTCTTG ATCAACGACA
TTGATCTCAA CTGGTTGCAA ACCATTGGGG AAGGCTGGGC CTCACCGCTC CGAGGTTTCA
TGCGCGAAGG CACACTGTTG GAAACCCTGC ACTTTAATTC GATCCTCACG GATCCCTTCA
ACCTCACGGG CAACGCCCTG CGACTGGAAA CCCGCACGAA CTTTGATCAC TTTTCCGCCC
ATCCGGCCCC CCAACGCGTC TCCATGCCCA TTCCCATCAC CCTCTCCTGT ACATCTTTTA
CCAAGGACCT CATTGACGCC TCGTCCCACA ACGCCGTCGC TTTGGTGACA CAAATGGGAC
ACACCGTGGC CATTCTACGC GATCCCGAAG TCTACGCCAA CCGCAAGGAA GAAATCGTGA
CGCGTATGTA CGGTGTCGTG GATCCGGATC ATCCCTACAT TCAACACATT TATCGGGGCG
GCGACTACTT GATTGGCGGA GAAATCGAAC TGCTGGATCG CATCCGCTAC AATGACGGCC
TCGACCAGTG GCGCAAAACA GCGACGGAGC TCGTGCAAGA GTTCCAGAGC AAAGGGGCCG
ACACGGTGTA CGCCTTCCAA ACGCGTAACC CGACCCACGC GGGTCACGCG TACCTGATGC
GTTCCGCCGG TGAAGACCTG CGTCGTCAGG GGTACCAGAA ACCCGTCCTG TGGTTGAGTC
CCCTGGGCGG TTGGACCAAG GCCGACGACG TGCCGCTCGA TGTGCGCGTC AAACAGCACG
AACAAGTCCT GCAAGCGGGC ACCACCCATC CCGGTGGCCT CGATCCGGAA TCCACCGTCA
TGGCTATTTG GCCCGCTCCC ATGGTCTACG CCGGACCCAC CGAAGTCCAG TTCCACGCCA
AGTCACGGCG CTCCGCGGGA GCCTCGTACT TTGTGGTCGG CCGCGATCCC GCCGGAATGA
AAGGATCGCC CAACGCGGTG GCGCACCCGG ACGATGACCT CTACGACGGT AACCACGGAC
GTTACGTTCT GCAGAACTCG CCGGGCCTCG GAGATATGAA GATGCTGAGC TTTGTCAAAG
TCATGTACGA CACCACCGAC AATATTATGA AGATTCCGGA CGAAGCGCGG CTGGCGGACT
TTATCAGTAT TTCGGGCAGT AAAATGCGAC TGTTGGCCCG GAACGGGGCC ACCCCCTGCA
GTCCCACCAA TATTCCGACG GATCTGGTCG AAGCCAACTG CGTCCCCAGC GGATTCATGG
TACCGGACGG TTGGAATCAA GTGGTCGACT ACTACCGGAA TATTGATGAT GTGCAACGCT
GGACGCCGTG GAGTCAACCT CGCGTAGATC CCCCCACGGC ACCGCGCACC ACGTATCAAG
GCCAGTTTGG TTCCCGATCC TTCCACCTGA CTAGTACAGA ATACGAATCC TTCTGGCACG
ACATTCCCCT GAGTCCATCG GGGCAATCCG AAACCGTAGT CAACATGGTG ACGGAAATTC
CCATGTATTG CACGGCCAAA ATGGAGATTC AAAAGATGCT GTCCAACAGT CCCATTGCTC
AGGACACCAA CAGCGACGGT TCGCCGCGTC ACTACAGCTA CGGTACGCCC TTTTTCAACT
ATGGTCTCAT TCCACAAACA TGGGAAGATC CCAACCTAAA ATCTGCGCAA GGGTACGGTG
GGGACAACGA TCCGCTCGAC GTTATCGAAT TGGGGTCGTC GCCCTTGCAA ATGGGTGGAC
TAACGCCGTG TCGGGTGTTG GGATCGTTTG AGCTCATTGA CGAAGGCGAA ACGGACCACA
AGATTCTGTG CATTGCCGTG GACGACAAAG ACGCCAACCA AATCCATTCC TTGGAAGATT
TGGAGCGTGT CAAGCCGGGT CACTTGGACA AGCTCCGGGA TTGGTTGAAG CGGTACAAGA
CGAGCGAGGG CAAAGCGGAA AACAATTTGG CGTCTGAAAC GCCGCGCACC GCGATGGAAG
CCGTAGGCGT CATTCAAGAA ACGCACGGAC GCTGGCGATC ATTGTGTGGT AAGGATGGAA
CGACAGTCTA TTCTCTTTCG AGCAAGACGG CCGGTTTCTG GCTCAGCAGT CCGGGGTGTA
GGGGAACGTA ATCTTACAGT TAGTGTCGCA GTTCCCTCCA GCCCAAAATT ACACAAACCC
TTATTTACTT TTTAGAAATT TGCCACGAGT CGT
 
Protein sequence
MTGCSGAGKT TIATALEDQL VKSYGKHVYR LDGDNLRTGL NRDLGFSEAD RAESVRRTGE 
LATLFADAGV VTLVGLISPY RKDRDAVRKR HVDQGIPFYE VFLDVPVDEL KKRDPKGQYA
RVESGELKHF TCIDDPYDEP LQPEITLKTH ELTIEQSVQI LFRRLERDGI LVGAPKLSPP
GLPNPDGDVL VDLHVPDESK EARRAEAATL PKVLINDIDL NWLQTIGEGW ASPLRGFMRE
GTLLETLHFN SILTDPFNLT GNALRLETRT NFDHFSAHPA PQRVSMPIPI TLSCTSFTKD
LIDASSHNAV ALVTQMGHTV AILRDPEVYA NRKEEIVTRM YGVVDPDHPY IQHIYRGGDY
LIGGEIELLD RIRYNDGLDQ WRKTATELVQ EFQSKGADTV YAFQTRNPTH AGHAYLMRSA
GEDLRRQGYQ KPVLWLSPLG GWTKADDVPL DVRVKQHEQV LQAGTTHPGG LDPESTVMAI
WPAPMVYAGP TEVQFHAKSR RSAGASYFVV GRDPAGMKGS PNAVAHPDDD LYDGNHGRYV
LQNSPGLGDM KMLSFVKVMY DTTDNIMKIP DEARLADFIS ISGSKMRLLA RNGATPCSPT
NIPTDLVEAN CVPSGFMVPD GWNQVVDYYR NIDDVQRWTP WSQPRVDPPT APRTTYQGQF
GSRSFHLTST EYESFWHDIP LSPSGQSETV VNMVTEIPMY CTAKMEIQKM LSNSPIAQDT
NSDGSPRHYS YGTPFFNYGL IPQTWEDPNL KSAQGYGGDN DPLDVIELGS SPLQMGGLTP
CRVLGSFELI DEGETDHKIL CIAVDDKDAN QIHSLEDLER VKPGHLDKLR DWLKRYKTSE
GKAENNLASE TPRTAMEAVG VIQETHGRWR SLCGKDGTTV YSLSSKTAGF WLSSPGCRGT