Gene PHATRDRAFT_42749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42749 
Symbol 
ID7196376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp958597 
End bp961687 
Gene Length3091 bp 
Protein Length1002 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176696 
Protein GI219109886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCATTGTAT TTAGAAATGG ATTCTAGTCG TTCATCAAGC AACCGAGCTT CGAACTCATC 
TGCTTCGCTA AGTGCCAATA GTTTGCCTTC CCAAAAGGCG GACCAGACGG CGGCAGCAGC
CGCGTCACTG AACTTGTTGT TGCGTGAAGA TCGTCTCCTG GTGATCCATG GTTTGACCGA
ATCGGGTACT CAGCCTGACA AGGTGGCCTT TCCGTACGCG GCGGCACTTT TAGCAGACGG
AGATGATAGC AATACTGTCA ATACGAAACA ACGTGCTGAT GGAGCGTTGC AGGATGTCGA
GCGGAAACTA GCTTTGGTCG AGAGCCTTGC CGTGAAACTC AGTCGCACCA GCCCTGAGGC
CGTTGCAGGC CATCTACTCA GATTGCATGG ATATCATCTT CCTAAAGAGG GCCTTAAAGA
AGATAAGCCA TCCAGCACGA CGCTATCAGC GGTTAGAGAC AAAGCTGATC GCCTGGAACG
GCAATCCGAA GTTCTGGAGA ATGTAGCTCG ACGAGTGGAG GGATCATTGT CACGGGGCTT
GAAACGTATG GAGACTGCGT GTACTCGTCT TGAACGGGTT CTGTCACTGA GCAATACGCT
AAAAATGATA CTGAGACTTC AGTTCGAAAA TAGCAAGTTG CAAAATTACG ATCTGGAGGA
CTTACGTGAC CTGACTCGTG CCGCCGCCAG CGTCTCGGTA GTCGAGGATT TGCTGAAGCG
CTCTGAACTA CAAGCCTCGA TTGAAGCCAT ACAAAAGATA CGCCCCGAGG TCGAACGCAC
TGCAACTGAT GTACGCCAAT CAGCCGCTAT GCTGTTGCAG GATCAGTATC ATCAAAAAAA
TGCTATTCAT CAGCTGGGCG GTACTCTTCA AGTGTATTAC CATTTAGGAG AATTACCGGG
GGCCGTTTGG AAAGTAGTGG AAAACGCGCA CGGTAAAGCA GAATCTACAT CTCGAGATTT
ATGGAACGCA TTGACCCTAA TGAATTTGAC AGAACAGGCC AAAAAGACAG CCAAAGATAG
CCGGTCCGTG GAAAAGAAGC TCAAGCAAAT GCGGGCAGAA GCTGCATCTC AATGGGCGAA
TGGTATCTAC GACGTGTCAA CACAGGTGCG AAATTTACAG CGAGTGCTTA TGCGCAAAAG
CGATCCAATA CAGCGCCAAT TTTTTGTGGA CGTCGTGGCC GCCGCATCAA TTCCAGCCGC
CTTCAGAGAT TCGTCTCTCG GAAAAGACTT TTCTTTGTTT GGTCTATTTT GGGGGCGCTT
TTGCAAATCC CTGGGAATTA TTTTGGAAGA TATTTTGCAA CAGGACAATG GAAAACATCG
CTCGGACGTC GCAAGCCTGT ATCCGTCTGT GCGTAGCGTT TCGAACGACA TGTTGAGCAC
TTTGCAGGAT AATTTGAATG CAGGCAATTC AGCATTGGAG GACCTTGGAA CTGCTGCAAC
CCCAGGTATT CTCGGAGGAT CTGCTCTCTT AGATGACACA TTTCTGGACT GGACAACGGG
CCAGTTCGAT GTAGAGGAGA ATCCGCAATC TGCCACCACA CCTGATTCCT GGACCCATAC
TACTCAACGC AGCGCATCGG CGAAACACCC TTCGCAACGT TTTTCGGCCT CAGGTGGGAC
AGGCTCTGCT ACGATGTCTC AAATATATCA ATCGATGGAG TGGAATACTT TACAGGGAGA
TAAGAAGGGA CGCCATGGGC TTTATCCATT ACAACAAGCA TTTATTGAAG CCTGTACGGA
CAGGCTATGT TCTCCACTTC AATTCATGTT CCCAGAAAAC GTTGCTCTTG ACGACGACGG
TGTCGCCATT GCTTCCGGGC TCAGTATGTT GCCCAGCAAG TACGATATTC AACGCTTTGA
CGAAAACATC CGTCAGGAAA TTTCGTTGGC TGACCCGAAA GAAGGCGGCG GTGATCTAAG
CAGTGTTACT ATGATAGCCA ACTGTGTCGT GTCTATGATT TCAGAGCTCT GCCTCCGAGC
AAAGAATGCG TTGAGTGGTA TTGGAGAATC AGGATATCTG AATAGTGATT GGTCAATGAC
GGAATCGCTG AAGCATGATC GAAAGGTGAC AGTGATTCTT TTCACTGTGG CAAATTACTT
GCGTATCGCG CCTGATACAG TGTTTTTGGC ACCATACCGT CCGTCCATTT CATTGCAACA
AGAAGAAGCA GCGAGCGTCT GCCAAGTCGC ACTGCAACCG GCTCTCAAGG AGATTGAGAA
AATGGTTAAA AATTCTGTGA CCTCACCTTT AGGACGAGCA ATCAATAAGC GAATTGGTGA
CACCATGGCA AAAATGCATC AAGGTGTCTA TCTTGGTAGC AATGTGGGTA TCGACGAAGA
CTCCCCTGCC TTTGTGCAGA AACACTTGAA CGGCATTTAC GAAATCATTT CGAAAGAAAT
TCTTTCGAAG TTGCCTCCAG AATATGGGTC GGCTGTGGCG ACATCTGTGG CAATGTTTTC
GATCTATAAT TTTGTGTCAA ATTTTACTCT GCTTCGACCT TTGGGTGAAT CGGCTCGTCT
GCATATTACG CAGGACTTGG CCGACCTTGA GCTTGCACTG GAACAGCTCA TGTTGAAGAG
CGGAAATTCT GTTTCTTTGC ATTTTATTGG AAACGGCAAG CCGTACTTGG AACTCCGTGC
CGTTCGCCAA ATGCTGTTTT GGACGGGGTT GGACAGCGCT GATAAACAAG CCGTGGATGT
CGCCAAAAGC TTGTTGCGCG AACCGTGGAT GAAGGATGTA CGTCCGTCAA CTATCTTTCA
CTATTTGTAC TCGTACGCGC CTTCGTTTTT GTCATCCCCA TACCATACGA GACGTATGAA
GCCAGAAGCT TACGTTCGGT TGTTGGTGAA GCCAGATGGC TCCGTAGAGG AGACAGAAGA
CGATGCTTGG ATGACAGTTA TGGCGAGCTG CGATGCTTAC CAACAAAGAG CAAGCTCTGG
AGGCTCAAAT ATGGATGGAG ACATTCGAGT GGCTGAAAAG CTTTTGACTA TGGGACCGGA
TGTTATGCGT CGGCGAGGAC ATTAGATGTA AATGTCTCAT CCCTCCTGGT GCTTATTGGT
TTATTTAAAC TAGTTCGTCT TGTGTTCATT G
 
Protein sequence
MDSSRSSSNR ASNSSASLSA NSLPSQKADQ TAAAAASLNL LLREDRLLVI HGLTESGTQP 
DKVAFPYAAA LLADGDDSNT VNTKQRADGA LQDVERKLAL VESLAVKLSR TSPEAVAGHL
LRLHGYHLPK EGLKEDKPSS TTLSAVRDKA DRLERQSEVL ENVARRVEGS LSRGLKRMET
ACTRLERVLS LSNTLKMILR LQFENSKLQN YDLEDLRDLT RAAASVSVVE DLLKRSELQA
SIEAIQKIRP EVERTATDVR QSAAMLLQDQ YHQKNAIHQL GGTLQVYYHL GELPGAVWKV
VENAHGKAES TSRDLWNALT LMNLTEQAKK TAKDSRSVEK KLKQMRAEAA SQWANGIYDV
STQVRNLQRV LMRKSDPIQR QFFVDVVAAA SIPAAFRDSS LGKDFSLFGL FWGRFCKSLG
IILEDILQQD NGKHRSDVAS LYPSVRSVSN DMLSTLQDNL NAGNSALEDL GTAATPGILG
GSALLDDTFL DWTTGQFDVE ENPQSATTPD SWTHTTQRSA SAKHPSQRFS ASGGTGSATM
SQIYQSMEWN TLQGDKKGRH GLYPLQQAFI EACTDRLCSP LQFMFPENVA LDDDGVAIAS
GLSMLPSKYD IQRFDENIRQ EISLADPKEG GGDLSSVTMI ANCVVSMISE LCLRAKNALS
GIGESGYLNS DWSMTESLKH DRKVTVILFT VANYLRIAPD TVFLAPYRPS ISLQQEEAAS
VCQVALQPAL KEIEKMVKNS VTSPLGRAIN KRIGDTMAKM HQGVYLGSNV GIDEDSPAFV
QKHLNGIYEI ISKEILSKLP PEYGSAVATS VAMFSIYNFV SNFTLLRPLG ESARLHITQD
LADLELALEQ LMLKSGNSVS LHFIGNGKPY LELRAVRQML FWTGLDSADK QAVDVAKSLL
REPWMKDVRP STIFHYLYSY APSFLSSPYH TRRMKPEAYV RLLVKPDGSV EETEDDAWMT
VMASCDAYQQ RASSGGSNMD GDIRVAEKLL TMGPDVMRRR GH