Gene PHATRDRAFT_37916 details


Gene Information

Locus tag: PHATRDRAFT_37916
Symbol:
ID: 7202843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 393196
End bp: 396321
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002182059
Protein GI: 219123495
COG category:
COG ID:
TIGRFAM ID:
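
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `check_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS (the record lists "Is gene spliced" as No), and a gene length that includes the terminal stop codon.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate, gene-length and protein-length fields.

    Assumes 1-based inclusive coordinates and an unspliced CDS whose
    length includes one stop codon (assumptions, not stated by the record).
    """
    span = end_bp - start_bp + 1                 # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon, which is not translated
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for PHATRDRAFT_37916
result = check_lengths(393196, 396321, 3126, 1041)
print(result)
```

With the values listed here, the span is 396321 - 393196 + 1 = 3126 bp, which is 1042 codons, or 1041 amino acids plus one stop codon.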


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGGA GAAGGATATC CCATGCCATC CCGCTGGCTC TAAGCGTTTG TCTCCGGAGT 
CGGCTCATAG ACTCCTTTCT CGCCCTGCCA TCAAAATTGC ATTCGAATCG GTTGGTCTTT
GATTCTCCAT CGTCCACGAA AACAACATGC AAAAGAGATT TTAGAATTTT CGGAGACCTT
ACCGGAAAGC TCGAAGACGA CCAGGATGCA CAAGAACTCA CAACGACGAG ACACTCAAGC
ACACATCAGT CTCGGCGGAA AGCCATGCAA GCCTTAGGCT TGGCTTCGTT GGCTATCCCA
ATGGCCGCCT CAGCGGGAAT CGCCGAACTC GACAAGTCTA CGGGAGCACT GTTCAGTCCC
AAATCCGAAA TGCTTTCTGG TGGGAGTGCG GCTGCGCGTG GTATTCCTGT TTCTGGTAGT
CGCCGCCAAC AACTCCAACC TGGACAAGCG CTCCAAACAG TATACGAAAC TCGCTTTATT
GTGTACCTTG CGCGATTTTT ATTGAATTTT GATCCATCAG CACACGCCTG GTGGCTCCAA
CAAGGTTTTG CCGATTCTTG GGAACCTCGC TCTGGCTCGG ACGAGGCATT TGCCGACAAC
ACTTTAGCCG AGTTTGCAGA AAGTGTCGAA GTTGGTTTGG CTGATTATTT TGTTGGTCCC
TACGGTAGCT ATTCGTCACT GTCCGCTGCC AAGGCTGGAA TTTCAGCGGC GCGGCCAGCA
CCCTCCGCGC AACCACAACA AGAAGAAAAC TACTTAAAAG AACTCCTTTT TGGTCGACCG
AAACTTTCCG ACGAAAAGAC ACCCAAAGAA AAGGTAGACA GCGCAAAAAA AGGGATACTC
AATTTGTACA CTCTGTTGAA AGCTCGCTAT ACATCTGTTG CCGCCAAACG GCATCTTGCT
ATTCTGTTTT CTTTTATATC CTCCCCAAGG CTTCAACCAA GTAATGAAAT CCTAGCTTTG
CTTGGTGAAT CGGACAATGC AACAATTTCT GAGATTCGGA TCGTCAAACC AACTCATTGG
CCAGTCAACG AGGCGGACTC GCGGACGAGT AGCCGACGGG GCGGTGGCTA CTCAATCGAG
GAGCCACCCA TTGTTACTAT CGATGAACCA CCGGCGTTAG GCGACAGCTA CGTACCGGCT
GAATTAAGAC CCGTTCTCAA GCCGACATCA CGTGTTTTGC GCATTTCTGT GATTGACGGT
GGTGAAGGTT ATACATCGGC GCCCCAGGTG ACCGTAGTGC AAAGCGGCTA TCTACGCTTG
TGTCAAGCTA CTGCTATTAT AGATCGGAGT GGCAGAGTCG AATCTGTCAT TCTTTTGGAT
CCGGGATGGG GTTACGGTGG TCGAAAGCAG GCTCCTCCCA AAGTCAAAAT TGAACCTCCC
AGACTTAAAA GTAAAGGAGA ATCGGGTCAG CGAGCGAAGG CTGTTGCTGA ACTTGAGTAC
GAAATAGTCG GTGCGAAGGT TGTCCGTGGT GGAAACGGAT ATGTCAAGAC CGAAGTACCA
CAAATTACAA TTACCCCGCC GGATGGGAAT CCCGACTGGT TTCTGGCAGT GCAGGAACAG
CCAGAGATGC GAATGAAGAA ACCAGCAGAA ATCGAACCTT TACGACTAGA AGTGGCCGAA
ATGAAATTCT CGGATGGCAG CGTTGCCTAT TCTATCGATC GTATGCCCGA GAGCAAAGGT
GTGGACAATG CATTACTCGA TCGTCTTCAA AGAGATCCAC TCGAAATGCT ACCCCCTTCA
ATTCGACCCG AAAGATACAA ATATGGAATC TACGCCATAC CTTCCCTGGC ACGCATTCCA
CAGTCCGTTC CGAACTTGTC ACCGAGATAC CGTGCATGTG ACCCGGTATT TGGAGGCGTC
GGTCGTGTTC CAGTTACAAA AGGGGCTGTG GCCTTGAAAG CTAGCGAATA CGCTCGTCTT
GCTTTAAGCG GCGCCGTCTG CACAGTGTTG GTACGCACAG CTTTGAATCC GCTGGAGTTG
ATCAAGACCA AGCAGCAATT GCAAAACGAT AACGAATTAC TCTCTTTTGC GAGAGCTCGG
GCTCTCCGAA AAGGGATTTC TCCTGATCAC CAAGACGCGC ACGAACATAT GCTGTCTAGC
AACAAAGCGG AAATCAATGC CACTGCTGCT GTTGCTCCTC AAGAAACGGA CGGACAAATA
AAGCTAGGAA CACTCGATTT GATATCAAGT CTCATCGAGT TACGAGGCCC TTTAGCCTTG
TTTCAAAGTG CTGACATTAC CTTTCTTGCT TCCTTGGTAT TTGGCTCACT CGGTTTTGGC
GCCACTGAAT TATTTCGTCG TTCGTTCACT GCATTTTTCA TTGCCGGTTC TGGAACGGAT
GAAATTGGTT TGGATGTTGT TGCGCTTTTA GCGGCTGCTT CCCTGGCGAC CGTTGTGACA
GCAGCCGCTG CCGCGCCCTT TGAGGTCTTG CGTGTAAGGA GTATGGGTCT AATAGAATCG
GTGGGTTGGA CAAAGGTTTT GGAGGATTTC ATCGCCGAAA AGTCAAGACC AAGACAAAAA
ACATCAAACT CTTTCGGTCT CAATCGTAAA CAACATGGAG GTCATCAAGA ATTTGAATTG
CGCAACTTAA AAGCGAGGGA TATCCTGCCT TTATGGGCTG GTTTTGCACC CACCGTCAGT
CGTGAACTCC CGTTCGCGGT CGTCAAGTTT TTGACATTTG ACTTTATTAC TGGAACGGTA
ATTACCTTCT TGAACACACA GTCCAGTGAT GGCGCGTTGC CTATTCAGGT TGGAACGGGC
CCTATTGGAT TGATAGTATC GGCGCTGGCT GGCGCTGTGG CAGGTATCGC AGGTGCTGTT
GTCTCGCATC CAGCCGATTT GATTTTGACA AAGACATCAG CTAGTGGCAA TCGCAATGGA
ACAGAAACTT CCGCATCGGT AGAGGAGCCA GACTGGAGGG ATGTTGTCAG GGAGTTGATA
GCACAGCCCG GCGGAATTGC GAATCTTTAC GTCGGTTTTC CTGCACGTGC TACATTTTTC
TTCCTTGTCA TTGGGCTGCA GTTCTTTTTG TACGATTATT TCAAAACGTT GCTAAATGTT
GGCTCCGACG ACTTGAGCTT GGTATTGGAT GTGTTTTACG CGGTGCGTGC CGGTCTCGTT
GGGTAG
 
Protein sequence
MKRRRISHAI PLALSVCLRS RLIDSFLALP SKLHSNRLVF DSPSSTKTTC KRDFRIFGDL 
TGKLEDDQDA QELTTTRHSS THQSRRKAMQ ALGLASLAIP MAASAGIAEL DKSTGALFSP
KSEMLSGGSA AARGIPVSGS RRQQLQPGQA LQTVYETRFI VYLARFLLNF DPSAHAWWLQ
QGFADSWEPR SGSDEAFADN TLAEFAESVE VGLADYFVGP YGSYSSLSAA KAGISAARPA
PSAQPQQEEN YLKELLFGRP KLSDEKTPKE KVDSAKKGIL NLYTLLKARY TSVAAKRHLA
ILFSFISSPR LQPSNEILAL LGESDNATIS EIRIVKPTHW PVNEADSRTS SRRGGGYSIE
EPPIVTIDEP PALGDSYVPA ELRPVLKPTS RVLRISVIDG GEGYTSAPQV TVVQSGYLRL
CQATAIIDRS GRVESVILLD PGWGYGGRKQ APPKVKIEPP RLKSKGESGQ RAKAVAELEY
EIVGAKVVRG GNGYVKTEVP QITITPPDGN PDWFLAVQEQ PEMRMKKPAE IEPLRLEVAE
MKFSDGSVAY SIDRMPESKG VDNALLDRLQ RDPLEMLPPS IRPERYKYGI YAIPSLARIP
QSVPNLSPRY RACDPVFGGV GRVPVTKGAV ALKASEYARL ALSGAVCTVL VRTALNPLEL
IKTKQQLQND NELLSFARAR ALRKGISPDH QDAHEHMLSS NKAEINATAA VAPQETDGQI
KLGTLDLISS LIELRGPLAL FQSADITFLA SLVFGSLGFG ATELFRRSFT AFFIAGSGTD
EIGLDVVALL AAASLATVVT AAAAAPFEVL RVRSMGLIES VGWTKVLEDF IAEKSRPRQK
TSNSFGLNRK QHGGHQEFEL RNLKARDILP LWAGFAPTVS RELPFAVVKF LTFDFITGTV
ITFLNTQSSD GALPIQVGTG PIGLIVSALA GAVAGIAGAV VSHPADLILT KTSASGNRNG
TETSASVEEP DWRDVVRELI AQPGGIANLY VGFPARATFF FLVIGLQFFL YDYFKTLLNV
GSDDLSLVLD VFYAVRAGLV G
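
The gene and protein sequences can be checked against each other by translating the DNA. The sketch below is illustrative only. The record leaves the Translation table field empty, so the standard genetic code is assumed here. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 60 bases and the final 6 bases of the gene sequence are used to keep the example short.

```python
# Standard genetic code (assumed; the record does not state a translation table).
# Amino acids are listed for codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":        # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line and last line of the gene sequence as shown on this page
first_60 = "ATGAAACGGA GAAGGATATC CCATGCCATC CCGCTGGCTC TAAGCGTTTG TCTCCGGAGT"
last_6 = "GGGTAG"

print(translate(first_60))  # MKRRRISHAIPLALSVCLRS
print(translate(last_6))    # G
```

The first 20 codons give MKRRRISHAI PLALSVCLRS, matching the start of the protein sequence, and the final codons GGG TAG give the terminal G followed by a stop codon.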