Gene PHATR_46837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46837 
Symbol 
ID7204690 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp590628 
End bp593844 
Gene Length3217 bp 
Protein Length997 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185738 
Protein GI219121013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAC GCAAGCAAAG ATCCAAGCGT TCTTTGGCTA GTCCTCCAAA AGGTCTTGCT 
ATTCGAGGTG GTACGAACGG AGCCACGAAT CCGTTTGAGA TCACATCTCG TCACAAGCGA
CCGAAACACG AAGTGCACAA TCGCAAGATT CCCACAAAGA ATGTCGCCAA GCCGTCCGCC
TTGGCCCAGG CGGTCTCTCG TCGCAGGAAT GCACTCAGTG AAGCTTTGCA GAAGTCAAAG
AAGAACAATA CTTTTGCCGA TAAGCGCATA GGAGAGTACG ATCGGAGCAT GACGTCGGAC
GAGCAAAATT TGGCTCGATT GGTTCGGGAA CGGGCTAGGC GTAGCAAACG CTCTACAAAG
TTTAGTTTGA ATGACGAGGA TGAGGACGGC GGCCAAACAA CCCTTACACA CAAAGGAAAA
TCCATCAGTG ATATGTCTGC TAGAGACCAC GTTATTCTGT CGGATGATGA CGAAGACGAC
ATTGGAAACC TAGACGCTGC CGATACGGCA CTGCATTTCG GCGGCGGATG CCGGTCGGCC
GTCCCAGATG ACGGATACTA CGGTCCTACC GCTGGCGCTC AGGGTAAAGA TATGTCAAGC
ATGTATATGA CGCGAAAAAT GGAATTGGAT GATCTCATAT TGCGGCGAAA AATTAAGAAA
GCCGAAAGGA TGAAATCTCG CGAAGATCAA ACCGAAGCCT TTGAGGCCAT GGACGAATCG
TTCGCCGAGC TGTCGAATAT GCTATCTTTC CGTGACAAAG AAAAGGAGAT CCGGGAGCAT
GTTAAGTCCA AACGTGCTGG CATGCTTGCA CCGGAAGACC AAGAAATGGA GGACTGGGAT
AAGCAAATGA AACAATATCA GTTCAATGAG AGAAAAGTGA AAGCAACTGA TCGCGTAAAA
ACACCAGAAG AGATTGCCAA GGAAGAGGTA GATCGTTTGC ACGAGTTAGA GACCCGTCGC
TTAGCTCGCA TGAATGGTGA CTTTGTCGAC GATGACCTTT CGGATATTTC CGATGGTGAG
CGGAACCTGC AGAAGCGAAA GACAAGAAAG AGTACCCTAG CTAGTACCGC TGAAGCGTTA
GATGATTCCG AAGTGGAGGA TGACGCGGAA GAAAAGGTAA CAACTCGTTT TACTGCCGAC
GGCCTTGTGG AAGTCGACAG AAATGGTGTT GTTGCAAAAA AGGTTAATGC ACCAGATAAT
GACGCTCTGA ATGGTGCGTT TGCAAAAGGA TCAAGAGTTA GCGCTTGCTA CCACGCGAAA
GAACAGCTCG ACGACGATGC GACCTGGTTC AATGGGGTTG TTTCGGGTGT CTACGCACGA
GACGACGGCT TGGTAGTATA CAACATCGAA TACGATGATG GAGATTTCGA AGATGGTGTT
GAGCATCGAC ATGTTCGCCC CGCGGATGAA ACGAAAAGTG TTAGGAAGGT GGAGGAAATA
GAAGCAAACA AGCAGGAACA GGAGCTTGAG CTGAAAAGAA AACGCCTACG AGCGAAAGAG
AAAGCAAGGT ATGTAAAACA ATCTGTAGTA TGGGGTGATC GCACGGATTT CGCGCTTGCA
TCTTTGTTGC ACCTGTATGG GGCTTGGTAG AGCGTTCTGC TCTCAATTGC TTCTCGAGTC
GCGCTTCCCC TATATTGAGA ACCGTCCACA ATAGTTCTTG CAAGACAAAG TCACGCCTTG
TGAGTCTCCA TGGAATTTAC ATTTCTAACT TTTAATGGTG CATTCTTCCA GGGCGGAACT
TCCTTTTGTG TTTGAAGTTC CAACAACGTT GGAGGCACTG CATGATATGA TTGGTACATA
CGCCTCTACC GGCGCGGATG CTTCTCTAAT CATAAAACGA ATTCACGCAT CCAATTCAGT
CAGGTTGGAC AGAAGAAATG CTGAAAAGAT GCAGAACTTT TACGATGTGG CGATTCGACG
GTTTGTTGCA GTTGGTGATG CGATTTATAG ATCAGGTGAT GGCGATTCAG AGTTAGGTCG
ATTCAAACAG CTCGATGAGT TGGCCAGGAT CATGTACACA ATGGCGCAAG ACTCGGCTGA
AAGTGCAACT GCCGTATGGG GGCGCCGTTT AGGTGTTTTC CAAAATGCAC ACGCCAAGAG
ACTGCGCGAC GCTGAGTTAG ATCGTGATGA AGACGATGAG GATCAGTCTG CGTGGCCGTC
GATTGGAGTT TTCTTAGCCC TCCGTGCGAT CGGTCATATC TTTCCAGTGA CCGATCAGCG
TCACCAGATT ATCACTCCCA CGCTGCTAAT GCTTGGTCAA ATAGTTTCTC AAACACCTGT
TCTTTCCATA TATGATCTTG TTGTCGGAAC AATGTGCTCA GCACTACTAA TTGAGTACAC
AAAAGCTGCC AAACGTATTT CACCGGAAGC GATTGCTTTC ATCGCCGGTG TTTTGCGAAT
GTTCGCATCG GATTCCTTGA AGCGACAAGG GCCATATTCT CTTCCAAGTT TGGAGAAAGC
GGTGACGGGA GAGCAATTTG ATTCATTTCG AGCCCTTGCT AGTAGCTATC AAGAGGTGTA
CCCACCAAAT CTGAGTTTTG AAAGGATTGG CATGTTTAGT GCGGAAACAC CCGCTGCGCT
GCTCTATGCT GCGCTACATT TGGTTGAGGT CACCGTGCTT TGCCTTTCGG GCTCAGGCAT
CGACGCTGAA CGAGAACTTT TTCACACTTT GGCGGAGTGT GTATTGAACA TTAAACCATA
CAGTAAGAAG CACACACTCT GCAAGTGTTT GCAAGAAAAG GCTGCTTCAG CTGCTGAGTG
CCTTGAGAAA GCGTGTAAGC TTGAAAATGC CCGTCCTCCA CTGAGGCGCA GATCGCTACC
GACAATTCGT GACACTATGA TTAAGTCACT TGCTCCACGT ATCGAGAATG CTGAAAAATA
CTCCATGTCG AAAGACAAAG GCAAGAATGC CCCACAAGCC GCTTTGGATC GCACTCGCCG
AGAACTTCGC CGTGAGCACA AGGCTGTTTC TCGCGAGCTG CGGCTGGATT CTTCTTTCGT
GGAGAGGGCT CGGCGCGAGG AACAGACCAA GAAGGATTCC GCTGCTCAAG CCAAACGTCA
GAAGGCTTAC GGATGGCTTG AGAATGAGCA GGCTACCATG AACCAAGAAG TGCGGATGGG
AGGCGGCTTG TTGAAGGGTG GTGGTATGGG AGCTGCAAAA GCGAAGGCGG CTACTGGGAA
ACTAGGCATG AAAAAGGGCG GCAAACTAAA GCGGTAG
 
Protein sequence
MGKRKQRSKR SLASPPKGLA IRGGTNGATN PFEITSRHKR PKHEVHNRKI PTKNVAKPSA 
LAQAVSRRRN ALSEALQKSK KNNTFADKRI GEYDRSMTSD EQNLARLVRE RARRSKRSTK
FSLNDEDEDG GQTTLTHKGK SISDMSARDH VILSDDDEDD IGNLDAADTA LHFGGGCRSA
VPDDGYYGPT AGAQGKDMSS MYMTRKMELD DLILRRKIKK AERMKSREDQ TEAFEAMDES
FAELSNMLSF RDKEKEIREH VKSKRAGMLA PEDQEMEDWD KQMKQYQFNE RKVKATDRVK
TPEEIAKEEV DRLHELETRR LARMNGDFVD DDLSDISDGE RNLQKRKTRK STLASTAEAL
DDSEVEDDAE EKVTTRFTAD GLVEVDRNGV VAKKVNAPDN DALNGAFAKG SRVSACYHAK
EQLDDDATWF NGVVSGVYAR DDGLVVYNIE YDDGDFEDGV EHRHVRPADE TKSVRKVEEI
EANKQEQELE LKRKRLRAKE KARAELPFVF EVPTTLEALH DMIGTYASTG ADASLIIKRI
HASNSVRLDR RNAEKMQNFY DVAIRRFVAV GDAIYRSGDG DSELGRFKQL DELARIMYTM
AQDSAESATA VWGRRLGVFQ NAHAKRLRDA ELDRDEDDED QSAWPSIGVF LALRAIGHIF
PVTDQRHQII TPTLLMLGQI VSQTPVLSIY DLVVGTMCSA LLIEYTKAAK RISPEAIAFI
AGVLRMFASD SLKRQGPYSL PSLEKAVTGE QFDSFRALAS SYQEVYPPNL SFERIGMFSA
ETPAALLYAA LHLVEVTVLC LSGSGIDAER ELFHTLAECV LNIKPYSKKH TLCKCLQEKA
ASAAECLEKA CKLENARPPL RRRSLPTIRD TMIKSLAPRI ENAEKYSMSK DKGKNAPQAA
LDRTRRELRR EHKAVSRELR LDSSFVERAR REEQTKKDSA AQAKRQKAYG WLENEQATMN
QEVRMGGGLL KGGGMGAAKA KAATGKLGMK KGGKLKR