Gene PHATRDRAFT_47294 details


Gene Information

Locus tag: PHATRDRAFT_47294
Symbol:
ID: 7202376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 214783
End bp: 217267
Gene Length: 2485 bp
Protein Length: 777 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181510
Protein GI: 219122351
COG category:
COG ID:
TIGRFAM ID:
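
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its coordinate convention) and that the coding region ends with a single stop codon. The function names `span_length` and `coding_length` are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 214783
END_BP = 217267
PROTEIN_LENGTH_AA = 777


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

print(gene_length)               # 2485, matching the Gene Length field
print(cds_length)                # 2334
print(gene_length - cds_length)  # 151 bases not accounted for by the protein
```

Under these assumptions, the 151 extra bases are consistent with the listed gene sequence, in which the first codon of the protein (ATG GGA GAC ...) appears to begin at position 152.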


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.599773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGCACTCTA CTTCCCGTGT TGACAATCTA GAAGCACTCG ATAAAATTTT GGTTTTGTCG 
TCCTTCAGCA GCTTTGAATT TGCTTTCTCA GCCAAGAACT AATCTAGCCT CCCTAGCGGA
CGGATTGATT AATGTAAGGT GTCGAGTTAC CATGGGAGAC AATCTATCAG AATTTCCCGG
CGTCAAAGAC TTGCTGCAGC TATGGAGCAA CGATAATGGA GACATGATGA TGATGGACGG
TATGCGGTCT CCAATAGTAC GGCGACAAGA CCGTGTCGAT ACTGCGTATG ACGAAGAAGA
GGACGACTAT ACACCCATTC CTATTGAACT GTGCTCGGTG GACGCGGAAA GCGTTGACGA
AAGCTGCGAC AATCGCTCCC GCGCTTCCCA GAAACGCCGT CGCGTGTCTG GTGCAACCGG
TACTATCTCT TCACAAACTG GCTCGAACAT CTACCGGGAT GCGCTTACAG TTACGGCATC
GGCAATGTTG TATCAGAATC GCAACAGCGA ATTTATAGAC GAATGCGATG TGATTCTAGG
TGATCCACAT ACACCCACGA TAGGGACAAA CGTGTATCGC CAGATTCGTC AAGTTTACGC
CAAAAGAAAG ATCAAACAGT CGGACCTTGC TATCATTCGA GAGCAATTAA CTCAAAAACT
CAAGGAAATG ATGAACTCGA CGCCTACGCC TACACCGACC TTTTGGTACA GTATTAAGAA
CAAATCCAAG CTCAAAGAGG GCTCGCCTCA CCGCTATCTC AAGTCGCACA TTAAAAAAGG
CGCCTTCGGA ATCGCCACAG ATCAAGCCGT TATCGAAGAT GTCCGGAATG CTCCCAAACG
TTCCCGCGTG CGGAAAAAGG ACAGCACACC ACTTTTTGGG CTGGAGGGTT CCTTATCGCA
CGTAGTGTCG CAAGCTAAGA CGAAATGCGA AGAATGGGTC GGTGCGAACG CGGACAAGGA
GCTATATGCC AAGTTTTTCA AGGAAGATCT GCAACGACTC ATACCGTGGT GTGATTCGTA
TGTGCAGAAA AAGACCGACG CTGAGACAAA TGCTCTGCAT CAGCCTGATT TAAAGCAAAC
ATTCTTACGT GAGGCTCGTG TACACGATAT TCATGCAAAG CAGCTGGACG GAGCTTTGGT
TTCCTTGGTC AACCAACTTG GTAATGATAT GGTAGAGCGC TATCAGTTGG GACAGCAACA
AAAAGTTTAC GACGATCGTT TACGAGAGGT GGGATCTGTT GCGGCATCTG TCGCGAGTAG
TCGGTCTGGC CTACCTCCTC GAAGTCGATT GTTACCTCAC AATTGCAATG AGTCAAGCTC
GACCGGCCCG ACCATCATAA GTAGTGATAT GGTGAGTGCG CCCGGAGCTA ATAGCATGAG
GACGCTGTCG ACCGGGGCCC TCTCCAATGA AAGCGACACT CCACCGCACA CATCGTACCA
CATGCCGTAC GTGAGGCCCG AGGACCCATC GGGTGTTTTC ACGCAATCTC TTTCTGTACG
CTTTTCCAAA GCATTGAATG TTTCGCCAAA GCGCGTCAGG GATTGTGATC CAGCCAGCGT
TCCTCTTGAA CTACGATTAC GGAACCCGAG TAGCATGGAC GACCATTCAG TCTCTTCAGT
GTCCAAAGAG TCCTCCGCAA TTCGAAGCGG AAGTTGTGGC GCTTTCCGTC CTATCATACA
GACGTTGCAA CATCGACGAG AGAACGAGGC AGACGAAGCT GCAGCGAGAG AAACTGTGTT
GGCCAATAGT GACGTTCTCC ATGTTCGACC TGGTAGTCCC GCAGATTCAA GTTTAGCCAG
CGAAGATTTG CTTGCTGATG ACTCTTGGAT TGTTCCAGTT CCTTATAACA ATTCGGCTTG
CAGTCGGGCA AAGCACATCG ACACTGTGCG TACACTTGAT CTAGACTCGG ACAACAGTCT
CGAAGGCGAG CCTACTGAAG ATTTGGACTG GAATAAAGCG TTAGCGAATC ATCTGAAGAC
TCTTGAAGAT CGCGAAGCAA GATTGCGAAA GTATCATCCT GATACTGCCA AGTCTTATAA
TAATCTCGGT CATATTTATT CGAAGCTCGG AAATTGGAAT GAAGCGCTTC ATTATCATCG
GATGGCGCTC GAAGTACGGG AGTCGGTTCT TGGCAAAGAG AATCTTGACA CGGTAAGATC
GTACAGTAGC ATGGGATACG TATTTTTCAA AAAATGCGAC TGGGACGAGG CCCTTTTGTA
TTACCGAATG GCGTTGGAGG TCCAACAGAC TGTGCTGGGT AAAAGTCACG GGGATACTGC
AAAATCGCAC AAGAGCATAG CTGTGGTCTT GTCCAAAAAG GGCGCCTGGA ACGAAGCATT
GCGGCACCAT CGGATGGCAC TCGAGGCGCG AGAAGCTCGA TTGTCGAGCA GATCACAAAA
TGATACTCCA GGTAGCTTTG GTCGGGTCCG AGCTTCGTGG CAAGGACGTG TGCGTAGTAT
GCCGTCTAGT ATAGCGGAAG AATAG
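
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a rough sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the sequence, so its GC value is not expected to equal the 49% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and newlines from a block-formatted sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "CAGCACTCTA CTTCCCGTGT TGACAATCTA GAAGCACTCG ATAAAATTTT GGTTTTGTCG "
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on the first line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Passing the full cleaned gene sequence to `gc_percent` should give a value that can be compared with the GC content field.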
 
Protein sequence
MGDNLSEFPG VKDLLQLWSN DNGDMMMMDG MRSPIVRRQD RVDTAYDEEE DDYTPIPIEL 
CSVDAESVDE SCDNRSRASQ KRRRVSGATG TISSQTGSNI YRDALTVTAS AMLYQNRNSE
FIDECDVILG DPHTPTIGTN VYRQIRQVYA KRKIKQSDLA IIREQLTQKL KEMMNSTPTP
TPTFWYSIKN KSKLKEGSPH RYLKSHIKKG AFGIATDQAV IEDVRNAPKR SRVRKKDSTP
LFGLEGSLSH VVSQAKTKCE EWVGANADKE LYAKFFKEDL QRLIPWCDSY VQKKTDAETN
ALHQPDLKQT FLREARVHDI HAKQLDGALV SLVNQLGNDM VERYQLGQQQ KVYDDRLREV
GSVAASVASS RSGLPPRSRL LPHNCNESSS TGPTIISSDM VSAPGANSMR TLSTGALSNE
SDTPPHTSYH MPYVRPEDPS GVFTQSLSVR FSKALNVSPK RVRDCDPASV PLELRLRNPS
SMDDHSVSSV SKESSAIRSG SCGAFRPIIQ TLQHRRENEA DEAAARETVL ANSDVLHVRP
GSPADSSLAS EDLLADDSWI VPVPYNNSAC SRAKHIDTVR TLDLDSDNSL EGEPTEDLDW
NKALANHLKT LEDREARLRK YHPDTAKSYN NLGHIYSKLG NWNEALHYHR MALEVRESVL
GKENLDTVRS YSSMGYVFFK KCDWDEALLY YRMALEVQQT VLGKSHGDTA KSHKSIAVVL
SKKGAWNEAL RHHRMALEAR EARLSSRSQN DTPGSFGRVR ASWQGRVRSM PSSIAEE
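
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). That choice is an assumption, because the Translation table field in the record is empty. The `translate` function and `CODON_TABLE` name are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First ten codons of the coding region, as they appear in the gene sequence.
print(translate("ATGGGAGACA ATCTATCAGA ATTTCCCGGC"))  # MGDNLSEFPG
# Last five codons of the gene sequence, ending in the stop codon TAG.
print(translate("ATAGCGGAAGAATAG"))                   # IAEE*
```

Both fragments agree with the start (MGDNLSEFPG) and end (SIAEE) of the protein sequence listed above.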