Gene PHATRDRAFT_26970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26970 
Symbol 
ID7199997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp980366 
End bp982778 
Gene Length2413 bp 
Protein Length654 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179552 
Protein GI219117515 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GACCTTCTTT TACCATTCGT TAGTCACAAT AGGAATAGCA AAGGGTAGCA CCGGTTCGGT 
TTCCAGTCCA CTCGAATTTG CGAAAAAGCG AATACGCCTT ATTCACCACA TAAAGTCTCG
CACAAGAGTA TAAAAGTCTC ACTCACAAGT CACTGCATTG GCTACCAAAC CAACAGGCAG
GTATCCAGTA GAGCAGACGT TCGGACCCAA ATCTCTTTCA TCGCTTGCTC ATACAGCATC
TAATCAGCAA CCAACTGGAT CGATAAACAT CACCGAAGAA CAAAAACGCG AGGGATAGCA
AGTTCTGAAC TCTCAAGTGT TTACCTTTTC GCATTCCAAC GAACTCAACA CCACTTTCTA
GACTCTTTAC TTCTCCACGA CAACTTCCGC ATCAAGATGA AGTGGGAATC TACCGCATTC
GTGCTTCTAC AGCTCATTGT CACTACTCAC GCCTACGCCT TCCCCAAGCC AACACAGAGT
TCTCGTTGGG CTACCCCTGC CACGACTGGA AAGTCCTCAG CCGCCAAACA CTTTGTGTCG
CGCTCGAGGA GCACTACCAG TCTCGCCGCG TCCACCCAGA AAAAGCCAGA AGAACTCCGT
CGGGAGATTG CGGAGCGGAA TTCTCTCGTA GAGGATGAAG CACAGTATGC CGTAGCGGAC
GGAGAATTGT TGGAGCGCAT GGGCACTTCG GATGCGGTAG CAGCCGAGGA GACCGACGTC
AAGACGGACT ACACCGACAT GTACTCGCGC ATGAAGCGCA TGACCAAACC GAGGGCGTAC
CCGCTCTTTT TGGCAGAAAA AGGCGTTGAA TTCTTGGAAG GAACCGTGCA CGATATTGCC
AAATCCTTCC AACGCACCGC CGAAACGGGA GCGGCCACTT CTACCAGTGA CGTCAACGGA
AGCGTGGGAC AAAAGGAGCG AGTCGTGGTT CTCGGGACAG GCTGGGGCTC GGCTTCTCTA
TTGAAAGAAA TCGACACCGA CCTGTACGAT GTTACCGTCA TTTCTCCCCG AAATTACTTT
CTTTTCACCC CGATGCTCGC TGGTGCCAGT GTCGGTACAG TGGAATACCG TTCCATTACT
GAACCCATCC GGGCGATCAA TCCGCAAGCC AATTTCTTGG AGGCCACCGC CACGAACATT
GATACGAAAA CAAACACAGT CACCTGCGAG TCCGTCATTT GCGAAGGCAA TAGTTGTGAT
ATCCAAGATT TCAGCGTTCA ATACGATCGT CTCGTAGTGG CGGTGGGAGC TCAAACCAAC
ACGTTTGGCA TTCCTGGAGT CAAGGAATAC TGCAACTATT TGCGACAGGT TGAAGACGCA
CGTCGCGTAC GAACCTCCAT CATCAACTGC TTTGAACGAG CTAACTTACC GGGTCTTTCT
GACGAAGAGA GAATTCGCAA CCTTACTTTT GCGGTGATTG GTGCTGGTCC TACCGGGATC
GAGTTTGCCG CCGAGCTGCG TGATTTTGTT GAGGAAGACG GCCCCAAGTA CTATCCGAAG
CTTCTCCAGT ACGTGCGCAT CAAGGTCATT GAAGCGTCGC CGATGGTTTT GGCGCCTTTC
GACAAAGAGC TCCAGCAAGA AGCCATTGCC CAGCTGAAGC GTCCTACCAT GATTTCGGAC
CCCAAAGTAG CGAAGTTACT GCCGCCCAAT TTTCAAATGA CAGAACTCTT GTTGGAAGCT
TCCGTCAAGG AAGTCAAGGA GGATCGTATT TTACTGAACA ATGGCCAAGA AATTCCGTAC
GGTATCGCTG TTTGGGCAGC TGGCAATGGT CCGATTCCTC TGACACTGCA GTTGATTGAA
AGTCTCGGCG ATGAACAAGC GTCGGCACAA GCCGTTGCAC GGGGACGTGT CGCTGTGGAT
TGCTGGATGC GGGCCATTGG CGGTCAAGGC AAAGTACTGT CCTTTGGTGA TTGCTCATGC
ATGTTCCAGC AGCAGCTTCC AGCGACGGCG CAAGTAGCCT CACAGCAGGG GGAATATTTG
GCCAAGCTTT TGAACAAAAA GTTTGAGTTC ACGCCGGCTC TGACTGAAGA TGGCATCTTC
CCGCCACCGC GGAAAGACCC CGCCCGGACA CAAACCAGCT TTTCCGACGC GATTGCTGCA
TTTGCGTCGA ATAACTACGA ATACGCCAAA CCGTTCCAAT TCTTGAATTT GGGCATTTTA
GCTTATACTG GTGGGGGTTC TGCTTTGGCG CAGGTGACAC CCGTGCCGGA TGGTGCTTCG
GTCCAGGGCA AGGGCAAACT CGGCAACGCG TTGTGGCGCA GTGTCTACTT GACCAAGCAA
GTGAGTTGGC GCAACCGACT GCTCGTGATG AATGACTGGA CCAAGCGTCG ATTGTTTGGA
CGAGACATTA CGCGACTTTA GAAATAACAA CAGACTGATA TAATTTGCAA AACAATTACA
CTTTACTCTT TTC
 
Protein sequence
MKWESTAFVL LQLIVTTHAY AFPKPTQSSR WATPATTGKS SAAKHFVSRS RSTTSLAAST 
QKKPEELRRE IAERNSLVED EAQYAVADGE LLERMGTSDA VAAEETDVKT DYTDMYSRMK
RMTKPRAYPL FLAEKGVEFL EGTVHDIAKS FQRTAETGAA TSTSDVNGSV GQKERVVVLG
TGWGSASLLK EIDTDLYDVT VISPRNYFLF TPMLAGASVG TVEYRSITEP IRAINPQANF
LEATATNIDT KTNTVTCESV ICEGNSCDIQ DFSVQYDRLV VAVGAQTNTF GIPGVKEYCN
YLRQVEDARR VRTSIINCFE RANLPGLSDE ERIRNLTFAV IGAGPTGIEF AAELRDFVEE
DGPKYYPKLL QYVRIKVIEA SPMVLAPFDK ELQQEAIAQL KRPTMISDPK VAKLLPPNFQ
MTELLLEASV KEVKEDRILL NNGQEIPYGI AVWAAGNGPI PLTLQLIESL GDEQASAQAV
ARGRVAVDCW MRAIGGQGKV LSFGDCSCMF QQQLPATAQV ASQQGEYLAK LLNKKFEFTP
ALTEDGIFPP PRKDPARTQT SFSDAIAAFA SNNYEYAKPF QFLNLGILAY TGGGSALAQV
TPVPDGASVQ GKGKLGNALW RSVYLTKQVS WRNRLLVMND WTKRRLFGRD ITRL