Gene PHATRDRAFT_35055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35055 
Symbol 
ID7199995 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp968830 
End bp971321 
Gene Length2492 bp 
Protein Length740 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179328 
Protein GI219117067 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGAG CGAGTCCGAG ATTGCAAGAA CCGTCGCCAT CTGCGGCGTC CGTGTCGGCT 
CTGGCGGCTC CGGCTCCATG GTTTGGGCTG CATCCTGCCG GACCCACCAC GCGTACCAGC
CGCGCGGCGG CCTTGTCCTG CCTCGGACGC CTCTGTAGTG TCGTCGTGCT TGCCATATTA
GCTAGTAGTC ACGTACATCG TGTTTGGCAG CAGTATCCGG AGTGGCTTCC CACGTCGTCG
CGCAATGCCA CTGCGTCCAT CACTCTGGAC TCCGCATCAA TTCACGCACA CACCGGACTC
GTCCAGGAGT CGCACCACCC GAACGACGGA TTCCAGCGCG AATCGGTCTC GTCGTCGGTT
TTGGGCGATG CGTACGACGA ACAAGGTCGG GCGGGGTACG TGGCGGATCC CACCGCCTTG
CGGCGCGAGC GGCAACGGTT TCGACACGCG AGGACCCGGG AGGCGTCGGA ATCCGGGGCA
ACGGAACCAT CAATCGAAGA TTCGTCGGAT TATTGGCACA TTCTGGAAAA TTTCGTACCG
TTCCGAGCAG ACCAGGATCC GTTGCTCAAC TCGCGGAATC ATCCGTTGCG TGCCAACGTA
TCTTCCGATT ATGTGTGTGC GTTCCCCCCG GGTCGGGGAC TCGAAGAGGA AGGTGGGTAC
AAGCTCTTGA CGGAAAAGAT ACGACTCCAG ACCGTGCACC GTCGAAACAC TACCACCACG
CCGCGACTGT TGTGTGCCGT GTACACGTAC CCACGAATGC GAGATTTGGC TCGAGCGTCG
GCATTGTCGT GGGGATACCA ATGCGATGGT TTCTTGGCCT TTACGACGGA AACGATTCCG
TCCTTGGGCT TTGTCCATCT ATCCCACGCT GGCAAAGAGT CCTACCGGAA CATGTGGCAG
AAGACGCGGT CCATCTGGTC CTACATTGCT CGACACTACG CGGACGATTA CGACTACTTT
CATTTGGGTG GGGACGACAT GTACGTGCTG GTGCCGAATT TGCGAGCCTT TTGGCAAGAC
GAGATTATCC CAGACAGCAT GGGTACTGGT TCCGGTGCCG GCCAGGACCA AGCCATCTTT
ACGGGTCAAC AAGTAGTGTT GCGTGAGGGG CAGCGTCCCT ACGTCTCGGG TGGTCCGGGA
TACACAATGA ATCGTGCCGC CTTGCATCGT TTGGTGAACG AGGCCTTGCC GGAGTGCGAG
GTGGATACGA TTGCCTCGCA CGAAGACCGT CTAGTCTCAC AGTGTTTCGA CCGCATCGGG
ATAAAACCGT GGGACAGTCG TGATGTACCC ACCGGATCCC AGCGCTACCA CGACTGCTCA
CCACACCACT TGTACACGTT CCGTGCGGTG ACTTCGTCCG GTCGCGGCCG CGGTTCGTTC
CACGCGCGTG CGGCCGCCTA CTGGGCGAGC CAGCCGCGGT TGGGTTCGAT GGTCGGAACC
AACGAGACGA CGGGTCCACG GTACGGCTTG GCAGCGGCCG CCACCCACAG TATTTCCTTT
CACGACGTGC ACAATCCGCT GTACGTGGCA CGCATGTACG CGTTGCTGCA TCCCGGGAGT
TGTCCGTCCG CGACGGCGCT CGGCCGTGGA CTCGCGCAGA GTCACGGGCA TCGTGCGGCG
GGTTTCTAGA GGACGCTTCC AGAGGAAAGT GTCTGTGTGT TTAATGTGTA AGTGTGTGTA
TAATTAGGTG GTGGTGTCGA TACATGTATG TATAGGGGGG TACTACAGAG ATCGAGAACG
AACGGTTGGG GTTGTACGTT TGGAATTAAC AACGGTAGTA TTAGTCTAGA GGAGTACCTA
TTTTGTAGTA TGGTTGGGCC GGTTCTCTCT CTCGGTAGCT CGGGCGCGAG GGTGGGGGTT
CCATGCGATC GTCTCCTTCC CTGGAGCCCG TAGGGAGGGT TGGCGCGTCG GCCGAAAATC
GAGTCTCGCA CGGCGCGCGG CCCAATGACG GCCGGCGTAC ACGACAGACC GGTTTCGGCC
TACCGCGAGA CATACGCGCC ATCTCGGCAC ACCGCGCGCG TACCAACAAA CCCTCACACC
CCAACTGACA AAACGAACAG GGCAGCACTC TCATATACCA CGACAACACT ATCGAGACCT
GACAGGGAAC ACGGTCGGTA CGAATCCCAA GATATATACA CACACTAGAC AAACACTTGC
TTGCAGAAAC ACTTTACTAC AAGAGAAAAG GGATGAATGC GTTGGAGGAG AGGCTTCCAA
CGTCGGGGAA CGCATCGGAC GACATGGACG ACGCGTCGCT GGACCGCGGC ATTCCGGGAA
CGGAGGACGA GGACGCGTCG GTGGCACGAG CTCGAGCACA CGTTGCCGTC CTCGATGAAC
CTACGGGTGA TCGAGTCCCA CGCGATACGA CGGGAGATCA AGCCGTCTTG GGAGCAGCTG
CGGCGCAGGA CGGGGTAACG GCCTTTGCCG TGACGGATGA AAGTGGACAA CTCGTCCTCG
CACGTTTTCA ACAGTTTCTC GGACAATTGT GA
 
Protein sequence
MRRASPRLQE PSPSAASVSA LAAPAPWFGL HPAGPTTRTS RAAALSCLGR LCSVVVLAIL 
ASSHVHRVWQ QYPEWLPTSS RNATASITLD SASIHAHTGL VQESHHPNDG FQRESVSSSV
LGDAYDEQGR AGYVADPTAL RRERQRFRHA RTREASESGA TEPSIEDSSD YWHILENFVP
FRADQDPLLN SRNHPLRANV SSDYVCAFPP GRGLEEEGGY KLLTEKIRLQ TVHRRNTTTT
PRLLCAVYTY PRMRDLARAS ALSWGYQCDG FLAFTTETIP SLGFVHLSHA GKESYRNMWQ
KTRSIWSYIA RHYADDYDYF HLGGDDMYVL VPNLRAFWQD EIIPDSMGTG SGAGQDQAIF
TGQQVVLREG QRPYVSGGPG YTMNRAALHR LVNEALPECE VDTIASHEDR LVSQCFDRIG
IKPWDSRDVP TGSQRYHDCS PHHLYTFRAV TSSGRGRGSF HARAAAYWAS QPRLGSMVGT
NETTGPRYGL AAAATHSISF HDVHNPLYVA RMYALLHPGS CPSATALGRG LAQSHGHRAA
GGYYRDRERT VGVLGREGGG SMRSSPSLEP VGRVGASAEN RVSHGARPND GRRTRQTGFG
LPRDIRAISA HRARQHSHIP RQHYRDLTGN TVETLYYKRK GMNALEERLP TSGNASDDMD
DASLDRGIPG TEDEDASVAR ARAHVAVLDE PTGDRVPRDT TGDQAVLGAA AAQDGVTAFA
VTDESGQLVL ARFQQFLGQL