Gene PHATRDRAFT_16786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16786 
Symbol 
ID7198971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp254544 
End bp256691 
Gene Length2148 bp 
Protein Length716 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185158 
Protein GI219129989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCCCC CGGAGGGCGC CGAAAAGTAT GCCTTTGAAG CGGAAGTGCA CCGTATGATG 
GATATTGTCG TCAATTCTCT CTACCAAAAC AAAGACGTCT TTCTGCGTGA ACTCATCTCC
AACGCCAGCG ACGCACTCGA CAAGTTTCGT TACCTCGCTC TCACGGAACC GGACACGTAC
AAGGGTGAAG AAGAGATTCC TCTCCAAGTC AAGATCCAGT ACGATGCCGA CGAACACACT
CTGACTATTC GGGATACCGG CGTCGGTATG ACTCACGACG AAATGGTGGA GAATCTCGGA
ACCGTCGCTC GATCGGGAAC GACCAAATTC ATTCAGTCGC TCAAGGAAAG CGGCAACGAC
GATTCCGCCA TGTCCCAGAT TGGTCAGTTC GGTGTCGGTT TCTATTCCAC CTTTTTGGTA
GCCGATCGCG TCACAGTGGC TTCCAAAAAT CCACGCGATG ACGCCCAACA CGTCTGGGAG
AGTGAGAATG CATCGTCCAG TTTTGTGGTG TACCCGGATC CCCGTGGCAA CACGTTGGGC
CGGGGGACCG AAATTACGCT CCATCTCAAG GAAGACTCGT TGGAATACGC CGACCCAAAT
CGTCTCCGTG AACTTGCCCA ACACTATTCC GAATTTGTCA TGCATCCCAT TTCGTTGCGA
ACGACCAGTA CCATGGAGGT CGAGATCGAA GACGAAGAGG AGGAAGCCAC GCCCGAAACG
ACCGAGGAGG CCAAGGAGAC CAAAGGAGAT GACGGCGACG ACGAAATTGA AGTCGGTGAC
GACGAGTCCG AAACGGAAGA GAAGCCGAAA AAGACCAAAG AAGTTACAAC GCACTCTTGG
GACGTTCTCA ACACTAACCA AGCTATTTGG ACACGTGAAA AGGAAGATAT TTCGGACGAC
GAATACCAAG CCTTTTGGCA AGTTCTGGCA CACGAGGCCA CCAGCAACGC TACCTCCTGG
TCCCACTTCA ACGCCGAAGG AAACATCAAT TTTAAGTCAA TTCTCTACTT ACCCGACGAC
CTTCCCCCCA CGTATTCGTA CACCAACATG GAACCCGTGC AAGGGGCTCT GAGGCTGTAC
GTGCGTAAGG TATTGATCGG GGACGAGTTT AACTTGCTTC CCAAGTACCT GGGTTTTATT
CGGGGCGTGG TCGATTCGGA TGACTTGCCT TTGAACGTCA ATCGCGAAAC GCTGCAAGAA
AGCAAGATTC TATCTGTGAT CAAGAAAAAG TTGGTGCGCA AGGCCATCGA GATGATACGC
CAATTGGCCA AGGACAGTGA AGACGAAGGT CAGTCCGAAG CGGAAATTGA CGAAGAAGGA
AACGTCATTG AAACAGAAGA AAAGGACTCT AGATACATTG CATTTTACCG CAAGTTCTCT
CCAAACATCA AGCTGGGTGT AGTTGAGGAC GAACCGAACC GCGGAAAGTT GATGAAGTTG
CTCCGTTTCC AGACTTCCAA ATCAGATGGT AAGATGATTT CGCTGGCTGC GTATTTCGAC
AACATGAAAG AGTGGCAAGA GGAAATCTAC ATTTTGGGTG GTGCTTCGGC TGAAGAGATT
GAGAAGTCAC CGTTTTTGGA GACATTCCGT GACAAAGACG TGGAAGTCAT ATATCTAACC
GACTCTATCG ACGAGTACAT GCTCCGCCAA GTTCGCGATC ACGAGAAGAA AAAATTTGTG
CAGATCTCGA GTGAAAGCGT CAAGTTCAAG GACGAAGATG AAGACTTGAT TAAACGCCGC
GAGAAGGCCT ACAAGAACAC ATTTAAGCCT TTGACGAAGT GGTTGCGAAA GCTATACACT
GGCTCCATTC TTCGAGTGCA GGTGGCCCGC CGTAGTCTCG GATCGGTACC CGCCATTGTT
ACTAGCTCAG ATTTCGGAAA CTCGGCAAAT ATGGAGCGGA TTCTTCGCGC CCAGGCTTTC
CAGCACGGCA TGGACCAGTC GTCGTTTTTG GCGTTGAAGG TCTTTGAATT GAACCCTCGC
CATCCGCTTG TGAAAAAGTT ATTGGACGGA TGTCCACCGG AAGAAGAGGG CGACGAACCC
TTCGAAGTGG CTCCGGATAT CCTGGATGCT GCATGGATGC TTCACGATAT GGCAATGCTC
AATGGCGGCT TCCCGATCAG TGACCCAGAG GCACACAACC AGCGCCTC
 
Protein sequence
MVPPEGAEKY AFEAEVHRMM DIVVNSLYQN KDVFLRELIS NASDALDKFR YLALTEPDTY 
KGEEEIPLQV KIQYDADEHT LTIRDTGVGM THDEMVENLG TVARSGTTKF IQSLKESGND
DSAMSQIGQF GVGFYSTFLV ADRVTVASKN PRDDAQHVWE SENASSSFVV YPDPRGNTLG
RGTEITLHLK EDSLEYADPN RLRELAQHYS EFVMHPISLR TTSTMEVEIE DEEEEATPET
TEEAKETKGD DGDDEIEVGD DESETEEKPK KTKEVTTHSW DVLNTNQAIW TREKEDISDD
EYQAFWQVLA HEATSNATSW SHFNAEGNIN FKSILYLPDD LPPTYSYTNM EPVQGALRLY
VRKVLIGDEF NLLPKYLGFI RGVVDSDDLP LNVNRETLQE SKILSVIKKK LVRKAIEMIR
QLAKDSEDEG QSEAEIDEEG NVIETEEKDS RYIAFYRKFS PNIKLGVVED EPNRGKLMKL
LRFQTSKSDG KMISLAAYFD NMKEWQEEIY ILGGASAEEI EKSPFLETFR DKDVEVIYLT
DSIDEYMLRQ VRDHEKKKFV QISSESVKFK DEDEDLIKRR EKAYKNTFKP LTKWLRKLYT
GSILRVQVAR RSLGSVPAIV TSSDFGNSAN MERILRAQAF QHGMDQSSFL ALKVFELNPR
HPLVKKLLDG CPPEEEGDEP FEVAPDILDA AWMLHDMAML NGGFPISDPE AHNQRL