Gene PHATRDRAFT_20135 details


Gene Information

Locus tag: PHATRDRAFT_20135
Symbol:
ID: 7200463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 924656
End bp: 926786
Gene Length: 2131 bp
Protein Length: 690 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002179940
Protein GI: 219118326
COG category:
COG ID:
TIGRFAM ID:
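
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names span_length and coding_length are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 924656
END_BP = 926786
GENE_LENGTH_BP = 2131
PROTEIN_LENGTH_AA = 690


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span that the coding region alone does not account for.
extra_bp = gene_len - cds_len

print(gene_len == GENE_LENGTH_BP)  # True
print(cds_len, extra_bp)           # 2073 58
```

Under these assumptions the listed gene length matches the coordinates, and the gene span is 58 bp longer than the 2073 bp needed to encode 690 residues plus a stop codon. That would be consistent with the listed gene sequence including some bases outside the coding region.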


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGAACACC GGAGCACAAT CTAAACTGAC AGTGCGTAAT CGTCACCACA ACGCGGTAAT 
GTCCAGGCAA GCTTCCCAAG TTAAGAATCG GGCTCCGGCA CCGATTCAAA TTTCGGCCGA
ACAAATTTTA CGCGAAGCCG CCGACCGCCA ACAAGCGCAC GAAATCGAGC CGATCGTGAA
AATCCATGAC GCCGAGGAAT ACCAGGCTCA TTTACGCGAT CGAAGGAAGC ACTACGAGGA
TAATATTCGT TACCGACGGG AAGATGTAGG GAATTGGGTC AAGTACGCAC GATTTGAGGA
AGAAAATAAA GAATTTGAAC GGGCACGGTC CGTTTACGAA CGATCTCTCG AAGTGGACCA
TCGCTCGGCG CAATTATGGT TGCGGTACGC CGAGTTTGAG ATGCGGCAAG AATTTATCAA
TCACGCACGG AATGTGTTGG ACCGGGCCGT CCAGATCTTG CCCCGTGTCG ACTTTTTGTG
GTACAAATAC GTATACATGG AAGAAATGGT CGGGGATCTG CCCAAAACAA GAGCTGTTTT
CGAGCGATGG ATGGAATGGA TGCCTGATGA CAACGGTTGG TTGAGTTACG CTCGCTTTGA
AACACGTTGT GGAAATGTAA CACAAGCCGA CAGCATCATG CGGAGATATG TAAATACATA
TCCGTCCGCG AGGGCATTTC TGCGATTCGC CAAGTGGGCC GAGTTTGAGG CCAAGGACGT
TGCCTTGGCA CGCACCATTT TCGAATCCGC CTTATCCGAA TTGGAGCCCG AAGAATCTCG
GCAAGCTCGA GTTTTCAAAC AATTTGCGTC TTTTGAAGAG CGACAGAGGG AATACGATCG
AGCAAGAGTC ATTTACAAGC ACGCCCTTTC CTTACTCCAC CTTGGCGAGA CACCGTCGTT
AGCTGATGAG GAAGACTTGA CTAACGCCGA GCGCACCAAG CGAGAGGAAC TGTACAAAGC
CTACATCACG TTTGAAAAGA AACACGGAGA TCGCCAAGGA ATTGAAGACG TCATTGTTAC
GAAGCAACGC GCGCAATATA GGGAGCGGGC AGCGGAACAT CCCTTTGACT ACGACTGCTG
GTTCGAATGG GCCAAACTGG AAGAAGAACA CGGTAGCGTT TCGGCAGTTC GCGAAACTTA
CGAAAAAGCC GTGGCAAATG TACCACCTTC GGAACAGAAA GATCATTGGC GGCGGTACAT
CTATCTATGG ATATATTATG CTGTATATGA AGAACTTGTG AATGCCGACT TAGATAGGGC
CTTCCAAGTT TACGAAACCT GCTTGAGCAT CATACCCCAC AAGAAATTCA GTTTTGCCAA
AATATGGATA CAAGCAGCCA AGTTATTGAT TCGACGTCGG GAGCTTACGG CTGCGAGAAG
ACTCTTGGGT AGAGCGATCG GACAGTGTGG TAAGGAGCGC ATTTTCATTG AGTACGTTGC
ACTTGAGCTA GCGCTGGGTG AAGTTGATCG ATGCCGCAAT CTATATAGCA ACTATCTCAA
GGCAATGCCA CACAATTGCA AAGCATGGTT CAAGTATGCT GATCTGGAAA AGTCGGTTGG
CGAAACAGAA CGTTGCAGGG CTATTTTCGA ATTGGCTATT GCACAACCCG CTTTGGACAT
GCCGGAAATG CTCTGGAAAG GGTATATAGA TTTTGAAATC GAAGAAAATG AAGGGGAGAA
TGCTCGAAAG CTATATGAAC GGCTGTTGGA GCGAACAAGT CACGTGAAAG TCTGGATATC
GTACGCACAA TTCGAAGGTA CCGACATTGG CAAGGGGTTG GAAGGAGCTC GCGCAGTGTT
CGAGCAAGCC TACGATCACC TCAAAGCCCA AGGGCTCAGT GAAGAACGAG TGTTGCTGTT
GGATGCTTGG CGAGTATTTG AGAAGAGCAA TGGTAGCCAA AAAGACGTGG CAGATGTGGA
GGCCAAGATG CCGCGGAGAA TCAAGAGAAA GCGTATGCGC GAAGACGAAA GCGGCAAAGA
TCTTGGCTGG GAAGAATATT TCGACTATCA GTTTCCAGAC GATGAAGGCG GCGCTTCCAA
CAACTTCAAA ATTTTGGAGA TGGCTGCAAA GTGGAAGCAG CAAAGGGCCG AAGCAAGCGA
TGATGATTCG GATCTTGACA GTAATGAATA A
 
Protein sequence
MSRQASQVKN RAPAPIQISA EQILREAADR QQAHEIEPIV KIHDAEEYQA HLRDRRKHYE 
DNIRYRREDV GNWVKYARFE EENKEFERAR SVYERSLEVD HRSAQLWLRY AEFEMRQEFI
NHARNVLDRA VQILPRVDFL WYKYVYMEEM VGDLPKTRAV FERWMEWMPD DNGWLSYARF
ETRCGNVTQA DSIMRRYVNT YPSARAFLRF AKWAEFEAKD VALARTIFES ALSELEPEES
RQARVFKQFA SFEERQREYD RARVIYKHAL SLLHLGETPS LADEEDLTNA ERTKREELYK
AYITFEKKHG DRQGIEDVIV TKQRAQYRER AAEHPFDYDC WFEWAKLEEE HGSVSAVRET
YEKAVANVPP SEQKDHWRRY IYLWIYYAVY EELVNADLDR AFQVYETCLS IIPHKKFSFA
KIWIQAAKLL IRRRELTAAR RLLGRAIGQC GKERIFIEYV ALELALGEVD RCRNLYSNYL
KAMPHNCKAW FKYADLEKSV GETERCRAIF ELAIAQPALD MPEMLWKGYI DFEIEENEGE
NARKLYERLL ERTSHVKVWI SYAQFEGTDI GKGLEGARAV FEQAYDHLKA QGLSEERVLL
LDAWRVFEKS NGSQKDVADV EAKMPRRIKR KRMREDESGK DLGWEEYFDY QFPDDEGGAS
NNFKILEMAA KWKQQRAEAS DDDSDLDSNE