Gene PHATRDRAFT_47890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47890 
Symbol 
ID7203158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp361729 
End bp363052 
Gene Length1324 bp 
Protein Length402 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182378 
Protein GI219124159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCGTGAGTC TGCTATTCTA GCATCTTTCG TAAATCTTGG ACAGACAGGC CGTCCCTCGC 
GAAGTGATTC GCCATTATTG AACAGCATGA AGCAAAATCT CTGTTATCTC GTAGTTGCGG
TGCTGTTACC GCAGGAGATT TTGGCCTTTC TGCCAGTCGC TCATCCCTTT TTTCGACCAT
CCTCAAGCAA TCTATTTTTA GCCCGAGAGA GAGGCGACGC GACCACAGCG AAAAGTCGCA
AGCGATCTAA AGTAACCGAC CCAGCTGGTC CGACTCCACA ACTGGAAAGC GACGAAATTG
AGGAAATCGA TCCCGAATCC GTGGAAGAAC TCCGTGACAT TCGGAGTCAG TCCGAACTTC
CGCATCCAGT TCCACACCAA CCCTGGCGTC GTGGCGATAC GGCCGGTTGT GAAGCTCCTA
TTGCGGCCGA GTGGAGACAA GAAGCAGAGG ATCTCATTAT CAAAGCTGTG GCGTTTGTCG
GTGGTCGTGT TCTTGACGTT ACGTGGTTTC TGACACAACT AGTTGTGACA ATTGACGAAG
AGTCCATGCC TCCCCGCGAT TTCCTGAAAG CCGAAGGCCC CGTCATCAAC GTCCAAGACC
CATCAGTGCC GCGCTTTTAC GACCCAGATG ACCCAACTCC GGAAGATATA TGGGATGACG
AAGAAGACTT CTTGTATCAA CGTGAGACGG AAGAGGAAGC AGCGAATGCA GAAACTCGCC
GTAACAATTT GTATGCAACG AAGGATGCTG ACGATGACCC CGACGAGCCG CACAATCCCG
ATATGGCGGA CGGGGATGAC GCACCGCGAC TCCGCAATGT GGAAACTAGA GACGAAGTTG
CCTACGGGGT GGCTCTCGAA GAGGAGAACC GATTTGAAGA ATTGGAGAAG CCGATTGATT
TAGATACTCT GCAACTAGAT AAAGCGGGGC TTTCCACTAT TGCCAATGCT ATTCTGGATG
TCCTTGGTGA CGCTGAGGAG GAGTTGCAGA TACTTAGTCG TCACGAGCTT ATTTTGACAA
GTCCGGGACC TGTGGATGTG TTGGAAACAC AGCGGCAGTT TGACGCGTAT CGGGATAAGG
ACGTGATGGT GGAAACGCAA GACCCGTTCA ACTCAAATCG TACTCTTAAG GGGAAGCTTG
TGGATCGCAA CGCCATGGAC TTGATTATCA ACAAGAAGGG ACGCATGGTG ACAATTCCTC
TCAATTTCGT CAAGTGCGTA AGATTGCCTC CTCACGAACT TAACAAAGAG TACGACGCTG
GAGAGATAGA AGCGTACGAA GAAGAATTAG AGTAAATCAA TTGTAAGCCT TTCTACTCTA
TGCG
 
Protein sequence
MKQNLCYLVV AVLLPQEILA FLPVAHPFFR PSSSNLFLAR ERGDATTAKS RKRSKVTDPA 
GPTPQLESDE IEEIDPESVE ELRDIRSQSE LPHPVPHQPW RRGDTAGCEA PIAAEWRQEA
EDLIIKAVAF VGGRVLDVTW FLTQLVVTID EESMPPRDFL KAEGPVINVQ DPSVPRFYDP
DDPTPEDIWD DEEDFLYQRE TEEEAANAET RRNNLYATKD ADDDPDEPHN PDMADGDDAP
RLRNVETRDE VAYGVALEEE NRFEELEKPI DLDTLQLDKA GLSTIANAIL DVLGDAEEEL
QILSRHELIL TSPGPVDVLE TQRQFDAYRD KDVMVETQDP FNSNRTLKGK LVDRNAMDLI
INKKGRMVTI PLNFVKCVRL PPHELNKEYD AGEIEAYEEE LE