Gene PHATRDRAFT_46164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46164 
Symbol 
ID7201372 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp454922 
End bp456667 
Gene Length1746 bp 
Protein Length460 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180639 
Protein GI219119772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCTGGTAGGC GAGTCAAGTA TTCGTCAGCC TTTGGTAGTC TCGAAACTCT TATTCTTTGT 
GATAACTGTT GATAAGGGAC AGAATAACAG CCAGGCTACC ATGTTCAAAG TCAAACGGCG
TTTGTCGAAG CTATCCAAAA ACAAACAACC AGATTTGGCT GTATCACAAC AGTCTCTGCC
AGTCCAGCAT CCAGCGATTT TGTCTCCTCC TTTACACAGT GGATCGGACG TTTCTTCCTT
GCAATCTTCG AATGCAGCAC TACGCAGTTG CTTGCGTGCT TCGGAAAGTA GCGGTGGCAG
TCACGATGCA TCCTCCGCTC GCGATAGTCA AATTCAACAA AAGCAGCCGT TGCAATCCGC
TGAAGATCAA TCAGGGCTCA TTTATGGCCG GCCACGCAAT GCGAACGGAA GCAGCCAACA
TAGTGGTGAT AGTAGTTCTG GAAGATTGCG AAAAGTTCGA TTTCGTCATG TCCAGGTTCG
CGAGTTTGAA CGTATAATTG GAGATAATCC GTCGTGTTCC AGTGGAGCAC CCGTAGCGTA
AGTTTCAATC AATTAGTTGA TCGCATCGCG TGTTTTACGC TGGTAGAACA ATGGTTTGGC
GACTTTACAG CTCGATTGAA ACTTGTGAAT TCACTTTCGA TGTTTTACTC TTGCCTACAC
ATACAGGTTG GGCTGGGCCC ATAGTCGGGA TCGCACGATG CGTTTGGACG ACTACGAATC
GGTACGGCCA TCGCGTCGGT CGCAGTTGGA CTTGATTTTA ACCCGGCAAG ATCGGGAGGA
GCTCTTACTG GAATGGGGCT CAACCTTTCA GCAGATTATT GACGCCATCC GATCCAACAT
TCGGGTAAAG AATCAACGCC GGCGGACGGT CAACAACATA GGCACCTATG ATCGTTGGGA
GGAAGCCATG GAGAATGCTG GTCGGAAAAT CAAACGTACG CTACTCCTCA AGAAATCTAC
TAAGCAGCGT GTTGAAGAAA TGACTGCGCA ATCGAATACA ATTCGCGTTG TATCGCAGCA
CGAAGTGCAA GGTAACCGCA ACGCTAGTGA AGTTAACGCA ATCACGCCGC GACGTCGGCA
TTCCGAGGAC GATTCAGAAA AATCCCCTCA ATTGACATCG CAATCTGATA TTTCGAGCAA
TGTTGATCCC GCAACGATCC TGGAACCAGA AGACGCTGCG TTTCTGGTTG GAGTTGAAGA
GGGGAAATCC TTAGGAGGCT CATCGTTCTT AAGTGTAGAC AGCAACAGAC CAACATCTGT
TATTGAAGTC GGCATTGTGG AGCGCACGAC GCTCACAGAA GATTTCTATT CTTACATGGA
AGAAATGACA GCTACTTCCG GACTTACGGG CATTTCAGGA GCAACACACG AAGACCAATT
TTCAAATGAT GGCTTTGAAA TGCTGGACCG GGACAATTCT TTCTGGGAGG TTGATGAAGA
TCGTTCCGAT TTCCCCCGTA TACGCCGCAT GGTTACCCCT ATGGTCATCT CCGAAGATGG
AGCAGCGTTT GATGTCCTGA ATCAGTACGA GCAATGGGAC AGCAATGGTG GCTTTTCGGG
TGCTCAGCAA CCGCCGCCAT ATTCTAACTC CATCATTAAC AAGTGGGAGT AGTGGACTAC
GCATGTTATT CATTGTCAGT CCATGGATGT TTACAACAGA TTTCGAAGTG CAAGGGTTCT
CACACACAAC GTGGAAGCAC AGCGTTACGT TGTACAGTCG ATGGCACGCT CATAATGGGA
TTGACT
 
Protein sequence
MFKVKRRLSK LSKNKQPDLA VSQQSLPVQH PAILSPPLHS GSDVSSLQSS NAALRSCLRA 
SESSGGSHDA SSARDSQIQQ KQPLQSAEDQ SGLIYGRPRN ANGSSQHSGD SSSGRLRKVR
FRHVQVREFE RIIGDNPSCS SGAPVALGWA HSRDRTMRLD DYESVRPSRR SQLDLILTRQ
DREELLLEWG STFQQIIDAI RSNIRVKNQR RRTVNNIGTY DRWEEAMENA GRKIKRTLLL
KKSTKQRVEE MTAQSNTIRV VSQHEVQGNR NASEVNAITP RRRHSEDDSE KSPQLTSQSD
ISSNVDPATI LEPEDAAFLV GVEEGKSLGG SSFLSVDSNR PTSVIEVGIV ERTTLTEDFY
SYMEEMTATS GLTGISGATH EDQFSNDGFE MLDRDNSFWE VDEDRSDFPR IRRMVTPMVI
SEDGAAFDVL NQYEQWDSNG GFSGAQQPPP YSNSIINKWE