Gene PHATRDRAFT_49451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49451 
Symbol 
ID7195810 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp279210 
End bp280534 
Gene Length1325 bp 
Protein Length389 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184103 
Protein GI219127773 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCCAT ATCGATCCAG TGATGACATG AAAGATGGCA CTGCGAGAAG AATGAACGGA 
AAGCGTTCGA GGGAAGCGAA GATACAAAAA TGGCAAAACA AACAAACGAA GTTAGAAAAG
AGAAGAGAAG AAATTAGTAA TCGGGACGAG TGGATCGAAA AGCATTCAGA CTTTGTTTCG
GTGATAAACC TAGAGGACGT GAGAGATGCA AAGAACATCG CGTCCGGAAA CGCTCTTCGC
GAGCGTTTGT CTCCATACTT TAATCTACAG GACAACTCAT TGGTTTCCCG AGCGAAAGAT
GGCCGACGGC GCTTTATTGC TGAAGGAACA GAAACTGTCC GACTCCTGAT GCAGCAATTA
ACTGTAAGCA ACAATTCTTC TTCCGGACTT TTTCCGGTTG AGGTTGAGTC CATCTTTGTA
AAGCCGAGTG TCTTCTTCGA CCCTCCTGTT TCGCTCATTT TCGACTTTCA GAAGATGATT
GACTTGACAA AGCATACTAC TGTTTGTGTA AGCGAGGCTG CTAAAAGGGC AAAAGTCCAT
GTCATGATCG GAGCTGAAAA TGTATTGAGC GAAGCCGCTG GATTTACAAT ATCGAGAGGG
GCCTTGGCAT GCGGCTTCGT CCCCGAAAAT CGTAACTTTG CCTGGTTGAT GGAATATTTT
AGAAAAACAA GAATGTCTGG TGAAGGGGAG CTTCGCCTGT TGGCGCTAGA TGGAATTTGC
GACACCGCAA ATCTAGGATC CGTTGTACGG TGCGCCTCGG CGTTTGGGGT TCACGCCGTT
CTTTTAAGTA AGGATTGCTG CGACCCATGG TACCGTCGAG CGGTGCGTGT GTCCATGGGT
CACATTTTCC GAATACCATG TGTGCGAGTC GACAATTTAG TTCAAGCCCT AACTGCACTA
TCGCAAGAAC CATTCGCAGT CACTTCCTAC GCAGCTGTGA TCGACCCGAG AGCGGATCTT
CTGTTGGAGA ACATCGCACA AGGTATGTCT TTCATTTTAA ACCTAAACGA AGTCAAGAAT
ACATATAGTT GACCGTTCGC TCCTCTTTTC ACTCGACAGG CGCTATTCCA AAATCGTGGT
GTTGTATTGT GGGTAGTGAG GTATGTCGCT TTCGCACTAT TGAAAACTTG AAATAAAATT
GCCTCTCTCT GACAATGAAT GTTACATTTT GACAATAGGG GAAAGGAATT TCTTGTGACG
TTATCCAAGC TGCGACTACA ACCCTCAGGA TCGGGATGTA CGACCACGTT GATTCACTGA
GTGTCCCTGT GGCTACAGGC ATTTTGCTGC ATGGTTTGAG CGAGCGCTCA AAACCGCTTC
TATAG
 
Protein sequence
MDPYRSSDDM KDGTARRMNG KRSREAKIQK WQNKQTKLEK RREEISNRDE WIEKHSDFVS 
VINLEDVRDA KNIASGNALR ERLSPYFNLQ DNSLVSRAKD GRRRFIAEGT ETVRLLMQQL
TVSNNSSSGL FPVEVESIFV KPSVFFDPPV SLIFDFQKMI DLTKHTTVCV SEAAKRAKVH
VMIGAENVLS EAAGFTISRG ALACGFVPEN RNFAWLMEYF RKTRMSGEGE LRLLALDGIC
DTANLGSVVR CASAFGVHAV LLSKDCCDPW YRRAVRVSMG HIFRIPCVRV DNLVQALTAL
SQEPFAVTSY AAVIDPRADL LLENIAQGAI PKSWCCIVGS EGKGISCDVI QAATTTLRIG
MYDHVDSLSV PVATGILLHG LSERSKPLL