Gene PHATRDRAFT_38995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38995 
Symbol 
ID7194698 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp60037 
End bp61223 
Gene Length1187 bp 
Protein Length351 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183025 
Protein GI219125519 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATGA AGCTTCAAGG TTACAAGAAG AACTACTCCA ACGTTATTCT GATGATTGTT 
TTAACGCACA CGATCGTCAA CATAATCTGG AGACGTCCAC TCAATGAGCT TAGATTTCCA
GTCGCATCCA CATCAATAAA CACCGAAGCT GGCAGCGTCG ATATTCCAGG AACGACAATG
AAGGCGTTGA ACGATAGACC ATGGAAGGTT GAAGGCGACT TTAACAACGA TACGACAGTA
CTTTCAGAGA TGCGGCAAGC GCACCGATCA TGCAATGAAA CCACTATCGT CCTGACTACG
AATTTTATTC CGACTTCTCC CTCCCTGGCG ATCATCAACC GCACTATCCA TTCGATTAGA
AGGCTGAAAG GACTCTGTCC TACCGCACCC CTCATCATAT CTGTCGATGG TCTCAACAAG
GAAGCTCGAA GGATTCACAA CAACTCAGAA CCGCGACTGG AAGAGTACGT CAAAAGGCTC
CGAACCGTCT ACAACGAAAC GCATCAGAGA GTTGTGGCGA GCAATCATTC ATTGATGATT
ACCGGAACCG TCTATCAGGC CATGGATCTA GTCAAGACGG AGTTCGTTTA TGTCATACAG
CACGACATGC CATTTATTCA GGATATTGAC CATACTGCGC TTGTGCGGAC CTATGATCAA
TTTCCTGCGG TGCTTCGTTT GGTGCGATTC AATTTGAGAC CCAATATTCA ACGGGGAGAT
CTCGAAGGGA ATAATACATG CTATGCCGAA GAAACGCCCG TGAACGATGT AAATGGGATT
TCTCTCATCA AGACATGGAT CTGGAGTGAC AAGTAAGTAT GCAATGGAGT TTCGTGGAGA
GATGGATGAA CAAAAAGACA TACTAGTGAA GCTCTGGTAG CTACACTACG TTTGCTGTGT
CTTTCATACA TTCCTCAAAC TTTCCGCCCT TTTTATTCTT TAGCAACCAT TTCACACGAA
AGTCGTACTA CGACGAAATG AAAGAATTGT TCTACAAAAG ACACGGAAGG CTGCCTTTTG
CCATGGAATG GGTGATGCGA GTTGAGGGTC AAAAGAACTG CTCTTATTGG GGGACCTTCT
ATTACGGGCC TCAAGGGCAA GCCCCAACAA TTGCCCATAT GGATGGCCGT CAAACGACAC
AGGTAGCGGA GAACGAAGAT TTGCGTCTGC GTCGATGGAT GCGATAA
 
Protein sequence
MQMKLQGYKK NYSNVILMIV LTHTIVNIIW RRPLNELRFP VASTSINTEA GSVDIPGTTM 
KALNDRPWKV EGDFNNDTTV LSEMRQAHRS CNETTIVLTT NFIPTSPSLA IINRTIHSIR
RLKGLCPTAP LIISVDGLNK EARRIHNNSE PRLEEYVKRL RTVYNETHQR VVASNHSLMI
TGTVYQAMDL VKTEFVYVIQ HDMPFIQDID HTALVRTYDQ FPAVLRLVRF NLRPNIQRGD
LEGNNTCYAE ETPVNDVNGI SLIKTWIWSD NNHFTRKSYY DEMKELFYKR HGRLPFAMEW
VMRVEGQKNC SYWGTFYYGP QGQAPTIAHM DGRQTTQVAE NEDLRLRRWM R