Gene PHATRDRAFT_40171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40171 
Symbol 
ID7195937 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp312699 
End bp313866 
Gene Length1168 bp 
Protein Length364 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184227 
Protein GI219128032 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.280697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAAAG GGGTTCCAAT GCACCGAGTT CCCCAGCCCC AACATCAGCG ATGCCGAGTT 
AAGCTACAGT GCCTGGTACT CGGTGCAGCT GGTGCCGGAA AGACGTCCCT CTTGAGGCGA
TATTTTCACA ATGCCTTTCA GGCTGGAACT CGTGTGCCTA CGCTAGGCTC CGACTTCTAT
ACGGGACGCG TGCCGAATCC TTTGCAGGAG CATAATTCGT CGTCTACGGA TTCACATGTT
CTCATCAATC TTCAAATGTG GGTAAGTCAT TGGGCCCACG GGCAGGCTCG TAGGCATCGT
ATTTAAAAAG GAACGCAGCT GCTTTTGGCT GAAGACGGGC GTAAACTCAG AGTTCTTTTT
CTTTCAGGAT ACGCCTGGTC GGGAACGATT CTATTCGAAA CGTCAAAGGC GACACACCGA
CGCAGCTTCA TTGGGGGCAT CGTTCTTCCG GCAAGCTGAT GCAGTAATGT TGGTCTACGA
CATGACATCT TCAACATCGT TTACACAACT TTTGAAATGG TATGCCGATC TGGTGGACCT
TTGTCAAAGC AAGCCTGTTC CAATTTTGAT TGTGGCGAAT AAACTGGACC TCTTCATTGC
TGACCAGCAA CGCGCTTCGA CGTGGGTCCA TCCCCGTAGA GTTTCGCAAC GAGACGTCCT
GGGACTCGCT GGGTCCTTTC GAGGCAATGA CTTTCGGTAC GAGTATCGTG TTTCTACGCA
GTTATCCCCC AATCCGATGA AGAAGAAACA TCAACGGAAA CAAAGCCACC GCAGAATGGA
GATCTCCAGC TTTCTTGCCA ATCGTGAAAA CTGGACAACC GACGGATCCT ATTTAGAATC
CTTGCTTAAT TCGGAAGACG CTTCGCACCC GGATCGTGAA ATGGTTTTGC TTTGGTGCAT
GCGAAACGGT TTGAAACACG TTGAGGTCAG TGCCGCTACT GGCGAGCATG TCGATGGAGC
GATCGATGAG CTCATCCGTC TCGCCTTGCT CACCAAACAA AGCAAAAATT GCGACACGAA
AGCAGACCTA GTGGGCATTG AAAGCCAACC TTTATATCAA CGAAACGATG AGTTGAACGT
TCAAGAAAGG TATCAGTCTA ATGAGGATCG ATGTACGTTT CTACGACCCG TGATAGACCT
TTTTCAGCAA AGGAAAAATA TGATATAA
 
Protein sequence
MQKGVPMHRV PQPQHQRCRV KLQCLVLGAA GAGKTSLLRR YFHNAFQAGT RVPTLGSDFY 
TGRVPNPLQE HNSSSTDSHV LINLQMWERS CFWLKTGVNS EFFFFQDTPG RERFYSKRQR
RHTDAASLGA SFFRQADAVM LVYDMTSSTS FTQLLKWYAD LVDLCQSKPV PILIVANKLD
LFIADQQRAS TWVHPRRVSQ RDVLGLAGSF RGNDFRYEYR VSTQLSPNPM KKKHQRKQSH
RRMEISSFLA NRENWTTDGS YLESLLNSED ASHPDREMVL LWCMRNGLKH VEVSAATGEH
VDGAIDELIR LALLTKQSKN CDTKADLVGI ESQPLYQRND ELNVQERYQS NEDRYLFQQR
KNMI