Gene PHATRDRAFT_39901 details


Gene Information

Locus tag: PHATRDRAFT_39901
Symbol:
ID: 7195533
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 319578
End bp: 320937
Gene Length: 1360 bp
Protein Length: 283 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002183968
Protein GI: 219127492
COG category:
COG ID:
TIGRFAM ID:
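The gene length listed above follows from the start and end coordinates if they are read as 1-based and inclusive. A minimal sketch of that arithmetic is below. The function name `feature_length` is hypothetical, the inclusive-coordinate convention is an assumption, and the coding-length estimate assumes the 283 aa protein is followed by a single stop codon.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if start_bp < 1 or end_bp < start_bp:
        raise ValueError("expected 1 <= start_bp <= end_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 319578
END_BP = 320937
PROTEIN_LENGTH_AA = 283

span = feature_length(START_BP, END_BP)

# Assumption: coding length = one codon per residue plus one stop codon.
coding_bp = PROTEIN_LENGTH_AA * 3 + 3

print(span)              # 1360
print(coding_bp)         # 852
print(span - coding_bp)  # 508
```

Under these assumptions, the 508 bp difference between the genomic span and the estimated coding length is consistent with the record marking the gene as spliced.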


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000488323
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAACT TTCTTGCGAT TGTTTGGCAT TTTGCTAGTC ATGGGAGCTC GTCGCTTATT 
GGCAGTGCTA GTTCAATGGA TATATTTCTG TGGTATGTTG TATCAATCTC ATTGGATGGC
CCATGTATTT CTTCTTCAAG CACGGTCCCT ATAGCTTCCT CTCTAAAGCT CTCCCTAAAT
ACTATTTATA TGTATTTAAA GGGACTGCGA GTGGCTTGCC TTTATTTTTG GTCTTCGTCT
ATTTGGTGGA GTTGTTGTTG ATATGGATAC AATCTTCTGG GCCAAGACAC CAGACTACCT
GAATGTACAG GATTACGACA CTTGTCGTGA AGCAGCGTCA ATCATCTAGT ATTCTGGGCT
ACAAAATATG CTCCCATTTT ATGCTCGCTG CCTCGTTATT GTTTGTTTCA CAAGCGATGA
TTCGTCAGGG TTCCTTTCTT TTAATATATG CGTACTTATA TAGAACCAAC CTATCCGTGT
CTCGGAGGAC GGGTTTCTAA TTTTATCTTA CATTACTATG TCCACGCAAA CAATCAAACC
ACTTCATCGA GATCCAGCGT TGTCCTTCGT GCTGCTGTCC AACAGTTAAG TCCCTGCTGT
ATCTGTGGCA ACAAGTTTCG AGTATCACTT TGTTTGCATT CCCAAAAAGC CTGTGAGCCA
AAAAGTGTTT TTATGTTTGG TCGATTTTCG ATTTTAAGAG AAGGGCATAA GTGTTTACTT
GGATTTCGCG GCAAGGTCAT CCATCTTCGA ATTTTGGGAC ATTGGTGCGG CTTTTCCGTC
GGGCCTTCGA GGCATGATGT GCTGCGGAGG TTGGCAAAAA TGTGGAACGT TAATGGGAGA
GGTCAAAATT ACCAAGCAAC ATACTACTGA TGTATCTCCT GCTCATTTGG TCAAGTCCAT
AGATATGCCA GCTGATCAAC TTATCAAGAC CAAACTAAGT ATTAGGTACA TCGATGCCCG
GCGACTGGCT CAAGAGGCGG CGCAAGGTTT GGCATCCGGC GCAACTGAAG CAAACATCGT
AGAAGAAGCG TGCGAACTTT TCGAGGACCT TGGCCACGAG GAGCAGGAGG CCATGAAAGC
AGCCGCCGAT GCGGAGCCTG AGTGGAAGAG AAAAGCGCTT GAACAGGCTG AACGCCGAGA
GCGAGAATGG GAGATCCAAG CAGCGCGTGA ACGGCAACAA ACGGACGGTA AAGCACAGAG
TGCAACGATT GAGAACGACG ACGAAACTGG AGCATATGGA CCAGGTAGTG AAACAACAAC
ATATACAACT GTCAAAAGGA CATCTACTAC AAGGGTAGTA CCAGTCGACG GGGGTCCTCA
TCAGGTCGGA ACATCCGCCA GCTGTTGTGT TGTATTGTAG
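
The sequence above is displayed in blocks of 10 bases, so it needs to be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content reported in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and counting only G and C over the full sequence length is an assumption about how the reported figure was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases, assuming no ambiguity codes are present."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGTCCAACT TTCTTGCGAT TGTTTGGCAT TTTGCTAGTC ATGGGAGCTC GTCGCTTATT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the complete gene sequence to `gc_percent` and to `len(clean_sequence(...))` gives values that can be checked against the GC content and Gene Length fields.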
 
Protein sequence
MSNFLAIVWH FASHGSSSLI GSASSMDIFL WDCEWLAFIF GLRLFGGVVV DMDTIFWAKT 
PDYLNVQDYD TCREAASSIF EFWDIGAAFP SGLRGMMCCG GWQKCGTLMG EVKITKQHTT
DVSPAHLVKS IDMPADQLIK TKLSIRYIDA RRLAQEAAQG LASGATEANI VEEACELFED
LGHEEQEAMK AAADAEPEWK RKALEQAERR EREWEIQAAR ERQQTDGKAQ SATIENDDET
GAYGPGSETT TYTTVKRTST TRVVPVDGGP HQVGTSASCC VVL