Gene PHATRDRAFT_47395 details


Gene Information

Locus tag: PHATRDRAFT_47395
Symbol:
ID: 7202539
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 521912
End bp: 523185
Gene Length: 1274 bp
Protein Length: 331 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181744
Protein GI: 219122837
COG category:
COG ID:
TIGRFAM ID:
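
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, though it matches the 1274 bp value listed in the record. A minimal sketch of the arithmetic, where `gene_length` is a hypothetical helper name and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record: Start bp 521912, End bp 523185.
print(gene_length(521912, 523185))  # 1274
```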


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTTCGATT CTTTCGTTGC CTTACTTCCA GCTTTCAAAG TATCATCATG AAGTTCACTT 
CCTTTGCCTT CCTCGCACTT GCGGCTTCCG CCAATGCCTT CACACAGCCC GCTTCCTTCG
GGGCCGCTCG CAGCTTCTCC ACTTCTTTGG ATGTTTCCAA GAAGGATTTG GAGGGCGCTC
AAACGATGAT TGACAAGATC ATTGACGATA CGAACGCCAA CCCAGTCTTT GTTCGTCTTG
CTTGGCATGA CTCGGGTACT TTCGACGTCA ATGTTGAAAA AGAGTGGCCG GCATCGGGTG
GGGCTATTGG CAGCATCCGC TTCGACCCCG AAATCAACCA TGGCGCCAAC GCTGGTTTGT
CGGGAGCCGT CAAGCTTTTG GAACCCGTTA AGGAAAGCTT CCCCGATGTC AGTTTCGCTG
ACATTTTCCA AATGGCCTCC GCCCGTTCGA TCGAACTTGC CGGAGGTCCC AAGATTGACA
TGAAGTACGG TACGTTGTGA CTAACCTGCC TAGTGTGATG CCAATTTTCG AATCGTGTAT
GGCTTTGGCT TCGAATGCTC ACCCTTCTGG TTATGTTGCC TTTAGGACGT GTTGATGCGT
CCGGTCCTGA AAACTGTTCC GCTGAAGGAA ACCTTCCCGA CGCGGAACCG GGTCCGGACG
GAAAGTATGG TGGTCCGGGA GGCAGCGCTT CAACCGAAGA CAAAACCCCC AACGGTCACT
TGCGCAAGGT GTTCTATCGC ATGGGCTTGA ACGACGAGGA AATCGTGGCA CTATCGGGTG
CACACAGTTT CGGTCGCGCG TACAAGGACC GCTCCGGACT TGGCGCTGAA AAGACCAAAT
TCACTGACGG CAGTAAACAA ATTCGAGCGG ACGGGAAGGA AGCCAAGTAC AACCCCGGTG
GTAGTGCATG GACCAAGAAC TGGTTGGTTT TCGATAATAG CTACTTCACA ACGATCCCCG
ACGAGTCCGC TGATCCAGAA CTTCTCAAGC TTTCGACTGA CAAAACTCTC TTCGGCGATG
AAGACTTCAA GCCCTTTGCT GAAAAGTTCC GTGATTCACA GGATGAGTTC TTCGCTTCGT
ACGCAAAGGC GCATAAAAAG CTTTCCGAGC TCGGATCCAA GTTTGAAGCC GTCGAATAAA
GATCCAGTAC AAACAAACTA CTACCCTTGG CCACCGAGGA TGTTTAAAAA GTACGCCAGA
AGGCGTCATT ACGAAATTCA AGACAGATCA TCTTCAAAAT GAGAGATAAA AATAAAACTT
GGCTAGTCGA TGGG
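
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the record. The sketch below shows one way to do that. `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC definition used here (count of G and C divided by total length) is an assumption about how the listed GC content was computed.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first two printed lines are used here as a short illustration;
# paste the full block from the record to check the whole gene.
excerpt = """
CTCTTCGATT CTTTCGTTGC CTTACTTCCA GCTTTCAAAG TATCATCATG AAGTTCACTT
CCTTTGCCTT CCTCGCACTT GCGGCTTCCG CCAATGCCTT CACACAGCCC GCTTCCTTCG
"""
seq = clean_sequence(excerpt)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Run on the full sequence, the cleaned length should equal the Gene Length field if the block was copied completely.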
 
Protein sequence
MKFTSFAFLA LAASANAFTQ PASFGAARSF STSLDVSKKD LEGAQTMIDK IIDDTNANPV 
FVRLAWHDSG TFDVNVEKEW PASGGAIGSI RFDPEINHGA NAGLSGAVKL LEPVKESFPD
VSFADIFQMA SARSIELAGG PKIDMKYGRV DASGPENCSA EGNLPDAEPG PDGKYGGPGG
SASTEDKTPN GHLRKVFYRM GLNDEEIVAL SGAHSFGRAY KDRSGLGAEK TKFTDGSKQI
RADGKEAKYN PGGSAWTKNW LVFDNSYFTT IPDESADPEL LKLSTDKTLF GDEDFKPFAE
KFRDSQDEFF ASYAKAHKKL SELGSKFEAV E