Gene PHATRDRAFT_50330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50330 
Symbol 
ID7198988 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp328935 
End bp330178 
Gene Length1244 bp 
Protein Length234 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185174 
Protein GI219130022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.162709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACCAG ATGACGCAGA CGGTGCGTGT GTCATCCGTC GCTACTTTCC TTTGTCTGTG 
CAACTATGGA AGCCACTCGT TGGTTCAAGT AACATGTAAA GGATAACTAC TCTAAGAAAT
ACTGCGGGAG CTCTTTTCCC ACTTCTGGTT GGTAATCACC GATTGGAAAA GAGCATCTCG
TAGGGCACTA ACAGGCCAAC GATTGTTCTA GGTATAACAT ACTTTTATTA ATTTGATTCC
TTCAATCCGC AGGATGACAA CAATTATCAA GGCAATTTCG GCGCTTTCTT CTGTGCGAGG
CTACTATCGT GTCGCCTCGT CCGCTTTTCC CATACGTATG GGTGCAACGG ACTCAGACAT
GTTCCGTATT GACTTTGTCG ATGAGGACAA TACCTTGAGC GTGTCGCTGC AAGACTTTCA
CCTCGCTTTT CTGACGTCGC CTTTGTTCCA ATTCGAGCTG TGGATTTTAT CGTTCGCGAC
GGTGTCCGAC CCGGCCACTA CCACAACGGA ACATTTGGCA GCGGTAACAA GTGGTGACAA
GAGCAGTTTC GGACCGTGGA CTGCGTGGGC AGTCGAGGGT AAACGCAGTG CCCCACTTAC
AGCCTCTTCT GAACCGGCCT CAGCGTGCCA AATCATGCGG TGCTACATCC AAGGTTCGTT
CTTGACTGTT TCTACGCGAC CTTTTTGATA ATTTGTATTG CTGTGTGTTC TAATGCAGTG
TTTTACGCTT TTACATTAGG CAAAACCTTT TGTGATACGT GGTGGGCTGT TGAAAAAGTG
GCTGACCGAC CTAATCCGGA GCTCGTTTTT GGATCGGCCT TTAAATTTTG CGACGATCAT
AAGCCTCTAA TGTTTCGCGT GCTGGACCCT TTGCACCGCT TGTACAGTCG CCTATTGTTA
GCGTCAGCCA TGATCAACCT GATCCAACAA AAGCAAAAGT GCGAGTGATT TACTGTTAGC
GATAAATTTG TGGTCTCACC TTTCAGATAG ATCAGCAATA TCATACCGGA TGACCATCTT
TTATTGATTT TGCATGCCAT TGGTCCCCAG TCGACGGCCG ATTGCCAGCT CGTGTCTCCT
TGGAAGATTG ACGGATGTGA CCGGTTGTCT CGGATGGAAC ATACCGATAC AAAATTGTTT
GATGTGACAT CCGTGACAGT GAATAATTTC CCTACGTTTG TGCAGCATGA GATTTGTAAC
ATAGATTGTA AACAAAAGGA ATACGTAGCT CTTTCTTTGG AAGG
 
Protein sequence
MSPDDADGAC VIRRYFPLSV QLWKPLVGSS NMMTTIIKAI SALSSVRGYY RVASSAFPIR 
MGATDSDMFR IDFVDEDNTL SVSLQDFHLA FLTSPLFQFE LWILSFATVS DPATTTTEHL
AAVTSGDKSS FGPWTAWAVE GKRSAPLTAS SEPASACQIM RCYIQGKTFC DTWWAVEKVA
DRPNPELVFG SAFKFCDDHK PLMFRVLDPL HRLYSRLLLA SAMINLIQQK QKCE