Gene PHATRDRAFT_45993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45993 
Symbol 
ID7201055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp889364 
End bp890556 
Gene Length1193 bp 
Protein Length322 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180136 
Protein GI219118738 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00551878 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGATG CCAAGAGGAA CACTACAGCG TCTTCTTCGG AAGAAGAATC GTCTTCGGAC 
GACGACAGTT TGGCATTGGA AGGGGTCGTT GCCCGGCATC CTGACGCCCC CTCCTCGTCG
GACGATGACG ACGAAGGAGA AGAAGAACCT TGTGAGCCCA CGGCTGACGA AACGGCCAAT
CAAAAAGACT TTGGCACTGA CAAACGCAAG GGAGCAACGA CAAGGTCGTC CAAAGTTCCA
CCACCCAAGC GGAAAAAATC CAACGAATCC GAAACAATGC AAGTCGAATT TACCTTTCAC
GATATGGACG ACAAGTTCTT TCACGGACTC AAATCCCTCC TCCACAACAG CTGCACCACT
TACCAACCAC ATTCGTCGGA GCTGGTAGAC TTGATGATAA ACAATATTGC CGTGGGGACC
GTCCTGTCGA CACAAGGCGA TACGGAGAAT AACGTCTACG GCTTTGCCTC TGTCCTCAAC
ATTACCGAGC ATCAGCAATC GCCCGCCATA CAGCAGCTTC AACGATTTTG TCTCGACGGA
TGCCCGGCCG ATCGGAGTGC AGAACTAGAG GTCGTCTTGT CGGGAAAAAC CAAACGGCCC
GCCGGCTTTG TCTTGCACGG TCGCATGCTC AATCTGCCTT TGGAAATTGT CGAAGTCTTG
CAGCAGCAAC TCGTCTTGGA TATGGATTGG GCCGTCGAAC ACGCCGAGGG CGGTGTTGAC
GCGCGCAAAG CCCTCGACTT TGGAGTCTTT CTCCGACTCG CTCCCTGTCA AAAGGACAAC
ACGGGAGCAC TGGTCTATCG CTTTTTTGAC GACGAAGTGT TGGCGGGCCA AGCAGACTTT
AGTTTCCTGG TGGAAGCTCC CGCCAGCTAT TCCAAGGAAG ACAAAAATTA CGTATCCGTG
ATTGTTCTCA CCAAAACAGG ACACCGTGCC GCCATGAAAG ATCTAGCCAA GCTCATTCAC
GGAAGATGAA GGCATGATCA TGCAACGTGT CTTGTTCATG AATGAGGGAC ACGCTCCCTT
TATCGATAGC CAATACCCTA CACCGACAAA CCGTAAAACC AAGTTTTTTA CAAAATTGTA
CCGCATACAC GTTGTTTCTG TTTGGACAAA GTTCAGCTAC GATTCAGAGT TTAACAATAA
ATTCGTACAA TTCGCGGATT CCTCTCTACT GCCCATCAAA CTAATTCCAT ATT
 
Protein sequence
MPDAKRNTTA SSSEEESSSD DDSLALEGVV ARHPDAPSSS DDDDEGEEEP CEPTADETAN 
QKDFGTDKRK GATTRSSKVP PPKRKKSNES ETMQVEFTFH DMDDKFFHGL KSLLHNSCTT
YQPHSSELVD LMINNIAVGT VLSTQGDTEN NVYGFASVLN ITEHQQSPAI QQLQRFCLDG
CPADRSAELE VVLSGKTKRP AGFVLHGRML NLPLEIVEVL QQQLVLDMDW AVEHAEGGVD
ARKALDFGVF LRLAPCQKDN TGALVYRFFD DEVLAGQADF SFLVEAPASY SKEDKNYVSV
IVLTKTGHRA AMKDLAKLIH GR