Gene PHATRDRAFT_39785 details


Gene Information

Locus tag: PHATRDRAFT_39785
Symbol:
ID: 7195641
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 31934
End bp: 33203
Gene Length: 1270 bp
Protein Length: 320 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002183796
Protein GI: 219127134
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGGCA AGAACAATGC CAGTCGCAAC GTCGACTACG CGATTCATGA GTCCACTTTC 
CCCGTCAAGC TTCATTATCT TCTGAGCGAG ACGGAGGACA ACGGGAGCGA TCATATCATC
TCGTGGCAGC CCCACGGACG GGCCTTTCTA GTACACGACC ACGGGGCCTT TGTGGATCAC
GTCCTTCCCA AGTAAGTCAC TTCTCAACGC GTAACAGTAT TGGGTATGCG TCACACGGTG
GATGCCTTTG CCTATGATTG GGACACGGAG GAACCCGTGT ATCCTTTTAT GTATTACACC
TTGCGTTGTA CTGTGTGTTG TTGTTGTTGT TGCTTGGCAT TGCTTGGCAT TGGATCTTTC
TCTACTCACC CGTTGTTCAA AAGCTGGTTC AAGCAATCCA AGTTCCCTTC GTTTCAGAGA
CAGCTCAACT TGTACGGCTT CAAGCGCTTT ACGGCGGGTA AGTGGCGCGT ATTGTGTATG
TAGAGAGAGC TACGGGACAA TATTACTATG TGAGACCACT TCTCACGTGA TTCATTATTC
GTTCGTTGAA TCGTTCTTCG GGACCCGCAC AGGCCGCGAC AAAGGAGCCT ATTATCACGA
AATCTTCCTC CGTGGCCGAC CCCATCTTGC GCACCGCATT CCTCGCGTTA AAGTCAAGGG
CAGCGGGGTG CGCAAGCCGG GAGCGCCCGA GTCGGAACCC AACCTTTACC TCCGACCCTT
TTTGCTCACT TCGGACTTTA ACGGTGACGC CACGGCCGAG GAAGAATTGC ACACCGTCTC
CAAGAAACCC CACACCGTGA TCCCGGACGG TCCCAGCAGC CAGAGTGGAC CGGACGTGGC
CCGGATCGCC GGTAGAGCCC TGCCGGTTGG CTCCGTGGCG GCGCTGTACG CCAACGCTCC
GCCACCGAGA CCTCCTTTTC CGGGTCGCCC GTCGTTGCAT CACTTTCTGG CAGCGCAACA
CTGGTCCGGT CCTCCACGCG GGTTCGGTGT CCCGATGCAC AACCCTACCA TGAGAAGATT
CGACCCCACC GCTTTATCTC CGCACCAATT GATGTCCCTA CAAACCGCAC TGGAGGAAGA
CAATATTCGA CAGCGAGAAG CCCTCCTGAC TTCGTACGCC ACCTTCCCCG GACGGAGCTC
GGAGCCGGTG GCGCCATCCG CTAGTACCCT CCAGAAGTTG TCCAAGAAAT TCGCGGCGAC
GTCCGCGTCG ACCGCCGCCC AAGATGTAGC CACACTGCTC CAATTGGCTG CGTCCCTCGG
ATACCGCTAG
 
Protein sequence
MGGKNNASRN VDYAIHESTF PVKLHYLLSE TEDNGSDHII SWQPHGRAFL VHDHGAFVDH 
VLPNWFKQSK FPSFQRQLNL YGFKRFTAGR DKGAYYHEIF LRGRPHLAHR IPRVKVKGSG
VRKPGAPESE PNLYLRPFLL TSDFNGDATA EEELHTVSKK PHTVIPDGPS SQSGPDVARI
AGRALPVGSV AALYANAPPP RPPFPGRPSL HHFLAAQHWS GPPRGFGVPM HNPTMRRFDP
TALSPHQLMS LQTALEEDNI RQREALLTSY ATFPGRSSEP VAPSASTLQK LSKKFAATSA
STAAQDVATL LQLAASLGYR
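
Checking the record's figures

The sequences above are printed in blocks of 10 characters, 6 blocks per line, so they have to be joined before any length or composition check. The following is a minimal Python sketch, not part of the original record, for recomputing the listed values. The function names are hypothetical, and treating Start bp and End bp as 1-based inclusive coordinates is an assumption, although it is consistent with the listed gene length of 1270 bp.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence as printed in this record.
first_line = "ATGGGTGGCA AGAACAATGC CAGTCGCAAC GTCGACTACG CGATTCATGA GTCCACTTTC"
seq = clean_sequence(first_line)

print(len(seq))                   # 60 (6 blocks of 10 bases)
print(span_length(31934, 33203))  # 1270, matching the listed gene length
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full gene sequence to clean_sequence and gc_percent gives a figure to compare with the listed GC content of 56%. Because the gene is marked as spliced, the 1270 bp span includes introns, so it is longer than the coding length implied by the 320 aa protein.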