Gene PHATRDRAFT_36081 details


Gene Information

Locus tag: PHATRDRAFT_36081
Symbol:
ID: 7201158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 502625
End bp: 503701
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180448
Protein GI: 219119372
COG category:
COG ID:
TIGRFAM ID:
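
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the reported gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 502625
END_BP = 503701


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
residues = protein_length(length)
print(length, residues)  # 1077 358
```

Both results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.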


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGCC GACTCGATGG CATTCGACGT TCTTACCAAG CCTTGACGGA ACGTCTCGCT 
GATCCTGATG TCATCAATGA CTCCAATTTA CTCCGCCAGG TAATGACCGA TCGATCACAA
ATCGAAGAAG TCGTCATGGT ATTCGAAGAA TACGTCGCCT TGCAAGAAGA ACTGAGTGGT
GCTAAGGAAC TGTTTCAGGA CGCCGGAGAC GATCCGGATA TGAAGGAAAT GGCGCGAGAT
GAGATGAAAG CTATTGAACC ACAACTAGAC TCTTTGGAAG AGAAAATTAA GGTTCTGCTG
TTGCCGAAGG ATCCAAACGA TGCTCGTAAC GTCATGCTAG AGATCCGGGC TGGTACTGGA
GGTTCCGAAG CCAATATTTT TGCTGGTGAT TTGCTCGATG TCTATCGAAA GTATATTTCG
ACACAAGGAT GGCAATCAAA TCTGATAGAT TCTTCTTCTG GCGATGATGG CGGGTACAAA
AATGTCGTTT TGGATATCAA GGGCGATATG GTTTACAGTA AACTCAAATG GGAAGCAGGA
GTTCATCGTG TTCAACGTGT ACCAGCAACA GAATCCCAAG GCCGTGTCCA TACGTCTACT
GCTACGGTTG CTATTATGCC CGAATGTGAC GAAGTCGATA TAAAGATTGA TCCTAAGGAA
ATCGAAATGT CGACAATGCG TTCCGGTGGT GCTGGAGGGC AGAACGTCAA CAAGGTCGAG
ACGGCTGTCG ATTTGTTACA CAAACCGACA GGCATTCGTA TCAAGTGTAC TCAGGAGCGA
TCGCAGCTAA AGAACAAGGA GCTGGCTATG AAAATGCTTA TGGCAAAACT TTACGACATG
GAAAACGAGA AGCGGGAAAT GGAAGAACGA GCTCGACGAG GGTCCCAAGT TGGCACAGGA
GGACGCAGTG AAAAGATTCG AACCTACAAC TGGAAGGATT CCCGATGCAG CGACCATCGT
CTCGGTCAGA ACTTTCCGTT GGCACAGTTC TTGAGTGGCG ACATCGGTAG CATGCATGAT
TCCATGATCG CAAAAGACCA AGAGGAGAGG CTGAAAGGGC TGAGCGAAGA ATCATAG
 
Protein sequence
MMSRLDGIRR SYQALTERLA DPDVINDSNL LRQVMTDRSQ IEEVVMVFEE YVALQEELSG 
AKELFQDAGD DPDMKEMARD EMKAIEPQLD SLEEKIKVLL LPKDPNDARN VMLEIRAGTG
GSEANIFAGD LLDVYRKYIS TQGWQSNLID SSSGDDGGYK NVVLDIKGDM VYSKLKWEAG
VHRVQRVPAT ESQGRVHTST ATVAIMPECD EVDIKIDPKE IEMSTMRSGG AGGQNVNKVE
TAVDLLHKPT GIRIKCTQER SQLKNKELAM KMLMAKLYDM ENEKREMEER ARRGSQVGTG
GRSEKIRTYN WKDSRCSDHR LGQNFPLAQF LSGDIGSMHD SMIAKDQEER LKGLSEES
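
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction might then be computed; `clean_sequence` and `gc_content` are hypothetical helper names, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGATGAGCC GACTCGATGG CATTCGACGT TCTTACCAAG CCTTGACGGA ACGTCTCGCT "
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_content(seq):.0f}% GC")
```

Pasting the complete gene sequence into `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to confirm that nothing was lost when copying.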