Gene PHATRDRAFT_39523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39523 
Symbol 
ID7195351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp48662 
End bp49840 
Gene Length1179 bp 
Protein Length392 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183538 
Protein GI219126594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCAG ATGGAAACAC AAAGCGCCAA GCAAAGCAAG TTGCGTCCGA GAGCAAGGGG 
CCTGTCGAAG TCAGTGACGA CACCGCGACG AGGTCAAAAA GGCCGCGAAA GACCTCAAGG
TCAGCCGGAA AAACCAGCGC TTCGAAACGC CGTCGATGTG CACGTGGCGC GGGAGCTTTA
TTGGAACTTG CCGAGCGGCT CGTAGAGGGT CAAGAAGTTG TCATCATTAC CGGTGCTGGA
CTTTCGGTAG CTTCCGGTAT TCGCCCGTTT CGATCCACCA ACGGAAGTAG CGCAACTTCT
GTTCCTACAA AACGAGGAGT TGTTCCTACA GCAGGTTTAT GGAATGATGT CATTTGGACG
ACCGCCACCC GCGAAGCCTT TCGCAAAGAT CCGAAGCGAT GGTACAATGA CTTTTGGTTG
CCCCACTTTC AAGACGGTAC CACCTATTAT CCCAACGCCG GCCACCTGGC GTTACAGGCC
CTGCACGACC GTTACGAGAA TCTCCGACAA ATTACTCAAA ATATCGACGG CCTGCAAGAG
CCCAATAATC ATCTTATCGA GGCGCATGGA CGCGTCGGTC TCTATAAATG CATTCCGCAC
GAGGACGAAG AAAGTGACGC AATGGAAGGT GACTCGGACG ATGATGAAGA CCGAGCCGTG
CAATTGGGAC ATCGTCGGCA AGGACGCAAG GTAAGAGAAG CATCCACAAA TCCCGAAATT
TGTCCCTACC AATACTTGCA ATCGTTGAGT CCTTGTCAGC TGGAGCCGGC AAATGTTCGA
AATGCCCTGT GCGAAAGCAA AGGCCAAAAC CTTCCGGAGG CTCCGGCTTG CCCAGCTTGT
GGCGGGGACG TTTTACCGCA AGCCCTCCTT TTTGATGAAG GCTACCACGC ACACGACTTT
TATGATTTTG AGCGAGCGGA GGCTTGGTTA GAGAGTGCGG AGGCAATCGT TTTTTGTGGA
ACTTCGTTTG CGGTTCGCAT TACTCATGTA GCTCTGGAGC ACGCTCGAGT ACACAAGGTT
CCTGTTTACA ATTTCAATCT ACACGATGTA CTCGAATCCA CAGCGCGATT GAATGTCACA
AATATCATTG GGCCGTCCGA CGAAACCTTG CCCAAATTAG TGGAGGCCTG TGATGAGGCT
GAAAGTCAGC AGGTCGGGGT AGGAGAAGGG AGTTGTTGA
 
Protein sequence
MPPDGNTKRQ AKQVASESKG PVEVSDDTAT RSKRPRKTSR SAGKTSASKR RRCARGAGAL 
LELAERLVEG QEVVIITGAG LSVASGIRPF RSTNGSSATS VPTKRGVVPT AGLWNDVIWT
TATREAFRKD PKRWYNDFWL PHFQDGTTYY PNAGHLALQA LHDRYENLRQ ITQNIDGLQE
PNNHLIEAHG RVGLYKCIPH EDEESDAMEG DSDDDEDRAV QLGHRRQGRK VREASTNPEI
CPYQYLQSLS PCQLEPANVR NALCESKGQN LPEAPACPAC GGDVLPQALL FDEGYHAHDF
YDFERAEAWL ESAEAIVFCG TSFAVRITHV ALEHARVHKV PVYNFNLHDV LESTARLNVT
NIIGPSDETL PKLVEACDEA ESQQVGVGEG SC