Gene PHATRDRAFT_39794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39794 
Symbol 
ID7195611 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp47106 
End bp48411 
Gene Length1306 bp 
Protein Length366 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183919 
Protein GI219127389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACG GTGTAGAGAG AGATCAGGGT TATGTTGACT ATTCTGCAGA AGTAGAGGAG 
ACTCCCGACA AGGAAGGGAA AACAAACGAA AAGTTTATGT CAGTTGGCGC ATGGCAACGC
ATAAAGGGTG GCAAGCCAGC CTTTCTTGTT CAGCTTCATT TCCATCTAAG CCAAGCCGCA
AATACAGGGC TGGATGACAT ATTCTCGTGG CAGTCTCATG GCCGATGCTT TGTAGTGCAC
AGTCAAAAGC GATTTGAACA GTACGTACTC CCCGTGTAAG TTCCCAATGT GTGCGCTGCT
TTTAACAAGG GTGCTTACCC CGTGACATGC TACAGTCTAA AAATTTCTCT TCAAATGTCA
CGCTTTGGTG CAGTTGGTTT CGACAAACGA AAATATCATC CTTTCAAAGG CAGCTGAACC
ATTATGGATT CAAAAGACTG ACAAAAGGTA AGTCCGTGGT CTGAATATGC AGGTTAAGGT
ACTGTATCTG CTTCTTAGCT TTAGTCGACT CTTGCTTCCA AACGCCTGAA TTTGAAAAGG
TTTTGATAGG GGAGGCTACT ATCACGAGCT TTTCCTGCGC ACTAAGCCAT TCTTGGTCCA
TCGTATTAAA CGCAAAGTCA AAAAGGGAAC CGGTAGACAA CCACCCGACA TGCCTAGGCA
GGAACCCAAC CTGTATTTGT ATCCCTTCTT GCCACAAACG GTATTTCAAG TTTCCGGCAG
AAGAGCACAG ATTTCCGACA GTTCGGCTGG ATTCGTCACA ACGACTAAAC CAGAAAGTCA
GGGACAAGTG TTGCCACTAA TCCCGGTCGA GGATTCCCGC AGCTTGCCGA AGTTCGTTTC
CGAAGTCTGC ACTAATAACC AAACGGTCCC GGTAGACCGC TGCCTTTCCA CTAAAATCCA
TGACCAGAGT CTCCTGGAAG AATCTGTCCG AGCTGTTGCA AACTCACGTT CTCTACCGCC
GTTTGAGCGT GCCGTACCGT CTACATTGGC ACTATCACAG GCGACGTATG ATTCCAATAT
CCGCATGTTG CTGTTGCTGC GGGAGGAACA GGCGCAAGCT GAAGCATTCG AACAAACCAG
AGTACATCAG CAGCTACTGC TCGAAGCGAA TCTTCTCATT GGTTGGCGTA GCGAAAGACA
ATCGGTCTGG TTCGAAGATG GAAATTGCAA CCGCGGACAT GATTGTGAAA TGCGGCTTGC
CAACGCTGCA GCTTACCAGC AACAGCTTTC ACCACAATCA CCCTCGGATC CCCAAAAAAT
TTCGCTTCCC AATTTGTATC GAATTTTGAG GCAAAATGGT TATTGA
 
Protein sequence
MSNGVERDQG YVDYSAEVEE TPDKEGKTNE KFMSVGAWQR IKGGKPAFLV QLHFHLSQAA 
NTGLDDIFSW QSHGRCFVVH SQKRFEHWFR QTKISSFQRQ LNHYGFKRLT KGFDRGGYYH
ELFLRTKPFL VHRIKRKVKK GTGRQPPDMP RQEPNLYLYP FLPQTVFQVS GRRAQISDSS
AGFVTTTKPE SQGQVLPLIP VEDSRSLPKF VSEVCTNNQT VPVDRCLSTK IHDQSLLEES
VRAVANSRSL PPFERAVPST LALSQATYDS NIRMLLLLRE EQAQAEAFEQ TRVHQQLLLE
ANLLIGWRSE RQSVWFEDGN CNRGHDCEMR LANAAAYQQQ LSPQSPSDPQ KISLPNLYRI
LRQNGY