Gene PHATRDRAFT_31068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31068 
Symbol 
ID7199055 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp252089 
End bp253900 
Gene Length1812 bp 
Protein Length353 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185242 
Protein GI219130165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTTACTGTT GGTGTTGATG CTTGGACTGT GCATAAGACC GTCGCGACAA CTCGTTCTTC 
GTATAACAAG GATTGAAAGA CTAGTAGTGG GCTAGTGGGC GTGTGGCGTG TTTTTCCTGG
AAAGCCAGCG TGTGTCGGTT TCGTCAGTCA GTCAGCACCC AAGTTAGATT CCTGAAGCCA
GCAACCACGA TGGAATCCGC ATCGACGACT ACCGCTCCCC CGTTCGGAGA GACCCCCAGT
GGTCAATTGC CACCGATCGT GCAACGCTTA GAACGTGGCG AAAAGACACC TATATGTGTC
ATTATGGTGG GAATGGCGGG GTCGGGAAAG ACAACTCTCT TGACGCAATT GCAACGATCT
CTGGAAACGC CGTCCGTACC GCCCACGCCT GATGATTTCG GTAAGGATGT ACACGGCGAC
ACCAGCCGCG CACCCGACAG TGACGTCGAA CCACACTCCG ACTTAAAAGT CGCCGCTCAT
GTACCAGTCG CGGCCGATAC CGCCGCTGAC GCCAAAATGG CATCGTACGT TGTGAATCTG
GATCCAGCTA CGCTATCGGT ACCCTACGAA GTGTCCATCG ATATTCGCGA TACGGTCGAC
TACAAGCAAG TCATGCAACA GCACAAACTG GGACCCAATG GAGCAATCAT GACTTCGCTG
AATCTCTTTG CCACCAAATT TGATCAAGTC ATGACGTTGT TGGAAAAGAG GGCGTACGAA
GACGCTTCCG AACACGACCA GGAACAAGAT GGTACCACGA GTACGCCGCC CGAACCGCTG
CCACCACAGT CACAAATAGG GATGGACTAT ATACTGGTGG ATACACCCGG GCAAATTGAA
GCGTTCACCT GGTCCGCGTC GGGAGCCATA ATGAGCGAAG CGCTAGCCTC CGCCTTTCCG
ACCGTGCTCT GTTTCGTGGT CGACACGGTT CGCTGCGCGT CGTCCCCCAA TACCTTCATG
AGTAACATGC TGTACGCCTG CAGTATGATG TACCGTACCA GACTGCCCCT GATCGTCTGC
TTCAACAAGA CGGACGTGGT GTCGCACGAG TTTTGCCTGG AATGGATGCG GGATCATGAT
GCCTTTCAAG AAGCTCTAGA CGACGTCTCC GAGTCGGCCG GGTTTTACGG ATCTCTGACC
AGAAGTTTGG CCCTGGTATT GGACGAGTTC TATTCCAGTT TCGCCAACGC CGTTGGGGTG
TCGGCAGTGA CTGGAGACGG AATGGATGAC TTTTGGAAGA CGGTGGAAAA GGCGGGACGT
CAAGACTTTG TGTTGGACTA CATAGAAGAT TTGAAGAATC GGATAGAGGA GCAGCAAGCT
CGCACTCAAG CCATGGCTCG AGTGAGTTTA TCGAGGTTAC AGCGAGACGT GGATGCGGCG
GACTAACTGT AAAGGATGCT CGGAACGTGG ATATCATTGA ACCCTGGTAA TCCCTTTTTG
ATTTCGTGTC CACGTAGTCA TCAATACGAT ATTCTCTTGT GTTCGGGACT TATACAAGCT
GCCCTGCCCT CCCTGTCCCA TAGCTGACAC TGAAAATGAG CAAACAAACG GATTTCACTG
CTACAGCACC CATCCTCTGC AACCCCTTCT ATTGGAGCAT AAGTGGCACA GCTGGTGAAA
TTATCTAATA ATAGTGTCCT TTTGATGTCG TATGGACGCC AACTATATTT GGACGACAAC
CACATGGACA TTTTGAAGCT CGTCTGGCAG CGTTTTCGGC CACACCTAGC TTGAAAAGTT
GACTTCAAAA GAGAAACAAA GCCAGGGCGG CAACCTTTGG TAGCTAGTGC CAAGAGTATC
CTTTCTTCCT GC
 
Protein sequence
MESASTTTAP PFGETPSGQL PPIVQRLERG EKTPICVIMV GMAGSGKTTL LTQLQRSLET 
PSVPPTPDDF VAADTAADAK MASYVVNLDP ATLSVPYEVS IDIRDTVDYK QVMQQHKLGP
NGAIMTSLNL FATKFDQVMT LLEKRATPPE PLPPQSQIGM DYILVDTPGQ IEAFTWSASG
AIMSEALASA FPTVLCFVVD TVRCASSPNT FMSNMLYACS MMYRTRLPLI VCFNKTDVVS
HEFCLEWMRD HDAFQEALDD VSESAGFYGS LTRSLALVLD EFYSSFANAV GVSAVTGDGM
DDFWKTVEKA GRQDFVLDYI EDLKNRIEEQ QARTQAMARV SLSRLQRDVD AAD