Gene PHATRDRAFT_39151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39151 
Symbol 
ID7194889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp451157 
End bp452602 
Gene Length1446 bp 
Protein Length481 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183099 
Protein GI219125673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCAAG CCCGTCGCCA GGGTATTGTG GAACACGACG ATGAAGAGCT CTGTGTGTGG 
GTAAAAGCCT ACACGGAAGG TACACTGCCC GACTATCAGA TGGCAGCTTG GCTAATGGCC
GTCTGCTTTC ATCCGCTGAA TGCTAGAGAG ACGGCTACTC TCACATCCTG CATGGTGGCC
TCGGGCGTAC GAGTCGACTG GACGTCTCAC GACGCAATCA CAAGCGACAC TATGGGTAAA
GTTGCTTCGA CTACTGCTAC CAATACCGCC TTAGTGGACA AGCATAGTAC GGGAGGTGTG
GGTGACAAAA TTTCCATTGT GCTGGCACCG CTAGTGGCCG CCTTTAGTGA CAATCGCGTG
GCCGTCCCGA TGATGGCCGG CCGAGGTTTG GGTCACACTG GTGGGACCAT TGACAAACTC
GAAGCGATTC CCGGTTTTCG TACCAATCTG AGCGTTTCTG AATTTCAACA CGTTGTCCGG
ACCGTGGGTT GTAGTATCGT CGCGGCTGGG CCGGAACTAT GCCCGGCGGA TCAAAAGCTT
TACGCCCTGC GAGACGTTAC TGGGACGGTC AGCTCCCTGC CACTCCAAAC GGCCAGCATT
GTGAGTAAGA AAGTTGCCGA ACATCCTGAT TCACTCGTGC TAGATTGCAA GTACGGATAC
GGAGCGTTTC AAGCCGACGT TGAGGCGGCG GAAACGCTGG CGAACAGCAT GATTGCGGTG
GCCGAAGCCA ATGGCTTGCG TCCGACAACA GCGTTCTTGA CCCGCATGGA CGCTCCACTC
GGATACACGG TGGGAAATTG GGTCGAGATT CGAGAATGTC TAGCTATTCT ACGAGGCAAT
CTGATGTCTC AGTTGCTGAG TCGAGACGTG ATTGCGCTTG TGGTCGTTCA GGCTACAGAA
ATGCTGCTGC AAAGCGGTCA GTTTGAAGAG CACACGTTTG AAAACTTGGC ATCGAAAGTA
TATACCTTTT TGGACCAAGG AAAGGCGTTT GGTAAGTTTG CGGAAATGGT GCAGGCGCAA
GGTGGAGCCG TTGAAGTCTT GCAAAATCCC GAAACCTATC CCGCGGCCAG CACTACTTGG
GATCTTTTGG CCGACCGGAC TGGATTCATC GTTGAAATTA ACGCACTGTA TGTAGGAGAA
GCAACCGTCG ATCTCGGGGC CGGCCGAAAA GTTGCCAACG AACCCGTTGA TCCACTTTCT
GGAATAGTGT TGTTGAAAAA GTTAGGGGAT TCCGTTGTTC AAGGCGAAGT TCTGGCCAAG
ATTCTAACGA ATCATGCCGG TCACGATTTG CAAGGGATTT CGAATCGACT TCAATCCGCC
ATAGTGATTG GCGATTTTCC CATTAATGTG CCCCCAATCG TTTCTTATCG CGTCACGTGT
CACGGAGCAA AAATCTTTTG CATGCCACAT TGTCTGCTCA AGATAGAACA GCTAACAACT
TCATAG
 
Protein sequence
MIQARRQGIV EHDDEELCVW VKAYTEGTLP DYQMAAWLMA VCFHPLNARE TATLTSCMVA 
SGVRVDWTSH DAITSDTMGK VASTTATNTA LVDKHSTGGV GDKISIVLAP LVAAFSDNRV
AVPMMAGRGL GHTGGTIDKL EAIPGFRTNL SVSEFQHVVR TVGCSIVAAG PELCPADQKL
YALRDVTGTV SSLPLQTASI VSKKVAEHPD SLVLDCKYGY GAFQADVEAA ETLANSMIAV
AEANGLRPTT AFLTRMDAPL GYTVGNWVEI RECLAILRGN LMSQLLSRDV IALVVVQATE
MLLQSGQFEE HTFENLASKV YTFLDQGKAF GKFAEMVQAQ GGAVEVLQNP ETYPAASTTW
DLLADRTGFI VEINALYVGE ATVDLGAGRK VANEPVDPLS GIVLLKKLGD SVVQGEVLAK
ILTNHAGHDL QGISNRLQSA IVIGDFPINV PPIVSYRVTC HGAKIFCMPH CLLKIEQLTT
S