Gene PHATRDRAFT_45751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45751 
Symbol 
ID7200776 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp135892 
End bp137362 
Gene Length1471 bp 
Protein Length386 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179980 
Protein GI219118413 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0607057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATGGAGCTC TTTGGTTATT CATTGCAGTC AATCCGAGTC TTCATGGCCC TTGCGTGTTT 
GGTCGACTTG GTGTTCCGAG TAGCCAATGG ACAACATGTA GAGCAGCAAT CCACTAGAAT
GCTTGGTAGT CCTTATCTTT TCTCCAATCA AATGTCGAGG AGGAATGGAA AGAAGAAATT
GAGGCGAGTC AACAAGAAAA TGATGCTCCG GCCGCGAAAA CAAATGCGGA AGAGACAGAA
TATGGTGCAA CCACTAACCT TGGCCCCATC TAGTCAGCCT TTATCGGTTC CAAATCCGAC
AGTGTCAAAG ATACCTTCCC CGCCGGCGAT GCCCTCGTTG ACGCCATCCT CAATGCCACT
GCTGATTCCT TCGACACTGC CTTAATTCTT GCCTTCCCTA GTGCCCTCAT TGATGACTTC
GTTGGAGCCG TCAATATCAC AAATGCCGTC AACAATGCCT TCCTTGGCGC CTTCTTCCAT
GCCTTCGGAG ATGCCATCGT ATATGCCTTC ATTGAAACCC TCAGTATCAA CAGGGCCTTC
GCTGGAGCCG TCAGCAGCAC CAATACCACT GGGGGTGGAT TGGATAAACC AGACAAGCGC
AGCAGATCAT CAGTGGAGTG CTGTCACGTA TGGCAACGGA ATGTTTGTAG CAGTGGCATT
TGGAGGCAGC GACAGTAACC TTGTAATGAC CAGCCCTAAT GGCAGGAACT GGACAAGCCA
GAGAAGTGCA TCAGAAGCTA GTTGGTCCAG CATTACATAC GGTAATGGCA TATTTGTTGC
GGTTGCCAAT GCTGGCAGTG ATCCTATCCG TGTCATGACC AGTCCAAATG GCATCAATTG
GACAATGCAG GAAAGTCCTC CTGAACAAGA CAACTGGAGA AGTGTAACGT ACGGCATGGA
TATGTTTGTT GCACTTGGTG CAGAAGAAAA TGGAGATGTC AGCAAGAAGC TTGCCATGAC
TAGCCCAAAT GGTATGAATT GGACACTCCA GACAACAGAT CCTTTGGGGT TTTGGAACAG
TGTTATATAC GGCGATGGAA CCTTTGTTGC GGTTGAGTTT TCTGGTGGGG TTGACAACCA
GGTCATGACC AGCCCCAATG GAATGAATTG GACAACTCAT CCTGCCCCAG CAGCTCAATG
GATTAGTCTG ACGTATGCCA TGGATATATT TCTGGCAGTG GCTATATTCA GCTCTGACAC
TGAGCAGGTC ATGACCAGCC CCAACGGGAT AAACTGGACC ATCCATCAAA GCGCTAAAGA
TGCTTGGTGG AGTAGCATTA CCTATGCAGA GGCTGAAAAT GTCTTTGCTG CAGTGGCCCG
ATCTGGTGAG GTCATGACCA GTCCCAATGG TAGGAATTGG ACTATCCAAG AAAGTGGAGC
AGCTGCACCA TGGAGCAGTG TCACCTATGG CAATGGAACA TTTGTGGCAG TCTCTTACAA
TGGTGAAGTC ATGACAAGTC AGACTGGCTA G
 
Protein sequence
MELFGYSLQS IRVFMALACL VDLVFRVANG QHSAFIGSKS DSVKDTFPAG DALVDAILNA 
TADSFDTALI LAFPRPSLEP SAAPIPLGVD WINQTSAADH QWSAVTYGNG MFVAVAFGGS
DSNLVMTSPN GRNWTSQRSA SEASWSSITY GNGIFVAVAN AGSDPIRVMT SPNGINWTMQ
ESPPEQDNWR SVTYGMDMFV ALGAEENGDV SKKLAMTSPN GMNWTLQTTD PLGFWNSVIY
GDGTFVAVEF SGGVDNQVMT SPNGMNWTTH PAPAAQWISL TYAMDIFLAV AIFSSDTEQV
MTSPNGINWT IHQSAKDAWW SSITYAEAEN VFAAVARSGE VMTSPNGRNW TIQESGAAAP
WSSVTYGNGT FVAVSYNGEV MTSQTG