Gene PHATRDRAFT_42548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42548 
Symbol 
ID7196254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp385208 
End bp386280 
Gene Length1073 bp 
Protein Length346 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177081 
Protein GI219110659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0379002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACGG TGAGACTTGT GTCGCTTTTG TTGGTGTTGT CGACAGCCAA CAGCTTTCTA 
GCACGACTTT CCAGTACAAG GAATCGTAAC GAGGTTGCTG TGGCTTGTCG GAGCGAACCG
GAAACTAGCA ATTACTGGGG AGATGACACT GATGACCAGG ATACCCTCCA GCCCAGCTTG
ACGCCTCTGT CGTCGTTCGC GGCGTCTCCC GCCTTGTTTG AGCTAGATCC GGCTTCAGAC
CAAGCTAGAG ATATCGTCAT GAACGATTTG AAACTCTCTG GTGCCCAACA CGAACAACTG
GTGTCGTTGT GTCAAGCCGT TGTGGACTGG AACGACCGAA TAAATTTGGT TTCGCGGAAG
GATTGCACGG TGGCAACCGT ATTTGGCCGG CACGTACTAC CTTCCATTGC CTGCTGTGCC
TTTTCAGAAG ATCAAAATCC CTTAAACACT GCTAAAACAC TGGTCGACGT TGGCACGGGC
GGTGGCTTTC CCGGATTGCC GTTGGCGATT GCCTATCCCG ACGTTCAGTT TGTCCTCCTT
GACAGTGTAG GCAAGAAACT GACTGCGGCC CAAGACATGG CAAACGCTCT GGGACTCGAC
CACGTTCGTA CACATCACGG GCGTGCCGAA GATTTACGGG ACGAGGTCTT CGATGTCGCC
ACGGGTCGTA GTGTGTCGGC CATCCCACAA TTTTGTGCGT GGATGCAGCA TTTGGTCAAA
CCCACGGGGC ATCTTCTCTA CTGGATTGGC GGCGACGTCG ACGCGAGTAT TCTGGAACAA
ACTGTTTCGG ATACCCCCAT CGAGTCGCTA GTACCCGACA TGGAATCGGA TAAGAGAATA
TTAATACTCC CCCAGCTTGC TGTGAAGAGG ATCGCCAAGG CTAGTGGAAT TTCTGTGCAA
CCGTCACCAA CCAATCGATC ACAAAGAAAG CGCCCATCGT CCCAGAGAAA GACGACAGCC
AAAGGATCTT GGAGCCGCCG AAACTCGGAA GAGCCCAAGC AGCGCGGCTA CGAAGGCTTC
AAGCGGTATT CTAGCTCGTA ACCTACTTTT ACGATACCAG GACAACAAAA CCC
 
Protein sequence
MSTVRLVSLL LVLSTANSFL ARLSSTRNRN EVAVACRSEP ETSNYWGDDT DDQDTLQPSL 
TPLSSFAASP ALFELDPASD QARDIVMNDL KLSGAQHEQL VSLCQAVVDW NDRINLVSRK
DCTVATVFGR HVLPSIACCA FSEDQNPLNT AKTLVDVGTG GGFPGLPLAI AYPDVQFVLL
DSVGKKLTAA QDMANALGLD HVRTHHGRAE DLRDEVFDVA TGRSVSAIPQ FCAWMQHLVK
PTGHLLYWIG GDVDASILEQ TVSDTPIESL VPDMESDKRI LILPQLAVKR IAKASGISVQ
PSPTNRSQRK RPSSQRKTTA KGSWSRRNSE EPKQRGYEGF KRYSSS