Gene PHATRDRAFT_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_3559 
Symbol 
ID7198979 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp292418 
End bp293539 
Gene Length1122 bp 
Protein Length351 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185168 
Protein GI219130010 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0310467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAGAGTTTC CCAAAATAGA GCTGCACGTT CATCTCGACG GAAGTTTTGA CCCCTTATTT 
TTGTGGAAAT ATATGCAAAA GCATCCCGAA AGCATGTTGT GTCTGCCAAC GGAAACCGTA
CCTCCTTGGC AGCCAACAAG GAAACTTGAA ATCCGCAAGC TTGTCGAAGA CTGTACCACA
TCGCAAGAAT ATCACAAGCT TTGCACATGT CGTGGATACC GTTCGCTCCA AGAGATGCTA
AATTGTTTTG AAATGTTTTT ACCTCTCGTT CGACGCAATT TGGACCTGCT GGAACAACTC
GCGTACGATT TTTGTCAGCG CCAATGGGAA CAAAATGTTG TATATACGGA GGTGCGCTAC
TCCCCCTTTT TGCTTGCTGA AAGTTTTGAA GTCGAAAATA AGAACTCACA GTCAGTGGAC
GCCGAAGCGG TCTTTGCTGC CATTACCAGT GGACTACGTC GCGGATCACA CAAGTTTGGT
ATTATTGTGA ATCAGATCAT TTCCGCAATC ACGTGGCGAC CCGACTGGGC GATGCCTTCA
CTGGAACTCG CCCAGAAACA CCGCGAAGAC TATCCATGTG CAACCTTAGG TATCGATATT
GCTGCCGGCG AGGAACATTT TGACAGGGAC CAGCACTCGG CGCTCTACGA ACCCCATTTT
GCCATGATTC AAAAAGCCAA AGAGTATAAG TTGCCAGTTA CCCTGCATGC GGGAGAAGCT
GCGATGGAAT CTTCCATGGA TAACGTACGC CGGGCAATTG ACGTATACGG TGCAAGCCGT
ATCGGGCATG GTTATAGGAC GGTCAACGAC TTGGATCTCA TAAACTATGT GAAGGAAAAG
AAGATTCACT TCGAAGTGTG TCCAACATCG AGTGACGAAA CGGGCGGTTG GATGTACAAG
GAAGAAAAGA ACTGGAAGGA ACATCCATGC CTTGCCATGC TCAAGCACGG CATTCCCTTT
TCGCTCAATT CGGACGATCC AGCGGTCTTC CACACCTCCT TATCGTGGCA GTACCGGATC
GCTTTGGCCA AAATGGACTT GACGCGGGAG GACATTGTCA AATGCAATCT GCAAGCCATT
GATGCGGCTT TCTGTTCCGA GGAGCGGAAG GTTGCACTGC GC
 
Protein sequence
QEFPKIELHV HLDGSFDPLF LWKYMQKHPE SMLYCTTSQE YHKLCTCRGY RSLQEMLNCF 
EMFLPLVRRN LDLLEQLAYD FCQRQWEQNV VYTEVRYSPF LLAESFEVEN KNSQSVDAEA
VFAAITSGLR RGSHKFGIIV NQIISAITWR PDWAMPSLEL AQKHREDYPC ATLGIDIAAG
EEHFDRDQHS ALYEPHFAMI QKAKEYKLPV TLHAGEAAME SSMDNVRRAI DVYGASRIGH
GYRTVNDLDL INYVKEKKIH FEVCPTSSDE TGGWMYKEEK NWKEHPCLAM LKHGIPFSLN
SDDPAVFHTS LSWQYRIALA KMDLTREDIV KCNLQAIDAA FCSEERKVAL R