Gene PHATRDRAFT_42440 details

Gene Information

Locus tag: PHATRDRAFT_42440
Symbol:
ID: 7196644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 58601
End bp: 60029
Gene Length: 1429 bp
Protein Length: 475 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002177009
Protein GI: 219110515
COG category:
COG ID:
TIGRFAM ID:
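
The length fields in this record are related by simple arithmetic. The following is a minimal Python sketch for cross-checking them, assuming the start and end positions are 1-based and inclusive (an assumption, not stated in this record) and that the coding region ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, ASSUMING 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus 3 for an
    optional stop codon (ASSUMED to be present by default)."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record.
length = gene_length(58601, 60029)
cds = coding_nucleotides(475)

print(length)        # 1429
print(cds)           # 1428
print(length - cds)  # 1
```

Under these assumptions the coordinates reproduce the listed gene length of 1429 bp, which is one base longer than the 1428 bases needed for 475 codons plus a stop codon. The listed gene sequence does begin with a single C ahead of the ATG.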


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.727315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATGGCAGAA AGACGACCGC ATCCTCCTTC TCTTTCGGTA GCTATTTTAC AAATTCTGAT 
TCTTTATCAC ATGCTGCAGC GAGCCGCGGC GTTGGCAACG TCGCGAATTT CTTCTCCATA
CACAAAACGC GGTCTGGTTC GAGTGGTCGC ATTTTCGGTT TCCAACCCCA ACGAAGAACA
GGTGCAGGAC ATGCTATCCA ACGTTCTGTC ACGAGCCACA CGCACGTTGT CCTCGAAATC
CTATACCAAT TTCTCTGTAA ATAGCACCTG CTCTTCCGAG CCGTGGCGGA ATGAATCGCG
TGTCGACGAT GATTCCGTAT TGGATATGCT GGAGGCAATT CGAGTGGGCA CCGCGAATGG
GAACTCGTCA ATGGTTCGCA CCGATGCGAC AAGGAGCACC AACAGTGCTG TATCCTCTTC
TTTCTCTCCG TCGGTAACAT CCGCTATTAG ATTGGATTCC CTTTTCACGA ATTCCATATC
GCGAGCCAAA CGCACATTCA CCACCAAAGT GGAATTCCAG GAAGCATCCA AATCGAACAA
ATTGGTCGAC GATACGGTCA CAACCATACT CGACAAGGTA CAGCAGCAAG GTCGCGTCCC
ATCGCCGACA CCCGCTCCAG ATCCCTTGCC TCGTCCCACG GACACTGATT TGCACTACCA
GCAAAATCCC GCCATTTCAG CCACTGCCCT GGCCCATTCC TTGTGGGGTT ACGTGCTCCG
ACCGGGCCTG GATTCGGCCA TTGACGCAAC GGCAGGAAAC GGTGGGGATG CCGCTACAAT
TGCCACGATG CTCTTTTCAA ACGTGACCCA AAGCTCGACG TCATTAATGC CAACCTCGCG
ATCCGAACTC GTTTGTGTCG ATGTGCAGAC CCAAGCGTGT GCGAATACGC GCAGCGCGTT
GGAAGACTGC GTCGGCTCGG ACGTAGTGGA GCAACGCGTT CGGATAATCC AAGCATCTCA
CGCACCATTG CCACTACCCA CTGATACCTC CTCAATCGCC CTGGTAGTCT TTAATCTCGG
GTTCTTACCC CAATCGGAAA ATAAAGCCCG TCAAACCCAG ACCGACACAA CCTTGGCTGC
CATGGCCGAC GCCTGTACGG TCTTACGCAT TGGCGGCCTG CTATCCGTCA TGACCTACCC
AGCGTCCAAC GCTCACGAAG ATGCGCTGGC CCGGGCCTTT ATGGAATCTT TGGCGCTATA
CAGCTCCAAA ACAGAAAATT GGGAAACCTT TGTGGACGAA TTGATGTTTC CAGCCGAGAA
TGATGACACT GCAACAGCCG ACGCCGAAGA TTGGAACGAA CAGCTCCGAC GGTCACTACG
GTATGTTTAC GAAGAAAATG GACCCACACA GACCTGGCGG GTTCACGAAC ACCGGAAAAT
TGGATGGAAG AACGCGCCCG TCTTGCTCAC CGCAATCCGA ATCAAGTAG
 
Protein sequence
MAERRPHPPS LSVAILQILI LYHMLQRAAA LATSRISSPY TKRGLVRVVA FSVSNPNEEQ 
VQDMLSNVLS RATRTLSSKS YTNFSVNSTC SSEPWRNESR VDDDSVLDML EAIRVGTANG
NSSMVRTDAT RSTNSAVSSS FSPSVTSAIR LDSLFTNSIS RAKRTFTTKV EFQEASKSNK
LVDDTVTTIL DKVQQQGRVP SPTPAPDPLP RPTDTDLHYQ QNPAISATAL AHSLWGYVLR
PGLDSAIDAT AGNGGDAATI ATMLFSNVTQ SSTSLMPTSR SELVCVDVQT QACANTRSAL
EDCVGSDVVE QRVRIIQASH APLPLPTDTS SIALVVFNLG FLPQSENKAR QTQTDTTLAA
MADACTVLRI GGLLSVMTYP ASNAHEDALA RAFMESLALY SSKTENWETF VDELMFPAEN
DDTATADAED WNEQLRRSLR YVYEENGPTQ TWRVHEHRKI GWKNAPVLLT AIRIK
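
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal Python sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and the GC definition used here (G plus C over total length) is an assumption about how the record's GC content figure was derived.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (ASSUMED definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "CATGGCAGAA AGACGACCGC ATCCTCCTTC TCTTTCGGTA GCTATTTTAC AAATTCTGAT"
)
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Passing the full gene sequence block through `clean_sequence` and comparing `len(...)` with the listed Gene Length is a quick way to confirm that no lines were lost when copying the record.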