Gene PHATRDRAFT_37636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37636 
Symbol 
ID7202455 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp589214 
End bp591338 
Gene Length2125 bp 
Protein Length657 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181590 
Protein GI219122518 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCCCT TCACACCTTC TTTTGTGAGC TGTCGTCCTG TTTCACCCTA CGGCTTCCAG 
CCCATCTCCG GAGACCCGCA GCAAGACCAG CGCGTAATGA CCCCGCTTCA ATTCTCGCCA
ACACCGATGT CGCCGTACGG AAAAATATCT CCCAGTCCTT CCGTAATGAC GGAGGAAACA
TCTTCCCTCA CATTTGAAAG CAACAGCCGC TACAGCCCTG CAACTTTTCG CTCTAATACA
CCATCGTCGT TTCGATCAGT TACACCCCAG TCGGACCAGT TCATAGACAG TAGAGGCCAA
CGTGTCACCA CACCACGCCC CAAAACTCCA AACGGCCGCA AGTCACCCTT TGCCAAAAAA
GAAGAAGACG ATGCGGTCCG AAAAACTCGT ATTAAAACTG AGCTTTGCAT ACATTACGCG
AATGGTAGAC CGTGTCCTTT CGGAGCAAGT TGCACCTATG CACACGGTGA AGAGGAGCTT
CAGTTGACTA AACTTCTGGA TTTACACGAA GCTGGTCTGA TTGATGTCGG GATCTTTCGT
ACAAAACCAT GCTTGACTTG GGTTGCAACA GGATCGTGGT ACGTACAAGG GCCTGCAAAT
TGATAGAATT CTACTTGACT GAAAAGAGCA AAAGACTGAC AATGATTTTT CTGGCTATCC
TATTCGTAGC CCGTTTGGAA AACGATGCAC CGCCATACAC GATCCTCGAG TGGGCGGATC
CCATTCATCT TGGTTGCCTC ATACCGAGAC ACAAGGCAAC ACAATGGCTA CAGACATCAA
TGTGGAGGCT CTTCACCAGA AGCGTCAACA TTCGATTTTG TATGGAACCC CGTTTGGGAG
TCATTTTTCG CTGGAAAATG ATTCTTGGAG TGACCTGTAC AAGCTCGTAT GTCATATCAA
CTATGCCAAA AAGGGATGGA TTGACAAGCG TCGTCGTATG ACTGTTGATC CAGTAACCAA
GTTAGAAGTT GCTCTTCTTA TGCGAGGCGA AGCCAACTGG AGTTTCAAGT TTCGACCACA
ACACATCATT CACGACGAAC TTTGCATGGT TCTTCAGGAA CGTGCCTTTC GAATAGACAG
TCAACTACTA CCTGTGGAAA TTCCGCAGCA CTCATATACC GCTAGCAATC AAAGCCACAT
ATTTGTACGA GAGATCGCCT TCGGACCCGA TGAAGATCCG ACCGTACGAA CGGTTGGTCT
TTGGTTCAAT ATTGATGAGC GAGATGTCCT AGTGTGTACG TCTCAGCAAG CTAAGCGATT
CCGTTGGAAG CGGGGCGTCA ATATAAAGGA CGACACTCAA CAAACAGGGA AATCTTCCGC
ATTCGAGACC CTGGATCACT TCCCTATGAT TCGCCCCCAT GACAGGGAAA CATTCGGCTT
TACCACAAGT CTCTTAAAAC ACCGCCTTCG AGTCGTGCGC GCAGAACGTA TATGCAGTAT
GAGAGGACGA TTTGACGCCT TGCAGAAACT TGAGGGTGAC AAGCAAGTTC TTTATAAGCG
ATTCTTGAAT TTGGCACATT GCTGGAAGGT TTGGCTGTGG CCAATCAATG ATGGAAGGGC
CAGCGTTGAC AAGCACACGC CAGTACCACC AGTAGATGGA AAGTACGAAT TCGGTAGGAC
TGCATCGAAC CTTACTAGAC TGAACTCAAA GCTTGAAGAA ACCAGTGCTC CAATATGGCA
TACTGTGAAC GAAATTTGGG AGTCCTTCGT ATCTGCAGAC TTTGAAAACC TTCACGTGAG
TTTTTACATT TACAAAACTG GATTGTAAAA TGTGACCTCT TTACTAACAT TTCAGGGGAA
ATCATCCCTT TTCATCATAC AGGTTGAAGA ACGTGTATTG CTCAACGTTC GACTTACATC
AAGCAAGCGT CTACGACCGT TTCTGCAGCT TGCGCAAGGC AAACCCCTGT CCCTGGACAG
GCGCTCGCCG CATATCCTGA AACACGATAG GACCACCGAA GAAAACCATC CTTCGCATTC
TCAAGATCAG GATCGTTGCT GGAAGTCGCT GTTGCTGACT TCTGGAAAAT CCATAGAAAA
CAGTGAATGG GAACTGGTTG AACAGCATTT CAAGAACTCT CGAAGCAATA AGGTTCTAAA
CATCATCCAA GATAAAACTG CATGA
 
Protein sequence
MSPFTPSFVS CRPVSPYGFQ PISGDPQQDQ RVMTPLQFSP TPMSPYGKIS PSPSVMTEET 
SSLTFESNSR YSPATFRSNT PSSFRSVTPQ SDQFIDSRGQ RVTTPRPKTP NGRKSPFAKK
EEDDAVRKTR IKTELCIHYA NGRPCPFGAS CTYAHGEEEL QLTKLLDLHE AGLIDVGIFR
TKPCLTWVAT GSCPFGKRCT AIHDPRVGGS HSSWLPHTET QGNTMATDIN VEALHQKRQH
SILYGTPFGS HFSLENDSWS DLYKLVCHIN YAKKGWIDKR RRMTVDPVTK LEVALLMRGE
ANWSFKFRPQ HIIHDELCMV LQERAFRIDS QLLPVEIPQH SYTASNQSHI FVREIAFGPD
EDPTVRTVGL WFNIDERDVL VCTSQQAKRF RWKRGVNIKD DTQQTGKSSA FETLDHFPMI
RPHDRETFGF TTSLLKHRLR VVRAERICSM RGRFDALQKL EGDKQVLYKR FLNLAHCWKV
WLWPINDGRA SVDKHTPVPP VDGKYEFGRT ASNLTRLNSK LEETSAPIWH TVNEIWESFV
SADFENLHGK SSLFIIQVEE RVLLNVRLTS SKRLRPFLQL AQGKPLSLDR RSPHILKHDR
TTEENHPSHS QDQDRCWKSL LLTSGKSIEN SEWELVEQHF KNSRSNKVLN IIQDKTA