Gene PHATRDRAFT_18335 details


Gene Information

Locus tag: PHATRDRAFT_18335
Symbol:
ID: 7197229
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1159193
End bp: 1162459
Gene Length: 3267 bp
Protein Length: 995 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177765
Protein GI: 219112027
COG category:
COG ID:
TIGRFAM ID:
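
The reported gene length agrees with the start and end positions if the coordinates are read as inclusive (End - Start + 1). That reading is an assumption, made only because it matches the listed 3267 bp. A minimal Python sketch of the check follows; the function name `span_length` is illustrative and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(1159193, 1162459)
print(gene_length)  # 3267
```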


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACCG CGAACTTTTT GGATATTTTG AAGCCTCCAT TGGATGATCG AGAATATGTT 
GCGTACACAT TAGAAAACGG ACTTCGCGTC TTGTTGTGCT CGGACGAGTC TTCGAACGAA
GCAGCCGTAG CTATGGATGT GCATGTCGGT GCATGCTCCG ACCCGGCGGA AGTTCCAGGA
ATGGCACATT TCAATGAGGT ATGAGTCGTA AAAGTAAATT GTCCTTTGCC ATAGTCTCCA
TGTTGAGTAA TTGCTCACAG TTGTCGGTTT TACATAGCAC ATGCTGTTTC TCGGGACGAA
GAAATATCCA AAGGAGGACT CCTTTGAAGC CTTTTTGGCT TCAAACGGTG GTTCTTCTAA
CGCTTACACG GCAAGCGAGG ATACGGTATA CTTCTTTGAT ATGGCAGCGG AAGCCAATGC
AAAATTCGCG GAAGGACTGT CTCGCTTCGG TGCTTTCTTT ACAGCTCCTT TGTTTACAGA
AGGTGCAACG GGTCGAGAAC TCAACGCTAT TGAAAGCGAG AACGCGAAGA ATCTGCAGTC
AGATACTTTT CGTATTTTCC AAATCGATAA ATCCCGAGCA AATCCAGACC ACCCTTACAG
CAAATTTTTT ACTGGTAACA AAAAAACTTT GTTAGACGAT ACCAAGGCAA AGGGCCTAAG
CCTTCGAGAG GAGCTCATCA AGTTTTACAA CAACTACTAT TCGGCCAACC AAATGACGTT
AGCTATTGTT GCTCCGCAGT CCATCGAAGA CCTGAAAAAC ATGGTTACGG AAGCATTTTT
GGATATTCCG AATCGAAATG TTGATACGCC TGAGTCCTCA TGGGCCGGCA TTCCTCCTTT
CATAGACGAG AGTTCGATCC CATCTTTCAA AAACGCGATC GAGATAGTTC CTGTGCAGGA
TCTTCGACAA ATTATGATTT CATGGCCAAT TGTGTATAGC TCAGAGGATC AAAGGCAGGA
TGACTTACTA AATAAGCCGA CAACGTACAT CGCACATTTA CTTGGGCACG AAGGACCCCG
CTCTTTGCTT TCCTACCTCA AAAGTAGGGG GTGGGCAAAC TCTGTTGGTT GTGCCAACAG
CGAGGAACTT TCTGACTTCG AGGTTTTTGA GGTGGTAGTA GGACTTACGA CCCAAGGCTT
GGCGCAAGTG GATGAGGTGG TAGAGTCAGT GTACGCCTAT ATCAACATGC TTCGTGACCG
CAAGATTCCG AACTATGTGT TTGAGGAAGT CTTTCGGCTT GAAGAACTGC AGTGGCGATT
TTTGACAAAG GGAAGCCCTC GGAGTTATGC TTCGTCCCTG TCTACTGCAA TGCAAAAGTA
TCCGCCAGAA CTGTACGTTG CTGGACCGAG GCGACTAGCG TTGGATGAAT TTATCATCGA
GAAAAGAATG AACGGGCTCG CTCGCTCCGA GTTTGTATCT AGGGAAGCGC TAGAGCGCTC
CCGGAAGCAA GCAGAGCTCT TAGCTGACAA TCTGACTGTA GATAATGCGC TTCTAACCGT
GATGAGCAAA GACTTTGACA ACAAAACGGA TCGCAAAGAA AAATGGTACG GGACGGACTA
CCGGGTCCGC CCTCTATCCG TTGAAACCCT CAGCCGATGG AGACGTGGTA TACGAGCGGA
GCAAATTAAG ATCGACTTTC CAAGACCCAA TCCGTTTATT CCTACCGAGC AAGGTTTGCG
CGTTAAAATT TCACCGTCCG GCTCAATGAA GGCTGCGAAG AGGTCTTTTG AATCCAGAAT
GATGCCCGTC CCCCCTCCGT CTCTGCTTCG AGATGATGGA CCGGACGGTC GATGGAAGGT
TTACTTTAAG GCTGATGATC GTTTCGGGTT GCCAAAAGGT TATATTGTCT TTCAGGTAGT
CACTGGTGAA GCGTTCGCTT CGCCTAGAAG TGCAGCCTTG TCGAATCTTT TTGAAGTCAG
TATTGCGGAC AAAATAGGGG AATACGCATA CGATGGTACG TAAAAGCAGA CACGAAATAT
GGTTACAAGC ACAGCGTTCT AGACTAATTG TGTGAGAAAA TCGTCTGTTG TATCAAGCCA
GCCTTGCCGG CTTAACGTAC GATGTAAAAA TTATGCCAAG AGGAATTCGA TTGACTTTTG
GGGGCTACAA CGACAAACTG AAACGCTTCG CTTCGTACAT TTCGTTGAAG CTGACGACCG
AAATACGTGA TGTTCTTCCG ACGAGTGAGA GTGTGTTTGA TCGATACAAG GATCAAGTAA
TGCGGGGATT GTCTGCATTT GATGTCAAGC AACCGTACTT TCATGCGTCT TATTATTCCC
AGATTGCTCT TCAGCCGCCT CGGTTTCAGT ACGACAATAC CGCACTAAGG GAAGCTATTA
GAGAAGTAAA TTTGAGTGAT TTGATTGAAT ACGTCAACAC TCTTTGGAAG TCGGGCCGCG
GCGAGGCTCT TATACAAGGA AATTTTGATC AAAAAGAAGC CATGGAACTC GTCAAAAACA
TTGGTGATGT CTTGCCGTTT CGACCGATTG TCCAGGAGGA ATACCCTTCA CGCCTGGAGG
CACTGCCTTT GCCTGCTTAC GGCCCAAAGA AGCTGCCAAC CAAGCTAATC GTTGCCGAGC
CAAACCCTGA CAACGAAAAC TCTGTTGCCA CAGTAATGCT ACAAAGTCTC GGCACGTCAG
AGAAGGATCA CGTACTGATC GAATTGATCA GCTCCATTGT GCAGGAGCCG TTTTACAACG
AACTCCGTAC AAAAAAGCAG CTCGGCTACA TTGTATCGTC AGGAATTCGT GCCGTGGGTA
ACAGCCGAAC GCTCTCATTC ATAGTCCAGT CCAGCGTGGC GCCGGCAGAC AAGTTGTCCA
TCGAAATTGT CAAGTTCTTG AATACAGTGG AAGATCGTTT TCTCAACAAG CTCCTTAAAG
CTGACCTCGC CGTGTACGTC AAAAGCCTGA TTGATCGCAA AACGGAACCC GACAAGGAAC
TCGCTACAGA AGTGACTCGC AATTGGGCGG AGATTGCGAG CGGACGATTT CAGTTTGATC
GCATCCAAAG GGAAGCTGCC GCGCTGCTCG ATGTACAAAA GGAGGATTTG CTAGATTTTT
GGAGACGAAT TTATACCGGG GACAATTGCC GTGTATTGGT GACACAGGTA GTTCCTCGCC
AAGGGCCAGC GTCTTCGCCC GTCCCAGCCA AGAGCACGGG ATACAATGAC AAGGATCCGC
TACCCGAAGG ACTAGTCCTC GGGATTGACG ACTTGGATCA ATTCCGCGCC GATAGGCAGA
TGTCAACTTA ATGCTAGGTC TACGACT
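
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in space-separated blocks of ten bases, as above. The sketch below is one hedged way to do that in Python. The helper name `gc_content` is hypothetical, and the calculation assumes GC content means the plain fraction of G and C bases over the whole gene sequence, which the record does not state.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used only as a short example input.
first_line = "ATGGCGACCG CGAACTTTTT GGATATTTTG AAGCCTCCAT TGGATGATCG AGAATATGTT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence (all lines joined) instead of `first_line` gives the value to compare against the reported percentage.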
 
Protein sequence
MATANFLDIL KPPLDDREYV AYTLENGLRV LLCSDESSNE AAVAMDVHVG ACSDPAEVPG 
MAHFNEHMLF LGTKKYPKED SFEAFLASNG GSSNAYTASE DTVYFFDMAA EANAKFAEGL
SRFGAFFTAP LFTEGATGRE LNAIESENAK NLQSDTFRIF QIDKSRANPD HPYSKFFTGN
KKTLLDDTKA KGLSLREELI KFYNNYYSAN QMTLAIVAPQ SIEDLKNMVT EAFLDIPNRN
VDTPESSWAG IPPFIDESSI PSFKNAIEIV PVQDLRQIMI SWPIVYSSED QRQDDLLNKP
TTYIAHLLGH EGPRSLLSYL KSRGWANSVG CANSEELSDF EVFEVVVGLT TQGLAQVDEV
VESVYAYINM LRDRKIPNYV FEEVFRLEEL QWRFLTKGSP RSYASSLSTA MQKYPPELYV
AGPRRLAEAL ERSRKQAELL ADNLTVDNAL LTVMSKDFDN KTDRKEKWYG TDYRVRPLSV
ETLSRWRRGI RAEQIKIDFP RPNPFIPTEQ GLRRSFESRM MPVPPPSLLR DDGPDGRWKV
YFKADDRFGL PKGYIVFQVV TGEAFASPRS AALSNLFEVS IADKIGEYAY DASLAGLTYD
VKIMPRGIRL TFGGYNDKLK RFASYISLKL TTEIRDVLPT SESVFDRYKD QVMRGLSAFD
VKQPYFHASY YSQIALQPPR FQYDNTALRE AIREVNLSDL IEYVNTLWKS GRGEALIQGN
FDQKEAMELV KNIGDVLPFR PIVQEEYPSR LEALPLPAYG PKKLPTKLIV AEPNPDNENS
VATVMLQSLG TSEKDHVLIE LISSIVQEPF YNELRTKKQL GYIVSSGIRA VGNSRTLSFI
VQSSVAPADK LSIEIVKFLN TVEDRFLNKL LKADLAVYVK SLIDRKTEPD KELATEVTRN
WAEIASGRFQ FDRIQREAAA LLDVQKEDLL DFWRRIYTGD NCRVLVTQVV PRQGPASSPV
PAKSTGYNDK DPLPEGLVLG IDDLDQFRAD RQMST