Gene PHATRDRAFT_33072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33072 
Symbol 
ID7197053 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1451461 
End bp1454903 
Gene Length3443 bp 
Protein Length872 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177837 
Protein GI219112171 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.988083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTTC CATTTATTCT TGACGGTTTT CAACAACAAG CCGTTGTTCG ATTAGAACGA 
TCTGAGTCTG TTTTTGTTGC CGCTCATACT TCGGCGGGGA AAACCGTTGG TGAGTTGACG
TTGCGTTCAA GCAAGGTGTT CTCTTTTGTG GATTTCTCAC AGTCAGTGCA ACACTCTATT
TACTACTTCT CCAAGTTGCG GAATACGCCG TGGCCTTGGC GAAGCAGCGT GGGACGCGCT
GTGTATACAC GTCTCCAATC AAAGCCTTAA GTAACCAAAA GTTTCGCGAC TTCTCGTTGA
AGTTCGGTGC GGAGAATATT GGTTTGATTA CTGGAGATCT ACAGGTCAAC GCAGACGACT
CAACCTGTTT GATCATGACG GTAAGTGAAT AACAATTTCG AGCGGTCTCT ACCGCATCTT
TTTTAATTTC ATGGTTCTTA CATGAACTGT AAATTGTGCA GACTGAAATT TTGCGGTCTA
TGCTTTATCG AGGGGCCGAT TTAGTTCGCG ACATTGAGTT CGTGGTTTTC GACGAAGTAA
GTCAAAGAGC GATTTCCTGC CAACATTATA TATATGTAGT TGGCCTTATA CGATTCTTCA
CCTCGATAGG TACATTATGT CAATGATACC GAGCGAGGAG TTGTTTGGGA GGAAGTAATT
ATAATGTTGC CCTCTTACGT GAACTTGATT TTCTTGTCGG CGACTACACC AAATACCTTG
GAATTCTCAG ATTGGATTGG ACGGACTAAA CGAAAGCCGG TGTTCGTTAT TAAGACAGAC
TACCGACCGG TTCCCTTATC GTTTAATTTG TGGGCAGGTC TTAAGCTTCA TACCGTAATG
GAGGGTCGAG ATGGATTTCT TGAGAGAGGA TTTGCGTCAG CCGCGAACGC GCTCCTCCCT
GCGTCCGCTA GAGACTCAAA AAAAACTAAG AGCGAATCCA AGGGTCGCCC GCCCGCTAAG
ACTGCAATCG GCTCAAAACA GATGGCATGG CAAGCCCAGG GAACCAAGCA AAACTGGATG
TCCCTTGTGC GCTTTTTGGA CAGAGAAAAT ATGACTCCAA CCGTTGTGTT CTCGTTTTCG
AAGAAAAAGT GTGAGGAGAT TTCTATCATG TTGCAATCGC TTGATCTAAA TACGGCTAAG
GAACGAGGTG CTGTCCAAGG TTTTACGCTG CAGACGGTGG CTCGTCTCTC CAAGAATGAT
TCGAATCTAC CTCAGGTAGT AATGGTATGT GAGATGGTGC AACGGGGAAT CGGTATTCAC
CATGGGGGTC TTCTCCCGAT ACTGAAAGAA ATGGTCGAGA TATTGTTTGC CAAATCACTG
GTGAAAATTC TTTTTGCAAC TGAGACATTC GCCATGGGTG TGAATATGCC TGCACGCAGT
GTAGTTTTCA ACAGTGTTCG GAAGCATGAC GGCAAGCAGT TTCGTCAACT CGAGCCTGGA
GAAATAACGC AAATGGTAAG CTGACCTTCC GTTGACCTTT ACTACTCCAC CACACTCGTC
ACTAAGATTA TCCGTTTCGT TTTTGAAAGG CCGGTCGCGC CGGGCGTCGT GGACTGGACA
AAGTAGGCAC TGTGATTATA TGCTGTTTCG GCGAAACACC TCCACCGCAA CCTATGTTAA
AGCAAATGTT GACTGGGTCA TCGACAAGAT TAAACAGTCG CTTTCGACTC ACATACAACA
TGATTTTGAA CCTACTAAGG GTCGAGGAAA TGAGCGTTGA ATCAATGATC AAACGGTCAT
TCTCCGAGTT TGCTACACAA CGAGCCCTGA CTACGAACGA CTTTCCCCAG TTGCTGACTC
GAGGGATCAG AGCACTAGAG AAGTTGGAAG AAACTTACAA AGTAGAGGCA GCAAGTCGTA
TTGGGTCTGA GGACGTGGAA GAGTATTTCT CGACTTGCAG CGAAATCCTT TCAATAACCG
AACGTCTACT GACAAATGTG AGAGACACGG AGGCAGCATC GTTCGAAGGC ATTCTACAAA
AGGGTAGAAT CGTTTTGATT TCGGCCTGTC GAGAGCTTGG TGCGGTCAGA GCCCCCGCAC
TTGTACTTAA ATCGCCCTCG TTGTCTTCGA AACTGACTCC AAACGTAAAC GCTCGTACCG
ATAGATCCAA TAGCAAGGAA GTACTTGTTG TTTGCCTTGT CTTACTACCA AGCAGTTACA
TCGCATGCCA AAGTGACATC AATAAAAAGC CAGGGACAGT CGGCTACGTT GGGTTGACGC
GCAGCCGTCA TTTTTCTATC AGGAAAATAC GGGTTGGACA GATCCTTCTG GTCTCTTCGC
AGAAGTGTAA TGTTGACACG ACTTCTATTC TAAGGGAAGA GCACAGCTGT CTTGGGGATC
CACGATTCAA TGCGACGTCA TTTCTAGCTC CAGCTCAAGC AACAGAGAAT CCTTTTGCTG
GAATGAAAAC GCGGGGCAAA AAGGGGGCCT CAATGGACAA CAAAAGGGGA TCTGGCACTG
CAAAAGCAGA TGAAGAAGTC GAGAAAGTCC TAGACTCGTT AATGGAAGCG GAAAGAGCAG
AACTTTGTGA TTCTGGTGTA CCATTGTTAG ACCTCCGTGA CTTTTTGAAA CGGGGGGACA
GTGTCTTACG ATCTCGACAG CTGTTCGGCC GACTTGAAGC TGAATTGGAT CAAATGCGGA
ACTACGAAAT CCATCGCCAT CCCAGCCTCG AATCGATGTA CTCTACTGTG GAACGAAAAG
AGAGTTTAAG AAGCAAGGTG AATACTTTGC GCCATCTTTT GTCGAATGAG TCGTTACAAC
TTTTCCCAGA TTTCCTTCAG CGAAAAGCAG TACTTCGCAA ACTTGGATAT ATCGACGAGA
AAGAAACCGT GTCCATCAAA GGACGCGTCG CTTGTGAAAC AAACACCTGC GAGGAGCTGA
TTGTGACTGA GCTGGTTTTT GAAGGGCTCT TGAACGAACT CGATCCAGAA GAGATTGTCG
CCGTCCTTAG TGCTCTAGTT TTTCAGGAGA AAGGCAAGGA AACTTCATTG AGCGTCGAAC
TTCCTGAAAG ATTAATTGTT GCTGTGAGCA AATGAAGACA ATAGCATTAA ATCTGGGCCG
TATTCAAAAG GATGTGGGCT TAGACATAGA TCCTGCTGAA TACAGCGAAA GCTCGCTCAA
CTTCGGTCTC GTTCATGTCG TCTACGAATG GGCACTCGGA GTCCCTTTTA AAAGCATTTG
CGACTTGACT GACGTTCAAG AAGGTTCTAT TGTTCGAAGC ATTACCCGCT TGGACGAGCT
TTGCCGTGAA GTCCGAAATT GTGCACGAGT GGTTGGAAAT CCTACTCTGT ACAGAAAACT
GGAAGCCGCA AGCATGGTGA GTATCTATGG AAGTTCGTAA ATGTTTGCAA AGATCTCGGC
GAAACAACTC ATGCCTTCTC GTTTAAATGC CGTAGACAAT CAAGCGCGAC ATTGTGTTTG
CTTCGAGTCT ATATGTGAGC TAG
 
Protein sequence
MTFPFILDGF QQQAVVRLER SESVFVAAHT SAGKTVVAEY AVALAKQRGT RCVYTSPIKA 
LSNQKFRDFS LKFGAENIGL ITGDLQVNAD DSTCLIMTTE ILRSMLYRGA DLVRDIEFVV
FDEVHYVNDT ERGVVWEEVI IMLPSYVNLI FLSATTPNTL EFSDWIGRTK RKPVFVIKTD
YRPVPLSFNL WAGLKLHTVM EGRDGFLERG FASAANALLP AMAWQAQGTK QNWMSLVRFL
DRENMTPTVV FSFSKKKCEE ISIMLQSLDL NTAKERGAVQ GFTLQTVARL SKNDSNLPQV
VMVCEMVQRG IGIHHGGLLP ILKEMVEILF AKSLVKILFA TETFAMGVNM PARSVVFNSV
RKHDGKQFRQ LEPGEITQMA GRAGRRGLDK VGTVIICCFG ETPPPQPMLK QMLTGSSTRL
NSRFRLTYNM ILNLLRVEEM SVESMIKRSF SEFATQRALT TNDFPQLLTR GIRALENRIG
SEDVEEYFST CSEILSITER LLTNVRDTEA ASFEGILQKG RIVLISACRE LGAILLVSSQ
KCNVDTTSIL REEHSSPAQA TENPFAGMKT RGKKGASMDN KRGSGTAKAD EEVEKVLDSL
MEAERAELYL RDFLKRGDSV LRSRQLFGRL EAELDQMRNY EIHRHPSLES MYSTVERKES
LRSKVNTLRH LLSNESLQLF PDFLQRKAVL RKLGYIDEKE TVSIKGRVAC ETNTCEELIV
TELVFEGLLN ELDPEEIVAV LSALVFQEKG KETSLSVELP ERLITIALNL GRIQKDVGLD
IDPAEYSESS LNFGLVHVVY EWALGVPFKS ICDLTDVQEG SIVRSITRLD ELCREVRNCA
RVVGNPTLYR KLEAASMTIK RDIVFASSLY VS