Gene PHATRDRAFT_30585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_30585 
Symbol 
ID7198343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp175129 
End bp178302 
Gene Length3174 bp 
Protein Length942 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184578 
Protein GI219128770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTCGTCCGG GGTCGCCTGG GGCGAAGAAG ATGATCACAA CGAGGAGCTT GGACTACACG 
ATCAGGTGCA GACTAGCCAC CAAGCCAAGG TACAGGAGCT GCTGAAGGAA TCAGATGACG
AGTTTAAACA ACAACGCAAA ATACGAAAAT GGGGAAAATT TGCCAACGTA ACTAAACGCG
AAGACCTGCA GGATGTCTTA CAGGAGGAGC GCAACGCAAT TGATCGAGAA AATGCGCTGA
AGGCATTCAT GGCCCGTTCG AGCGGGATTG AGCTGCAAGT CTTGGATCCA CGGGATGAAA
TGGCAAGCGG AGTGCCGTCC GTCTGGGATG AAACTGGAAA CGTGCAGATC ACCGGTGGCT
CGGTGAAATC ATGGTTTGCC GAAGTGGACG AGGATCTCGA GTCTGAGTGG CAAGCCCTCA
TGGGCGCTGG CGGTGCTGCT GGTGACGTGG CGGTAGAAAA GACATCTGGT GAATTGGTCG
CGAGGGACAA ACTAGCGGGT ATACGAGTAG GAAGTGCGGG CGGCTGGACG TTGGAAGTCT
TTCCGGGCGA TTTTGTTGTT CACCGCAAAT ACGGTATCGG TCGATTCGAG ACGACGTGCT
TGCGGCCAAA AACGAAGCTC AACGAAGAAG AACGACTAGC GCAAGAAGAA AGAAGGGCTG
AAATTCTCAC TACCGAATTA CGCAAGCGCA AGCGGGTAAC ACCCGACGAA ATTCAAGAAA
TACGTGCAAG ATTTGGCACG GAAGAAGATA CGGACCCACT ATCCAATCCA CAAACTACTG
TCTTGGAGAT TACGTATGCA GACGCCGTCG TGCATGTACC TGTCGATCGC GCATACCGTC
TCAGTCGGTA TCGCGCTGGG GATGCCGTGG TCAAACCCAA ACTTTCCCGT GTCAAAGGTG
AAGCATGGAG CAAGGCGAAA CAAAAGGTGG AGGAAAATAC CTTACAGCTG GCACAGGATG
TGCTGGCACT CTACGCAACC CGTGAAACAC TCCAAAGACA ACCCTTTGAT CCATCAGTGG
AAGACGTTGT CCAAGAATTT AGCAAGTCGT TCCTGTATGA ACCGACGACG GACCAAAAGA
AGTGTTTCGA AGAAATTGAA AACGACATGG TTTGGCGAAG TCGTCCAATG GATCGTTTAA
TTTGTGGTGA CGTTGGCTTC GGAAAGACGG AAGTGGCTAT TCGTGCTTTA TTCCGGTCCA
TTATCAACGG TCGCCAAGCG GCCTTGCTAG CACCTACTGG AGTCTTGGCT GCCCAGCACT
ACAAAAATAT TGTCAAGCGC ATGGGACCCG GCACAGAGTA CAATATAAAC ATTGCCTTGT
TGCGAGGGGG GATGGGTAAA CAGACCAAGG CTGGAAGAGA ATTGCGTGGA GAGATTGAAG
GAGGCAAGAC ACAGCTTATC GTGGGAACCC ATGCACTCTT GTCCAACGAA ATGAAGTTTA
AGAACTTGGG TTTGCTGGTA GTCGACGAAG AGCAACGGTT TGGTGTCAAG CAAAAAGAAC
GCCTCAAGTT AATCTGTGAT GGAATCGATG TTTTGACGTT GTCTGCTACC CCAATTCCTC
GTACTTTGCA AATGAGTTTG AGTGGAATTC GCGATACATC GACAATTCGG TCGCCACCGC
CGATGCGAAA ACCCACAGTC ACGCACGTGC AGGATTTTAG TGAAGATATT GTAAAGACTG
CCATCTCGAC AGAACTGGCG CGTGGAGGAC AATGCTATTA CGTAGTTCCT CGTATTTCTA
TGCTTGATGA AGCCGAGCAA ACGATCCAAA GCCTGTTCCC AGGAATACGC ATCATTCAAG
CGCACGGCCG AATGCAACGC AACGGCGCGG AGGAAAACGT CGCCGAATTC GCCGAAGGCA
ACTACGATGT TTTGCTCGCT ACGACGGTCA TTGAAAACGG TGTTGACATT CCTTCCGTCA
ACACAATTGT CGTGCAAAAC AGTCAAGCTT TTGGAATGAG CACCCTGTAT CAGTTACGTG
GTCGTGTTGG TCGTTCTGAC AAGCAAGCCT TCGCGTACTT TTTGTACCGC GAAGAATCTA
TCACGGAACA AGCAGCTATG CGTTTGCAGG CAATAGGGGA ACTTTCAGAA CTTGGCTCCG
GATTCGACGT GGCGAATCGA GATTTGGAAA TTCGTGGAGC CGGAAGTTTA CTGGGAACGG
AACAGAGTGG TATGGCGGCC AAAGTCGGTT TTGATTTGTA CATGCGCATG TTGAAAAAGA
GCATACGCAA GCTCAGGGGT CTCGACTTGC CTCTAGTACC ACGTACTAAC ATTCTATTTC
CGACAGATGG ATCGCCCAGT ACCTTTAGCT TGCCAATGTC TTTCATAGAG CGTCAAAGCG
AACGTCGCAG TGAAGAAACC AAGGCTCGTC TGGCCGAAAG CACTTCAGCG TTGGTCACCT
TGACCAATGA GTGGAAATCT AAATACGGGT CGCTCCCCTC CACCCTGCAA AACCAGCTCA
AGACTTTACA TCTGCACGCT TGTACTCGTA GGTTGGGAAT TGATCTCGTC GGTCTGGTGG
ATGTTTTTGG CAATGGGAAG CGCATCGATT GTATTCTGCG TTCACCGGGT CTTCGCCCGC
GGCACTGGGC CACGATTGTC CCAATGCTGG CCAAGGGTAT TGCCCCCAAG GGTTTAGACG
TTGTATTTCC TGCTCGTTTC ACGGTCACAG GTGAAGAAGT AGAAGTGAGA GGTGGCCGAA
AGATGAATCT ATTAGAACTC GTCAAGGAAG AGACTTTCAA CGAAGAGTTG GAAGAGGAGG
ATTGGGACGC CATGGACGAA GAAGAGGTCG AGGCAATGAA GGACATTAGT TCGGCCGTAA
ACGTTTTGGA TATGGACGAG GTTGATCTGG AGCAGTATCC ACGTTTTGTG GTGAGGGATT
TTCAGGATGC CGACAAGGCC GTTGACCGCC TTTTGAAATT GCTACTGCCG GTTGCCAAGA
TCGTATATGA GAAACAAGAA GACCAAGCGG AAGCCGCTCG CATGGCCGCA GAGCTTCGTG
ACAAACAAGA GCTTTTACGC CAACGAAAGA AAACGAACGA AAAGCGAGAA GCCCAGCGTC
TGGGTTACCA GTATTAATTG CCCGGCAGTC GAGTTTCCAC TCGTTACCAC TCGTAAGAGT
TAAACGCTCT GTACAACTGT AGCTAACATA ACACGTAAAT TTAGTTGTTG GTCT
 
Protein sequence
MARSSGIELQ VLDPRDEMAS GVPSVWDETG NVQITGGSVK SWFAEVDEDL ESEWQALMGA 
GGAAGDVAVE KTSGELVARD KLAGIRVGSA GGWTLEVFPG DFVVHRKYGI GRFETTCLRP
KTKLNEEERL AQEERRAEIL TTELRKRKRV TPDEIQEIRA RFGTEEDTDP LSNPQTTVLE
ITYADAVVHV PVDRAYRLSR YRAGDAVVKP KLSRVKGEAW SKAKQKVEEN TLQLAQDVLA
LYATRETLQR QPFDPSVEDV VQEFSKSFLY EPTTDQKKCF EEIENDMVWR SRPMDRLICG
DVGFGKTEVA IRALFRSIIN GRQAALLAPT GVLAAQHYKN IVKRMGPGTE YNINIALLRG
GMGKQTKAGR ELRGEIEGGK TQLIVGTHAL LSNEMKFKNL GLLVVDEEQR FGVKQKERLK
LICDGIDVLT LSATPIPRTL QMSLSGIRDT STIRSPPPMR KPTVTHVQDF SEDIVKTAIS
TELARGGQCY YVVPRISMLD EAEQTIQSLF PGIRIIQAHG RMQRNGAEEN VAEFAEGNYD
VLLATTVIEN GVDIPSVNTI VVQNSQAFGM STLYQLRGRV GRSDKQAFAY FLYREESITE
QAAMRLQAIG ELSELGSGFD VANRDLEIRG AGSLLGTEQS GMAAKVGFDL YMRMLKKSIR
KLRGLDLPLV PRTNILFPTD GSPSTFSLPM SFIERQSERR SEETKARLAE STSALVTLTN
EWKSKYGSLP STLQNQLKTL HLHACTRRLG IDLVGLVDVF GNGKRIDCIL RSPGLRPRHW
ATIVPMLAKG IAPKGLDVVF PARFTVTGEE VEVRGGRKMN LLELVKEETF NEELEEEDWD
AMDEEEVEAM KDISSAVNVL DMDEVDLEQY PRFVVRDFQD ADKAVDRLLK LLLPVAKIVY
EKQEDQAEAA RMAAELRDKQ ELLRQRKKTN EKREAQRLGY QY