Gene PHATRDRAFT_49723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49723 
Symbol 
ID7198404 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp66437 
End bp68565 
Gene Length2129 bp 
Protein Length658 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184481 
Protein GI219128567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.382168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACT TCGGTGACGT CGGCATGTTG ATGCCGAATC GTCCTCCAAC AACCGATATG 
AGAACGAGAA CACCTTGTGA GTGGCAGCCG GTGCTGGACT GCACCCACCG GCCATCTGTG
GACGACCCCA AAAATATGGC TTCCCTCATT GGAGGACTGT CTGTGCTGCG CAGGTCTGGT
CCCACTACCT TGAAATTCGA ATCTAGCTCT AGCTTGAATT TTGAACAATA ATAGTTCGAA
AGTTTTGTGC GCCGTGGTCC TGTGGACCGT GCACCACCGG GAATCACTGT CCGCATCCAC
GTGGAAGAAT CGAGAAACAC GAAAACGTCG TGTGGTGTTT TGGTGCGTGA CGGTGAGAGT
GCGCACAGTG TGGCGTTCGG GTCTCCGATG GATGAAACCC GTACACCCCA ACAAGAAGGA
CAACGTAGTA AGCCAAACTC GTGACAAGCT CCCGTACGCT CATGCCCATT CCAAGCAATA
CCAAACATGC CTACGTCGTT TACAATGAAC CCCGCACGGA GTCGACATGG ACAGCTGCAC
CGTTCTAGCG TGAGAAAAGC GCTTCTTCCG TTGCTGGTGT CGATGGTGCA CGCGGGTCGA
TTGCCGTTGG GAATTCTCCT TTCGTCGCCG TCGGCTTTTT CGGAAATAAG AACGAATAGT
GAATCAGCGT TAGGCACAGT ACGGCACCAG TTACACAACG AAGCCTTTGG TGGGTCCGGA
GACCAAACGC GGTTCAGATA TCCGGGTCGA AAAGGACCGC TTTGCGCTGT CACCAAGTTG
CGAGGAGGTG ATGTTTCCGA CGGGGAAGAC GAATCTTACG ACAGTGACGG GGAAGATACC
AATGATTCGG ATGAAGAGGA GGAAGCTGAG ATTGCCGCGG TACCGGAGTC TCGAGATCCA
CAAAGCGAAC CAGAAGACGA ATCCGAGGCC CATCCTTTAT CTTCGGTCAT TTCTATGGAA
CCGGTACCAA TCACCATCAA AACATCGTTA GGTACCAAGG CTCTAGACCA CATAGTAGAG
CTGACGGTAC ATCGCTCGAG GAATATTGCT TCGCTTAAAC TGAGTGCTAG TCGCCAGCTA
CCAGGACGGC CACCAGTTTC TGTGATGCAC ATGTTGCTCA ACGGAAAAGT ATTGAGTGAC
GAAATGCTAC TGGACGAGTT GATTGATGAC GACGACGAGA AAGACGATAC CGACGGTACT
GGTAGTGAAA GCCAGCTCAC TTTAACGTTG GATATGCTAC CACCTGTCGA TCCCAAATTT
GTCGGCCAAC TAGAGTCACA AATGAAGGAT ATGACCACCG CAGAGCTGTT GGGAACATTC
GCTGCCAATG AGGCTGCATT GTACCAGAAC GCGGCACTCT TGCTTGCCGA GCAAATGGAG
CCAGTATACG ACGATGATCA AGTGGAGGTA GGCATTTCCG AGGTGGCGAC ACACCCACCG
CCGCTCGTGA ACGTGCAAGT TCGCGAGCAA GCAGCACGCA TCCGTCGAGA CTTGGAAAGT
AAAATCTTGG CTTCGGAGCA CTCGCAAAAG ATTCTCGCTG ACCCTTTGCC CCCTTCCGCC
AAACTAGCAG ATCTACAACG TGTCGAGCGT CGTGGCCAAC GGGTTCGACG AGTCGCGGGA
TCGGGTGGCG TAACGACCGG CTTGAAACGA TCGATTCAAA AAAATCTGAA CGTACACTGG
GGTGACGCTA TACGGAATTT TTGTCTCTTT TTGTTCTTTG GATACTTTGG TGGGCGTACA
CCCGTAAGTC GGGCTATTCT GTTGCTGGGT GCACCAAGCG TCTTTGTGCT ACAGGCACGG
CCCGTCAAAC TGTGGATCAA ATGTCTCATG TACGCAATGC TCGACCATCC GCCTGGAATT
TTCTTGAGTT TGCTGCCCGC CCCCCAACAG GCCATTTTGA GTCTGAATGT GGGCGAGGAA
ATGAAAACTA TTTATGGTAA TACACTGACC AACACGGTGG TGAACGAGGT TGATGCAGAG
CCGGAAGAAC TGGCAGACCT GTACGAAATG ACCGACGTAA TAATTGACGG CGAAGATGAT
GACGAGTTCT ATGCTGTCGA CGAATACGAA AGTAGCTATG ATGATGACGA CGAGTAATTA
TGTATACTAA TTGCTGTTTC GAAATCGCT
 
Protein sequence
MSNFGDVGML MPNRPPTTDM RTRTPCEWQP VLDCTHRPSV DDPKNMASLI GGLSFESFVR 
RGPVDRAPPG ITVRIHVEES RNTKTSCGVL VRDGESAHSV AFGSPMDETR TPQQEGQRTI
PNMPTSFTMN PARSRHGQLH RSSVRKALLP LLVSMVHAGR LPLGILLSSP SAFSEIRTNS
ESALGTVRHQ LHNEAFGGSG DQTRFRYPGR KGPLCAVTKL RGGDVSDGED ESYDSDGEDT
NDSDEEEEAE IAAVPESRDP QSEPEDESEA HPLSSVISME PVPITIKTSL GTKALDHIVE
LTVHRSRNIA SLKLSASRQL PGRPPVSVMH MLLNGKVLSD EMLLDELIDD DDEKDDTDGT
GSESQLTLTL DMLPPVDPKF VGQLESQMKD MTTAELLGTF AANEAALYQN AALLLAEQME
PVYDDDQVEV GISEVATHPP PLVNVQVREQ AARIRRDLES KILASEHSQK ILADPLPPSA
KLADLQRVER RGQRVRRVAG SGGVTTGLKR SIQKNLNVHW GDAIRNFCLF LFFGYFGGRT
PVSRAILLLG APSVFVLQAR PVKLWIKCLM YAMLDHPPGI FLSLLPAPQQ AILSLNVGEE
MKTIYGNTLT NTVVNEVDAE PEELADLYEM TDVIIDGEDD DEFYAVDEYE SSYDDDDE