Gene PHATRDRAFT_47597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47597 
Symbol 
ID7202653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp224543 
End bp227078 
Gene Length2536 bp 
Protein Length810 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182029 
Protein GI219123433 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCA GCTTTTTCTA TGGCTTTCGA ACCCGACTGT TGATGCTGTT TCTCTTCACA 
ACTCTGTTTG GATTGGTATC AATTCACCTC ATCATTACCA GCTCAAATTC TGTTACGCAA
TATGAACCAA GGCCAGCTCC AAGGAGAGAA AGTTCCGGGG TATTTCACGA CGCGAAAGCC
CATCCAGAAT TTGCTCCAAA GGAAACAAAG CTGAAAGCGC TAGTTGGACT TCCAAAATAT
GCACCCCCGA CGTCTTCTAT TATTCTCATT CGGGCTCTGG GCAACGCCTT GCCGCCAAGA
CACAGTACAA ACCAGACTTT GGATAATCTC GACTTTATTC TTGCGCACGA GGATTCGTTT
CCCAACACAA CGCGACACTG GTTCGTCAAT CGCTTCGTCG ATCCCGAGAT GGAAAATCAA
GTTTTGGATA GACTTCGGAA AGCCCAGGAA TCCTACACTG TCATTCCTTT TGACTTGCAG
ATCTATGACA AAATCGAATA TGCATACGAT CAAATCCCCA AAGACCAGAT TCATCTTCCT
ACCACAAGGA GAATCACCAA GAAGGAAGTA CATCTTGCTG AAGGGCAGAT ACAGCACAAC
AAAATCCTAT ACGTCATCAA CGTAAATGGG GTCAGGAACG CTATGTTAGA CTACGGCCGT
ACTCATTCGA ATGCTGAATA CATTCTTCCT TGGGACGGGA ACTGTTTCAT GACGCGAAAT
GCGTGGTCTT CGATCCAGTC GTCCTTGGCC GAGAATCCAC AGGCTAGGTA CTTTAAAACG
CCCATGGATC GATTGCAGGA GTCAAACGAA GCTCTTCTAT CAGATACGTA CGAGCCCAAA
CCTGTTGAAG AACCCCAAAT TATTTTTCAC CGAATGGCGA GGTCCAACTT TAACGAACAG
CTAAGATATG GGCGAAGAAA CAAGGTCGAA TTGTTGGTGA GACTCGGAGT TCCAGGACCT
TGGGATAAGT GGTCGTGGCT GGATAGTGAG ACGGCCATCT CAGATCGCGC CCATGCTTTC
GACGCCGTCG GAGATGTACC AATAGCCGGG TGGATAACGC GGTTGAATTC AGGAACGTTC
TTGGCTGAAA GGTCTGCCAA GGCTCGAGGC AGACTTCGAA ATAAAGCAGT CACCTTGCTT
CTGGAACGAC TCGATTTTCG AGCTGCTCGG GACTTATATG GCTTGACTTC TTCAACTCTC
CTTTTCTTCA ACGAAAAGCG ATTGTTGGTG GAACGCGCCG AGTGGAAGGC CGGCAAGAGA
AAACAAATCT TTCGAGAGCT GGTACGGCTG GCTGACCAGG CGCTACTAGC TGGACCTTGG
TCGGTCATGG ACAAGAAAAA ATTCGGTTGT GGTATTTCTG GGGATTGTCA TGATTACTTT
CACCCCTCAC CGTACATGTG GCCGCAGAGG AATGAATCCG GGCACACTGA CTGGTCGAAA
CCCTTCAAGC GACGTGACGG TGTGCGAGCT CCTGGTACAT CCTTATTCAG CTCCGGGAGT
GAGCAGTATG ATCGATCCGG GTTGGCTGCA ATGAAGTACA ACACAACACT CCTTGCGTTG
GCCTATTCGT TGACAGACAA CAAGGCATAT GTTGAGAAGG CAGCAAGCAA TCTCCGACAT
TGGTTTATCC ACAATGCAAC ACGTATGAAT CCACATCTCA CCTACGCCCA GGTAAAATGG
AAGGCAGATG CAACAGCAGT GGGATCATCG TATGGTCTTA TTGAGATGAA AGATGTGTAC
TTCTTCTTGG ATGCGGTCAA AATTGTGGAA AAATCGGAAG CACTGTCACT ATTGGAACGC
GACTCTATGC GTGAATGGTT TGCTGACTAT CTTGAGTGGT TGGTCTCAAG CTTGCAGGGT
CAACAAGAAT TTGTCCAAGA CAACAACCAT GGTCTCTTTT ACGATGTCCA GGTTGCACCC
ATTGCCTTGT ATACTGGCAA CATAGCATTG GCATTGTCAA GGATGCAACG ATCAGCTTCG
CGTCTCTTGA CACATATAAA TACCACTACT GGTGTTCTTT CCCAGGAATT AATTCGTCCA
ACGTGTGAGC ACTATCAAGC CTTTACTCTG CAAGGATGGG CTAACATGGC CCGCATGAGC
AGAAAAATTG GCCTGGACTA CTGGGGCCGC TTCCGTGACA AGGCAACCAA TCAGAGTATC
CTGTGTCAGG CAATGCGGTA TGCAAACCCT TACTTGCAAA AGCGAGAGAT ATGTCCCGGG
AACTCACACA GCGAGGACGT GCGGCGATGG TGGCCTCTGC TTGTAGACTT CTCTCAGCAT
TGCCAACAGC CCTCCAACGA AGGGTTGCTC AATGTCAGTG ACTGGATCCC TTCAGCGCTT
CGGAATCCCG ATATAGATCG GTATTTGATG CCCCCTATGT ACGACTATGG AGATGGAATT
GCTCCATTTT GGAATCTAGG GTATCATTGG TAAACACAGT TCACGTAGCA AGCTACCTGC
AGTTGATGAT GTGATATTTG CGGGCACCAC CACTAGACTC CCGTAACAAA TAGATGAAAT
TTCAATTTTC TTCTCT
 
Protein sequence
MARSFFYGFR TRLLMLFLFT TLFGLVSIHL IITSSNSVTQ YEPRPAPRRE SSGVFHDAKA 
HPEFAPKETK LKALVGLPKY APPTSSIILI RALGNALPPR HSTNQTLDNL DFILAHEDSF
PNTTRHWFVN RFVDPEMENQ VLDRLRKAQE SYTVIPFDLQ IYDKIEYAYD QIPKDQIHLP
TTRRITKKEV HLAEGQIQHN KILYVINVNG VRNAMLDYGR THSNAEYILP WDGNCFMTRN
AWSSIQSSLA ENPQARYFKT PMDRLQESNE ALLSDTYEPK PVEEPQIIFH RMARSNFNEQ
LRYGRRNKVE LLVRLGVPGP WDKWSWLDSE TAISDRAHAF DAVGDVPIAG WITRLNSGTF
LAERSAKARG RLRNKAVTLL LERLDFRAAR DLYGLTSSTL LFFNEKRLLV ERAEWKAGKR
KQIFRELVRL ADQALLAGPW SVMDKKKFGC GISGDCHDYF HPSPYMWPQR NESGHTDWSK
PFKRRDGVRA PGTSLFSSGS EQYDRSGLAA MKYNTTLLAL AYSLTDNKAY VEKAASNLRH
WFIHNATRMN PHLTYAQVKW KADATAVGSS YGLIEMKDVY FFLDAVKIVE KSEALSLLER
DSMREWFADY LEWLVSSLQG QQEFVQDNNH GLFYDVQVAP IALYTGNIAL ALSRMQRSAS
RLLTHINTTT GVLSQELIRP TCEHYQAFTL QGWANMARMS RKIGLDYWGR FRDKATNQSI
LCQAMRYANP YLQKREICPG NSHSEDVRRW WPLLVDFSQH CQQPSNEGLL NVSDWIPSAL
RNPDIDRYLM PPMYDYGDGI APFWNLGYHW