Gene PHATRDRAFT_50124 details


Gene Information

Locus tag: PHATRDRAFT_50124
Symbol:
ID: 7198926
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 117088
End bp: 121271
Gene Length: 4184 bp
Protein Length: 1144 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002184970
Protein GI: 219129595
COG category:
COG ID:
TIGRFAM ID:
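
The reported gene length is consistent with the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal Python sketch of the arithmetic (the function name `span_length` is illustrative):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = span_length(117088, 121271)
print(gene_length)  # 4184

# Bases needed to encode the reported 1144 aa protein, stop codon not counted.
coding_bp = 1144 * 3
print(coding_bp)  # 3432
```

The 3432 coding bases are fewer than the 4184 bp genomic span, which fits the record marking the gene as spliced.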


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.190181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGTC CTCTCGAGGT TCCTCCTCCA TACTCCTCAT CGTCCTCCTC CCCACCCGGA 
CCGAACGTTA CGCGGGGTAA CCAGAACTTG CCGCCACCGG GTAGCACCTC TTCGTCGTTG
CCGGATGACA ACAATGGATT GGACGCGTCA ACAACCAAAC CGACGCCGAT TCACGTTTGT
TGTTCCGTCT TGAATACCGT CATTGATCAC GTCATGGTCT CACCCAGTGC ACTGGAACCG
GAAGAGCTTC GAGCACAGTA CGTCGACGAT GCGGTGTTGT CGAGATCCAC CGAGTTGGGA
GCCGCATCAC AGACACGCAC ACCCCTGCAA GTTCCCGTTC TCCGTGTCTT TGGTCCCATC
TTGCGTCGGG ACAACGACGA CAGCAATCCA GATGGAAATG GATCGTTGGA TCCGCCAACC
CAGTCGGCCT GTTTGTACAT TCACGGTGCC TTTCCCTACT TGTTGTGTCG GCCAGTCGTG
GCAGGCGCCG ACGGATCCTG GCACCGATCG TCACACAATC ACCACGGCCT GACGCCATCC
GGACACCTCG ATTGGGACGA TGCCGCGGCG GTGGAACGCA TATTGCCCGT TTTGCACGAA
CATCTCGAAG CCTCGCTGCA AGCGTCCCTC CAACAATCCT CCCTCGGTCT CGACAAGCAC
AGCAACAGTA ACCGTGAAGC TACCGGGAAT TCACGACAAC CACCCAAGCC ACCGGCAACC
AAAATTATTC GCCGACTGTC TCTCGTGGTC GGTCGTGGAT TTTACACCTA CTGCGCGGGT
CCGCCCGCTC CCTTTGTTCG GGTCGAATAC TACAATCCCA AATCACGATG GAAAGTTAAA
ATGCTCCTGG AGCGTGGCTT GGAGCTCCCC AGTTTGTACC ATCCCGATCC AATCCAGTAC
GAACCGGCGG CTCGGGAAGA TCACGTCGAG CCAGACATTG ATGCGGCCAA TGCGGAAACC
TTGTCCTTTC ACTGCTACGA AGCACACATT CCGTACACGA TGCAGTTCTT CAAGGACTAC
AATCTGGCCG GCATGCGGTA CATTCATATT GGTAAGACCA AATTCCGACA ACCCTTGCCA
CGAACCCGAC GCGCCCGATT TTGGCACAAG CATGACGTTC TGGTGGAACC ACACGTGCGG
GACGAAACCA TGTTTCTCGA GTCCAACACC CCAGCCGTCT ATCGATGGAA TGACGACACC
CAAACCAATG TGAGCCTATT ACACGTATCC CCGACCGCAC AAAACGATTT CGGCGTGGAC
GATTCGGTGG CGCATTCGGG TTTGGGAGAC ACGTTTGCGT TGGCGCAACT ATCACAGACA
GATATTCCGA GCAGTCCATG GTCGGACCGC CCCGACAATG CCAGAGCGGA AGAGGCTGTG
TCCCACGCTG AAAGTACACC CCAACACTCC AATCCAGACG CGCGAGAATG GATTGTCTCG
CCAGGAACGT ACGAATGGGA GAAACGCCTC GAAGCGCAAG CGCCACCCAC GAAAGAAACC
ACCTGCGATT TGGAACTCGA TATTCACGTC GACGATATAC TAAACGTCCA CGAAGTAATT
CGAGAAGTGC CAACCATTAC CAACAATTCG CGACAACCAG TGCATTGGAG GGCGGTACCC
AGTCTACGAG AGATTTGGGC CGAAGAACGT ATCCGCATGG GCAAACTACT ACCTCCACAG
GATGACTTTC TGAGTCCAGA CCGTGGAGCT AGTACCACCA CACCACCCTT CACGTTAAAT
GTCCAGCTTC CAGATTCGGC TTTGCCGGGC ACCAGACTTG CAGTCACAGG AATGAAACGT
CTACGAGACT TGACGCTCGG CTTGGACGAT AACTTTCGTC GGGTCATGAA AGATATAATT
GCCCGCCACG ACGTGGCTGT GCAGCGAATT GACGAAGGTC TCGCACGCCG ACGATTAAAT
TCCGATAGAT TGGCAGAGGC TCAGGAATGG GGGCTTTTGA ACCCCAACCG TTCGTCGGGT
CCGAAAGGTT TGACCCCCTC CGATCAGGAA GCGACAGACA CTTTGGCAAT GCTTGGTTCT
TTATTCAAAG AACCATCACC ACCACACGCA AACAACAATT TCTCGAGTGA TCAAGGACCG
GGATCATCTT CGAAGTATAG TTGGTCATCC AGTCCGCAGA ATTTTTCTTC GAGCCAAGAC
ATCACAGTCA ATGGCAAAGC GCATGGTATC GAAGAGGATC GACATTCTTC GCAGAGCGAG
CGCTTTCAAA GCTTGTCTCA AACGTATTAC GATGGTCAAG CTGAAGCTGC TGCGGAAAGT
GACTTTGAAT TGAGCCAAAG GATGGAGCGA GGAGATGGGA TCTTCGAGGG GCCGTTCGAA
TATGTAGAAG ACTTTATTGA TCCGGAAACG CTGGCGCCTT TTGAGTCAAT CGACGAGGAT
GGGGATGATT TGTTCGATCT AGACAGCGAT AGCGACGATG GCGCAATGGA TGAGTCTCGA
ATAGAGGAAG AGCTAACGCA ACTTGCAACT CAAACGTACA ACAAATCAAT GGAAGATTAC
ACCGACGATT CTAGATCAAA ACCGTTCGAT ATGAAGGGTA GCCAAGATGG TTATTGTGCC
TCATCGGGCC GTGACCCAGT CAGCACAGAA AAGTCCATGG CAGAAGCCGA CTTATCGGAC
CATGATCCAG CCAGCAGAGA AAAATTCATG GCGGTTGACA TGAAAGCCGA CAACCCAACG
CCTATGTCTG GGAAGCAAAA TTTGCAAGGG TGTGGTAATG AGAGCGCTAG AGAAATTTCC
GCTTTTACAA CAAGTCTTCT GAGCTCATCT GACTGCCATG CTCCTTCTAA GTGCTACGTT
GAAATTTTGC GAAATCCTCC TACACGAAAG GCATCGAAGA ACAGCGGTAC ATCCGTCGGG
TACCATCCGT TAGCAACCGT TGGTGATGTC CCGCCGTGGT TGTTTTTTGC AGAGTATCAA
AAGCTTCGTG GTACTACCTC CAGCGCTGCA CCATTTTCTT CGTTTCCTTC CATTCCCGAT
GGTGGATTAA GTGTCCTTCC GACGAAATCC CCTCCTACTC GGCGAGCTGT ACAAGGGTGG
ATGCTGCGAG AACGCAAACG AAAGCAGCCT TTGGATTCGG GATGCGACCA GGAGCACGTC
GCAGAGAAAA AGCGAATTGC AGGCGCTTCT GCGAGCCTTT CTGTTGTCGC TGAGATGAAT
GTTGCCATCG ACAGAATTCC TCGTCAGCTT GAATTGCCAC AGACAGCAGA AATTCATTGC
AAGGAATATG CTGTCAGGGA CAAAGATCAA GCTGGAGCTT TGACCGTTGA AGAAGTAGAC
TGGTCCAAAA GCCAAAATCT TTCACAGTAT CAAGCTTCGC AAACCGGAGA CGAAATTGAG
ACTGACCATT CAACAAGTAA CGGAGGGTTT CAAGTCGTTT CGACTGCTGT TCAAGAGAAG
GTCAGAAGTA GCGGTGACAC TGCGAAAAAT CACTTACCCA CAGACTATGA GTCATTGAGT
CAAAGCATGC CAATCAGCGA ATCAGGCGGA TCGTTTTTTG TTGCTCAGCC ACTGGATGGT
ATTGGTAATC AAGGGGGTCG AATATGGGTG GAAGGTGGTG GCGTTTTGAA AGCAAACACA
AGGTCCTCAG CCCGACAGTC AACAGCCGAC AAAAGCCCGT GCTCTACTAA GGCACATTCC
GAGACTATCG GCGGGCCTCA CTTACCCTCA CCACTTTCAG TCATGATCAT CGACGTTCAT
GTGCAGTGTC GGTCGGGTCG CGCTGGAACA TCTGATTCGA AAACGATTGC TTTGACCCCC
GATTCTGATC GAGACAAAGT CGCTTCAGTG ATTTACTTGT ATGGAAAGGA TCCTGGCGGA
GGAGAGTCTC TGGAGATTCT TGAAAGAGGA TGTATCTTTA TTCCTGTAGA GAAGGAACTA
TCCAACACGT CCTTAGACTT CAACGAGAAA AAACTCCTGA AACGCTTCGC AGACGATCTT
CGTCTGGCGA TGCCGACGAA AGTCTTAGGA TACGACGCGC CCTTTAGTGT AGACTGTGTG
AGCGACGAGA GGCAGCTTTT ACTCAGGCTT TCTTCGGTAG TATATTCCAA GGATCCCGAC
TTGCTGCTTA GTTGGGACAC TCAAGGTACA GGGCTCGGGT ACTTGATAGA AAGAGGCTCC
AAAGTCTCGG GTACAACAAA TTCAGACTTA TCTCGAGAGT CGGA
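
The gene sequence above is printed in blocks of 10 bases separated by spaces, so the whitespace has to be stripped before counting bases or computing GC content. The following is a minimal Python sketch under that assumption. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content figure was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTCGTC CTCTCGAGGT TCCTCCTCCA TACTCCTCAT CGTCCTCCTC CCCACCCGGA"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full sequence text to `clean_sequence` and taking its length gives a quick check against the Gene Length field.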
 
Protein sequence
MSRPLEVPPP YSSSSSSPPG PNVTRGNQNL PPPGSTSSSL PDDNNGLDAS TTKPTPIHVC 
CSVLNTVIDH VMVSPSALEP EELRAQYVDD AVLSRSTELG AASQTRTPLQ VPVLRVFGPI
LRRDNDDSNP DGNGSLDPPT QSACLYIHGA FPYLLCRPVV AGADGSWHRS SHNHHGLTPS
GHLDWDDAAA VERILPVLHE HLEASLQASL QQSSLGLDKH SNSNREATGN SRQPPKPPAT
KIIRRLSLVV GRGFYTYCAG PPAPFVRVEY YNPKSRWKVK MLLERGLELP SLYHPDPIQY
EPAAREDHVE PDIDAANAET LSFHCYEAHI PYTMQFFKDY NLAGMRYIHI GKTKFRQPLP
RTRRARFWHK HDVLVEPHVR DETMFLESNT PAVYRWNDDT QTNVSLLHVS PTAQNDFGVD
DSVAHSGLGD TFALAQLSQT DIPSSPWSDR PDNARAEEAV SHAESTPQHS NPDAREWIVS
PGTYEWEKRL EAQAPPTKET TCDLELDIHV DDILNVHEVI REVPTITNNS RQPVHWRAVP
SLREIWAEER IRMGKLLPPQ DDFLSPDRGA STTTPPFTLN VQLPDSALPG TRLAVTGMKR
LRDLTLGLDD NFRRVMKDII ARHDVAVQRI DEGLARRRLN SDRLAEAQEW GLLNPNRSSG
PKGLTPSDQE ATDTLAMLGS LFKEPSPPHA NNNFSSDQGP GSSSKYSWSS SPQNFSSSQD
ITVNGKAHGI EEDRHSSQSE RFQSLSQTYY DGQAEAAAES DFELSQRMER GDGIFEGPFE
YVEDFIDPET LAPFESIDED GDDLFDLDSD SDDGAMDESR IEEELTQLAT QTYNKSMEDY
TDDSRSKPFD MKGSQDGYCA SSGRDPVSTE KSMAEADLSD HDPASREKFM AVDMKADNPT
PMSGKQNLQG CGNESAREIS AFTTSLLSSS DCHAPSKCYV EILRNPPTRK ASKNSGTSVG
YHPLATVGDV PPWLFFAEYQ KLRGTTSSAA PFSSFPSIPD GGLSVLPTKS PPTRRAVQGW
MLRERKRKQP LDSGCDQEHV AEKKRIAGAS ASLSVVAEMN VAIDRIPRQL ELPQTAEIHC
KEYAVRDKDQ AGALTVEEVD WSKSQNLSQY QASQTGDEIE TDHSTSNGGF QVVSTAVQEK
TMSH