Gene PHATRDRAFT_50572 details

Gene Information

Locus tag: PHATRDRAFT_50572
Symbol:
ID: 7199396
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 158070
End bp: 161111
Gene Length: 3042 bp
Protein Length: 860 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185533
Protein GI: 219130776
COG category:
COG ID:
TIGRFAM ID:
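
The Start bp, End bp and Gene Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = span_length(158070, 161111)
print(gene_length)  # 3042, matching the Gene Length field
```

Because the gene is marked as spliced, this span covers the whole locus on the replicon, not only the coding part, so it is not expected to equal three times the protein length.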


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATTCCAAAA ACGGTCAAAG CGGCGAGTCA TATGACTCTT AGATAAAATT ACTAATTATA 
CATGTTCAAG ATTGTGACGA CACTGAAGAA AGTAGAAAGC CAAGATGACC AGACAGTTTG
TTTTCGCTTC TTTCCGTTCG CGAGAGTCAC TTCTTTCCTG GTCCGTCAAT TGCGATTGAT
CTCTTGGTGA TACAAAGTCA CGTGGTATGG AAACTGAAGA ACATAGTAGT AGCCATGTTT
ACGACGACGA GGAAGACGAA GAGCTCTTTA TGTTGATACA AGAGTCGAGC GGAAAAGAAG
GAGACGTGAT CGAGAGATTC CCTGTTCGCA ATATTTGGTA CCTCGTCGGC GCAGGACTCG
TGGTATTAGC AGCGTCTTCG TTTTTTACCG ATCAGGTTGA AATAAAATTT GATTCGCGGC
TAGAGGACGT TGAAAAGAAG AATCTATCTG CTGTGGACGA TGCTCGGGTC GAAAACGAAT
CGGATATTGG CAACGATAGC GGATACGGGA CTGTGGACGT CAAAGCAGAC TTGCATGGCA
AATCCAACGC GCTCGATGAT TCGAACAAAA GCGCTAGCAC AAATTTCGCT GAACATTCCG
TGTCCATAAA TGATGTGGAA GATTTTTATG CCTCCATCGA GCCTGGCTTG AAACCCCGTT
TCAATGTGCT CTCCCCATCT TATATTCCGC GAGGCCAGCC GCTTGCGCAG GAAAAGCGAA
ATGAAATCAA AACGAAGTGG GGGTCTTGGA CCCTTGTCGA CGAAGTGAGC CGGCCGGAGG
AAGATTTTTT TGCACGGTAT CCTTATCGCG ATGTCCCGAG GAATGATTTT CCAAGTAACG
CCTGGCAAGT GGACGTCGCA TATTTATCGC GATTTCTGCC AGAAGCTGAG GCCTTAGTGA
CGAGGTCGAT GGAAGCGATT CTCGCGGAAT TGGCCCATTC TCCAATAGAA GAGCCAGGTA
TGACAATAGA GGAACGCTCC AAACTCTTCG AACTTGAACA GACTGATCAT GATATCAAGT
TGAAAAACGT CATGTTTGAT GGATGCGGCT ACACTTCTCC AGATAGTGAA GGTTTGCTAG
CGAGACGCAT ATTACACGCC GTCATGACCG AAGATAATTT TAATTTTGTG ATGGGAGGCC
ATTCCGCTGC TGCTGGTAAG AAGCTTTTCG TTTCCCCGTG TGCACATAGA AGGATTGATA
CTTACTTCTT CATATTTTTT CAGGGCATGG CAACAACTTT CAGCAATCCT ACACATTGCA
GTTTCAACGT ATTTTAGAAC CCATCTTGGC ACGGTTGGGT GTACGACTAC AAGCACATAA
TTTTGGTATG GGGGGACTAG GCACGAGCCA AAACGCAATG GCCGCCAGAG ACCTTTACGG
CAATGAGATC GATATTTTAA TGTGGGATTC TGGTATGACA GAGAAATCGA ATTGGCACCA
GAACTTGTTC GTCCAGCAGA GTCTACTGGG AGCCGAGGGG CGTGTTCCGA TGCTGTGGAA
CACTGCCAAT CTATTTACCT ACCACGATGA ACTAGGTGTG GATATCATGC AGGTTGGGGG
GTTCTGGAAA GTGGGAGATA AATATTTGCC ACTTTCAGAC GACCCAGTCC AGGTCAAAGG
GCTTCCTTAC GCAGTGCAGT ACCTGAAGCC GTCTTCTGGA ATGCAAGGTG AAACTCGAGC
TAACCGTTAC AATGGTACGT GTTGGATCGA TCGACCAGAT ATCGATCCGC CTACTAAGCA
AGCCGACTTC CCCGGGGGTC GAGCGAGCTG GCATCCGGGC AACCGTGAAC ATCAATACGC
AAGCCGTCTG ATGTTGTTTC AGGTGCTGAA GGCTTTGCAA AAGGCTATTC ACATTTGGCG
AGATGCAGAC GACTTTGCCT TGCCTGACAA GGCTTGGCAC ATGGGCGAAC ATTACGACTC
CATCAAATCC AAGCTGCAAT CGTGGAACAA TACAGCCTGC AAAGCGAATG CCGTTCTGCC
ACCTCGATGG TGCGAAGTGG CTTTTCAAGG GCGATCCGAA TACCTTCCAC GGGCCAATCC
AAGCGAAACT AGCATTCGGT CGTTGGTTAA AGAGAGTATG ACAATACCGG AAGTGAAACG
CAACTTGTAC GATCCTCCGG ATGTTTGGAT GCCCGTGCTG GATCCTCCTG ATGGAGACAT
CGATGTGCTG TCAATAGTCG AAAATGGGGT TGAATTCGCT GCCAATCGTC AGCGCATAAA
ACATGCTCTA GACTATATTC AAAAGTCGAA ACATCGGGTC GCTTTGTTGA ACAACAATTC
TGAAATCGTA CCTGGAAAGG GATGGGTACG TAAAGTAAAC GATGACGAGG CTCACTCGAT
TCGTTGCATC AAAGTATCTC GACTCACAGG TAACCTGCTC TCATTTGTAT CGGCAGTACT
CTCACACAAA GTCGGCTCCC GACAACTGCG ATGGAACTTA CGATTCCTTT TGCGGGCATT
CTGCGGATGA ACGGTGCCTT TTGATAGGTC ATAACGACGA CCGCGGAGGG CTGACGTACA
ATAGCTTGAG TGGCTGGCTA ATTCTGAACA TCGAAAATGT CCGTGAAGGG ATCATTGCCA
TTAAGGTGTT TGCAAACGCT GACAACCCCT TGACGAAAGG GTGGTGCTCC GTGAACAATG
AAGAACCGTG CACAGGCACT GAAGTCAACA GCGGAGTAGA GGACAATCCT AGCGATCGGC
GCCTGGTCGA TGCTCCACTG TGTGAGGACT TTCGGTTCGA GTTCGCCATT GATGGTGAAA
TAACGAGTTG GACGAGAGTC GAATGGGAAG AGAAAAAGAT GAACGTTGAC CGGGTTGTTG
CCGTATGGAC TCTACTGGAT GATCCCGGCT TTTCCAACGG TAGTTCTGTT GATGTGGAGT
TAGCGATTCG AATGATGGGT TGCGACCAAA AAACAACGCA TGACTTAACC CATGTGTACT
GGGCATAGGA TGTATCGTCT CTCTCTAGAC CCTTACACCC AACCCAAACT ACCGACGATC
TCCTAACTTT AAAGTAGGAG TATAGGCCAT GCGCAGCACC AG
 
Protein sequence
METEEHSSSH VYDDEEDEEL FMLIQESSGK EGDVIERFPV RNIWYLVGAG LVVLAASSFF 
TDQVEIKFDS RLEDVEKKNL SAVDDARVEN ESDIGNDSGY GTVDVKADLH GKSNALDDSN
KSASTNFAEH SVSINDVEDF YASIEPGLKP RFNVLSPSYI PRGQPLAQEK RNEIKTKWGS
WTLVDEVSRP EEDFFARYPY RDVPRNDFPS NAWQVDVAYL SRFLPEAEAL VTRSMEAILA
ELAHSPIEEP GMTIEERSKL FELEQTDHDI KLKNVMFDGC GYTSPDSEGL LARRILHAVM
TEDNFNFVMG GHSAAAGHGN NFQQSYTLQF QRILEPILAR LGVRLQAHNF GMGGLGTSQN
AMAARDLYGN EIDILMWDSG MTEKSNWHQN LFVQQSLLGA EGRVPMLWNT ANLFTYHDEL
GVDIMQVGGF WKVGDKYLPL SDDPVQVKGL PYAVQYLKPS SGMQGETRAN RYNGTCWIDR
PDIDPPTKQA DFPGGRASWH PGNREHQYAS RLMLFQVLKA LQKAIHIWRD ADDFALPDKA
WHMGEHYDSI KSKLQSWNNT ACKANAVLPP RWCEVAFQGR SEYLPRANPS ETSIRSLVKE
SMTIPEVKRN LYDPPDVWMP VLDPPDGDID VLSIVENGVE FAANRQRIKH ALDYIQKSKH
RVALLNNNSE IVPGKGWYSH TKSAPDNCDG TYDSFCGHSA DERCLLIGHN DDRGGLTYNS
LSGWLILNIE NVREGIIAIK VFANADNPLT KGWCSVNNEE PCTGTEVNSG VEDNPSDRRL
VDAPLCEDFR FEFAIDGEIT SWTRVEWEEK KMNVDRVVAV WTLLDDPGFS NGSSVDVELA
IRMMGCDQKT THDLTHVYWA
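
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before they can be counted or compared with the Gene Length, Protein Length and GC content fields. The following is a rough sketch of one way to do that. The function names `clean_sequence` and `gc_percent` are hypothetical, and how the record's own GC figure was rounded is not stated.

```python
def clean_sequence(blocked: str) -> str:
    """Strip spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (blocks allowed)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example input only: the first printed line of the gene sequence.
# Paste the full block to compare against the record's fields.
first_line = "CATTCCAAAA ACGGTCAAAG CGGCGAGTCA TATGACTCTT AGATAAAATT ACTAATTATA"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Applying `len(clean_sequence(...))` to the full gene and protein blocks gives the residue counts to check against the Gene Length and Protein Length fields.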