Gene PHATRDRAFT_50171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50171 
Symbol 
ID7198948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp261348 
End bp263570 
Gene Length2223 bp 
Protein Length740 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185000 
Protein GI219129658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00469189 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATA GTGCATCCTC GGAAAGTTCC AAGGCTACTA TCGGCGTTAC TCTTAGCAAC 
GCTATACCGA GTGATGCGTC GTCACGAGCC CGTTCCATTC TGGGCTCCAT GAAGGAGCAC
CGGGAAACGT TTTCACGGGT CGTACACCAT GGACATTCCA GCCATGGCGA TGATTTCCGT
ATGGCCTCAG CTGTGTCGTT GTCACCCAGT AATCCATTTT CGTCAAAGTC TGAAGATACC
ATGCTGACGA TTGCGCAGCA ACTAAACGAT TCCTTGAACG ATAGGCGGCA GCGACTGTCT
CGAGTTTCGG GTATCGATGT CGGTAACAAA ACAACTACAG TCGACAGCGC CGCTGGAGAG
AACGTGCGTA CAGCTCCGAC ATCCCCAGTG AGTCAACGTT CGGGACACAC GGTTGCAACG
GAAGAATCCA CGGTGGCAAC GGGAACTTCG TCACCTGGAC AGTCGTCCTT CGATCCGTCG
TATGCGACAA CCCAGCCTCC CGATTCTCCG ACACAACAGC AGCAGAGACT TGGAATAGTT
CAGTCTCGGG AACCCGTTGC GATTGCCTCC ACTGCGCAAT TGCCATCGAA CGAGGAAGTT
GAGAATCCTG ATATTCTAGA GAATGCGGAC GTCCACGAGA AACATGGCGG TTCCGACAGC
AAGGCAGCTT GTGAATCGGT ATCTACCGTC GACGTTAGCT TCGGTCCGGA GACGCAGTTT
CGGCATGTTC GCAGCTCCAA GTCGTTTGGT GACAGTGCGC TAGAAAGTCG TAATGCTTCC
GTTCCTCGAT CCATCATCAC AACAAGCTCC TCGTTTGGTC ATTTTTCATT TTCATCGTCT
CCGTATGTAA CGGACAAGAC TACATCCAAG CAGGAGACTG ATTTGCGTTT CTCTCCTTTG
CCCCGGCTGT GTCATATACT TCCTGCGTCT ATACCCCCAG TGCCATTGGA GGATCCACTT
TGGGCAGCAG TCGATGATTG TGCGTCTCCC TTGCTGCGTG GATCCAATCG AACGAACGAC
AATTTGGATA ACACCGCTGG ACAAACGGGC AATATTCAAG GCTCATATGC CACGCTCTAC
CCTAATCAAA TTGAAGCTGA TGCTTGTATT GATGAAATAC ACTCAGCTTC GCAGCGCTTG
CAATACGCAT CAGATGATGA CAATCATCCG GCGCAAGGTG AAGCCGAGTA CTCGCAGCAA
TCGGCAGCCC GAACGTTACG CAGACCACAC GAACTCGAAA AGAACGTATG TGATTCGACG
TCTCCGGATT ACCGAGAATT GCTCAACATT TCAGTCGTTG AAGAGTATGC TGACCGGCGC
CGACTTCTCA AAGCCTTGCG TACCGACGAC CGTCGACTAA AACAAGCTCC AACGGAGCTC
ACCGATGCCT CCAACGCTGA AGAAGCAACG TCAAATTCGG ATGCGCATGG AATCGGAGGG
AGTTCGGAAG AAGACGATCC TTTTAGCAGT GAAGATTCGT CGCTGGATTG TACTACCGAC
TCCGATACAT CGGGTTGGGT CAGTAGTAGC GATTCAATTA ACAGCGCGTT ATCGACTCGA
GTCGGCAAGC GATCAGTCGA CTTTTCCTCA CGGGAAGGAA AGTGCAGAGC AGAATTGAAA
GCGGACTGCG CATCCGCCAA CGTTTCGACA TCCCTGCTAC TATCTTCTAG TAACGACACG
GCACAAAAAG TACCGCGGTC GGAACGGTTA ATTTGTCAAG GTGGACTGTT CTGGATAGCT
TTACGATCCG ATTGGGATCA GTCTCATCCT GATCGAATTC AAACACTTCC CAAACCGAGC
TCGGCTGACG TTCTTCACCT CTCGTCAGAT CTTCGTTGCG GCACGCAAGG ATCCATGTAT
GACCTCTGTC TGAAATCTGG CTTCAACACG AATGCCAAAA CATGGGAACC TGTTGATTAC
GGCACGCAAG GATCTATATA TGACCTCTGC CTGAAATCCG GCTTTAGTAC GAATGCGAAA
GCAACGGAAC CTGCTGAAAT TCGAGAAGAG CAATTCAACG AAGCACGAGG TGGTCTTTGG
TATTTGGCTA GTCGCTCTCC TCTCAGTAGG GACCGTCGGC TAGAGCCATA TTCGGTTAAA
GGAAAACCTC GGACAAGACC TCACGGCGCA ATCAAAATTC AGTCTGACTA CAAAACAGAT
CGAGAGTGTG GAGCGAAAGG CACGCTGCAT CTATGCATGA CGTTGCAGCT ACAGGTGAAT
TAA
 
Protein sequence
MKDSASSESS KATIGVTLSN AIPSDASSRA RSILGSMKEH RETFSRVVHH GHSSHGDDFR 
MASAVSLSPS NPFSSKSEDT MLTIAQQLND SLNDRRQRLS RVSGIDVGNK TTTVDSAAGE
NVRTAPTSPV SQRSGHTVAT EESTVATGTS SPGQSSFDPS YATTQPPDSP TQQQQRLGIV
QSREPVAIAS TAQLPSNEEV ENPDILENAD VHEKHGGSDS KAACESVSTV DVSFGPETQF
RHVRSSKSFG DSALESRNAS VPRSIITTSS SFGHFSFSSS PYVTDKTTSK QETDLRFSPL
PRLCHILPAS IPPVPLEDPL WAAVDDCASP LLRGSNRTND NLDNTAGQTG NIQGSYATLY
PNQIEADACI DEIHSASQRL QYASDDDNHP AQGEAEYSQQ SAARTLRRPH ELEKNVCDST
SPDYRELLNI SVVEEYADRR RLLKALRTDD RRLKQAPTEL TDASNAEEAT SNSDAHGIGG
SSEEDDPFSS EDSSLDCTTD SDTSGWVSSS DSINSALSTR VGKRSVDFSS REGKCRAELK
ADCASANVST SLLLSSSNDT AQKVPRSERL ICQGGLFWIA LRSDWDQSHP DRIQTLPKPS
SADVLHLSSD LRCGTQGSMY DLCLKSGFNT NAKTWEPVDY GTQGSIYDLC LKSGFSTNAK
ATEPAEIREE QFNEARGGLW YLASRSPLSR DRRLEPYSVK GKPRTRPHGA IKIQSDYKTD
RECGAKGTLH LCMTLQLQVN