Gene PHATRDRAFT_50103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50103 
Symbol 
ID7198821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp47017 
End bp48900 
Gene Length1884 bp 
Protein Length627 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185036 
Protein GI219129733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATCGA GGAAGGCCTC TCGCTACGGC GTGGAAAACG CTCAATGGAC GCAAGCACCC 
CTGATCTGTC GGACAATCAT TTGTTTCTTA CTGATAAACC AGTCTAGAGC TATAGTGATC
GAGGGCATCC CGCGAAGTTA CATAGGACCA TCTTGTCGTT CCAAGTACTT CTGCTGCCGG
AACCCGGCGT TTCCGTTTAT GGCAGTATCC AGAGCACCCT GTTTACGGAA TCGGCGTCAA
AGAACGAGGC GTTATGAGAG CTTTCGAGAC GATAGGGATA GTGCTTTGGT ATCCGCATCC
GACCTTGTAT TTAATGGATC GACATCGGCT GCTTTAACAT GCTCACCCGA GGAGCAAACC
AGCCGTACTT CACAATCTTC CGCCACTTTT TCCGAAGTCG ATGTACTGTA CGGAAGGCGA
GCTGTGCTCG TGTATGATCC CTTACAAGAG CGCTACGTGA AAGTTTCCGA GAAGAACAGA
GTAGCCGACA GCACCAAGCA AGAGTCGGTA GCTCTACGGG CACGCCGATC ATCTCTCGCT
CGATTTATTA CCACCAAAAT ACTCCCCCGT CTTTCGCTCG CCTTCCTACC ATCAGGTGTC
ACAAACGACT ATTATCGATT TGTTCGTTGG CGTATACTGC AGCGTTTCGT CAATGCCAAT
CTGAACGTCT TTGGCACGCA GAGTCTGCTT TTGGCGCTGC GAATTAAGAG CTCGGCTTCG
CAGCTCGGCG CCTTGTCCGC CGCTCTTAAC TGGGTCCTCA AAGACGCCTT GGGAAAGATT
GTCCGGATGC TCTGGGCTTC CCGTATGGGA CGGAGGTTCG ACTCGGACGC TAAACGATGG
CGGTTTCGTT CCAGTTTTGT CTTTGCTGCT GGCAATGGAC TCGAAATCAT CACCTACGTG
TTCCCATCGC TCTTTCTACT GTGGGCAACG TTGGCAAACT GTTGCAAACA AATATCGATG
CTCACGTCCA GCTCTACACG CACGTCAATC TACAACTCCT TTCGGGACGG ATCACGGGAA
AACATTGGCG ATATTACTGC GAAAGGTGAA GCGCAAATTG CCATTGTCGA CCTATTGGGG
ATCGCGAGCG GCGTAACCTT GTCCCGCACG GTTGGTACCT CAATTCGTGC TGTACTCGCC
GTATACGTAA CACTACAAGC GATTGAGATT GTCTGTGTGT ATCACCAGTT GCGAGCGGTC
ACCTATCGAG TTATGAATTT TGAACGAATG ATTTCCGTTG TGGCAGACTT CTGTCAAGCC
CGCCAAGGAC CAAAAGACGG ATTAGAAGGA CTAGCCGCGT CGTGCACGAC GCCTACTCCC
GCTGGAATTC CCACTCCACA GACATTGGCG TCGCAAGAAC GCATATTTTT GCCACCGAAA
CATTTGACTC GTCGCGCCAT TGCCTTTGGG TCCATCGGCC GTGCCAGGCT CTCTCCCGAC
GAGCTGGGAA CGCTTCTCGA AATTTTTAAG AGAGAGCGTT TCATTCTCGT TGTTGGAAAG
AACGTCAAAC ATCCGAGACC ATTTATGGCG AAGACTGCAA AGCAGAATGA AGATCCGGTT
TCGCGGATTC AAGAAAATTG CCATATTGTG CTGCACGAAG CAGCCACCAA TATGGATATT
GTGAAGAGTA CACTTGCGTT GACGCTTTTA CGACGGAAGT TGGCCTTGTC AAAATTCGAT
CCGTCTCAAG TGAGGTCGTC CAATTGTTTT GATATAATGA AGGTGACGCA AGAAGAAACA
AACGATTTGT TCCCCTTACT GCTGCGAGAA ATGAATACGC AGGGGTGGGA GTCGCCGGCG
CGATTCATGT TTGGAAGGGT GCACATGAGA GCTGACTGGC CTCTCACAGC AAGGTCCAAG
GGAAGAACAA CATCTGCCAC ATAA
 
Protein sequence
MRSRKASRYG VENAQWTQAP LICRTIICFL LINQSRAIVI EGIPRSYIGP SCRSKYFCCR 
NPAFPFMAVS RAPCLRNRRQ RTRRYESFRD DRDSALVSAS DLVFNGSTSA ALTCSPEEQT
SRTSQSSATF SEVDVLYGRR AVLVYDPLQE RYVKVSEKNR VADSTKQESV ALRARRSSLA
RFITTKILPR LSLAFLPSGV TNDYYRFVRW RILQRFVNAN LNVFGTQSLL LALRIKSSAS
QLGALSAALN WVLKDALGKI VRMLWASRMG RRFDSDAKRW RFRSSFVFAA GNGLEIITYV
FPSLFLLWAT LANCCKQISM LTSSSTRTSI YNSFRDGSRE NIGDITAKGE AQIAIVDLLG
IASGVTLSRT VGTSIRAVLA VYVTLQAIEI VCVYHQLRAV TYRVMNFERM ISVVADFCQA
RQGPKDGLEG LAASCTTPTP AGIPTPQTLA SQERIFLPPK HLTRRAIAFG SIGRARLSPD
ELGTLLEIFK RERFILVVGK NVKHPRPFMA KTAKQNEDPV SRIQENCHIV LHEAATNMDI
VKSTLALTLL RRKLALSKFD PSQVRSSNCF DIMKVTQEET NDLFPLLLRE MNTQGWESPA
RFMFGRVHMR ADWPLTARSK GRTTSAT