Gene PHATRDRAFT_47575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47575 
Symbol 
ID7202799 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp155497 
End bp157859 
Gene Length2363 bp 
Protein Length744 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182012 
Protein GI219123398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.759441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACTGGAGTCG TCGCTCTTTT GACCGGCATC CCAGTCTCGT CTTGTCTTGT CTATACAGCA 
GAACTATGAA ATCAGTACAA TTGTTCCTGT GTAGTTTAGG ACTGGCGTTA CTCTTTCTCG
GCACCACGAC GGAATGGCAA CGGCGCCTTC CAGACTTTTT GCGGCTCCCG GGGGAAGCGC
AAGCACACCG TCGCGTCCAA ACGGGTAGTA CGGCTTCGAC TCCGCCCGGG CCCACCGCGC
CGACCCCCGC AATCCGCAAA CCGTCCGTCG GGAGTCCCAC GTCTCCGACT CCGGTGGTGA
CCACAGTCCC CGGCGGGACG ACCCAGACTC TTGCACAAAA CCAAACGACT CCCGAACAAG
CGACTCCCAC GTCACTCCCG ACGGAATCAC CGGCGCCGTC GCCTTCACCG TCGGACGTAC
CCACTGCCAA GCCTACCATC ACGCCCATGC CCACCGACAA GCCCACGGCG TCGCCGACCG
AGTCTCCCGC CCCGTCCCCT TCGCCGTCGT CGATCCCCAC CGTGGCACCG ACCGAATCTC
CGTCGAACGC TCCCTCGCAA GCGCCCACCA CGTCGCCGGC ACCGACCCAA GCGCCTTCGA
CTGCTCCCAG TACGGCGCCC TCGTCTCCTC CGACCGACGT ACCCAGTGCG CATCCCACCG
TTTCGGTACA ACCCAGTTTC CGTCCCAGTC GGTCTCCGTC GGATCAACCC AGTAATACTC
CCACCGTGTC GCTGGCGCCG TCCGCATTGC CTAGTCTCGC ACCCAGTATC AGTCAACGAC
CGTCGCAAGC CCCGTCTTAT ATTAGTGCAC AACAAGCCAT TGTCAACGTG ACGCTTGATC
TCAACGCGCT CTTGACCAAC GCACAGATCG AAGCCTTGGA AGCGGCTACC ATTGTCTACA
TGGAACAAGA GGCCGTTCCG GGAGGCTACC TCGAACAGGC GACTGTCAAC GTCATTGCTC
AGCAAACGCG TCAACCAACT CCCTTTGGTC GAAGCCGTCG GGCTCTCCAG GAATATACCG
TTCTGGAATT GGCACTGGAT GTGGCCGCAA CCTACACGGG ATCGGCAGTC GACTTTGATC
TGGGTCTCTA TATGGCCTCC CGGTTGGATC CACCCAATCC CGTATGGATT CATTTGTTGG
GCAACGAAGA CACTATTTTC TTGCCTTTGC GTCCCCCCAC GCCGGTAGGG ACCCGCAACG
TTACTACCAG TCGAGAAGAA AGTGCCGAGA CACCCTCGGG CATGACCAAA GGCACTTACG
CCGTGGTCAT TCTCACGGCT CTGGCGGCAC TAGCCCTCGG CGCAGTCTCC AGTGTCTATG
CGGTCCGACA GCATCGACTC GAAACCTTGG GGACGGAACT CAAGAGTCCC CGGATGGTCG
CCAACGCCAC GACGTGGAAT ACGGCCGATA GTAACGAATG GCACCAATCT TGTACTCTAG
CGGAACAGGA AGAATCGGGG GAGAACTAGG AGGAAAAAGT AGAAGAAGAC CAGGTCAGTA
CATCGGCATC CTGTCAAGTC TTGCCCCTGG GACTCGACTC GTTGTGTGCG ACCGATACCG
TGGACCATGA AAGCTACTCC CGAGCCACCT CGACTGTCCC GTCTCCCATG GAAAAGGCAC
AGGCGTCCGT GTTTCGACAT ACCGATCCGC CCGTCCGGAG GATGAGTCCG CGGGAATCGT
CCGAAATTGA CTTTGGCCGG AACCGGGCCC TCTTGGATCA GGATGATTCC CTACTGTCCG
GATCGTATGG TGACAGTCGC ATTTCGGCAC GTCCGCCTCC CCCACACCAA ACCACGGGTC
GGTACCCCGG GGCCGCGGCA CATTTTCCCC ATTTCCAACC ACAACGAACA CACCATGCCG
ATTTGGATCA GGAAATTCTT TTGACGAAAA GCGTGGGTGG TACTTTGGGT GCCAAGGCTG
TGGACGATGA CAAGGCCAGT GCGTCGGATT TTTCGTCACA GGCCAAGTTC TATTTGAGTC
GACTCTTGGG AACCAACACA GCGTCCGGTC CACTTTCACA AGCATCAAGT CGCGACCACG
GGACGACGAT TCAAAAGACG GTGTCGTACG ATTCGGTATT GCGTCGACCG GGGTTGTACG
ATGTCTTTGC CCCACCCGGT CCCATTGGTA TCGTCGTCGA CACGACCAAG GATGGTCCAG
CCGTGCACGC TCTGAAGACG ACCTCACCCA TGTTGGGATT GATTCAACCA GGCGATTTGA
TTGTCGGTTT GGACGATCAG GATACCCGCA GCATGACGGC CGCGACTCTG ACGCGCCTCA
TGGCGGCGAA AGCCCAGGAG CACGAGCGTA AAATTACGTT GCTTACGAAC GAGCATGTTC
AAACAGCCTA TCCTTTGTAC TAA
 
Protein sequence
MKSVQLFLCS LGLALLFLGT TTEWQRRLPD FLRLPGEAQA HRRVQTGSTA STPPGPTAPT 
PAIRKPSVGS PTSPTPVVTT VPGGTTQTLA QNQTTPEQAT PTSLPTESPA PSPSPSDVPT
AKPTITPMPT DKPTASPTES PAPSPSPSSI PTVAPTESPS NAPSQAPTTS PAPTQAPSTA
PSTAPSSPPT DVPSAHPTVS VQPSFRPSRS PSDQPSNTPT VSLAPSALPS LAPSISQRPS
QAPSYISAQQ AIVNVTLDLN ALLTNAQIEA LEAATIVYME QEAVPGGYLE QATVNVIAQQ
TRQPTPFGRS RRALQEYTVL ELALDVAATY TGSAVDFDLG LYMASRLDPP NPVWIHLLGN
EDTIFLPLRP PTPVGTRNVT TSREESAETP SGMTKGTYAV VILTALAALA LGAVSSVYAV
RQHRLETLGT ELKSPRMVAN ATTWNTADRR IGGELGGKSR RRPVLPLGLD SLCATDTVDH
ESYSRATSTV PSPMEKAQAS VFRHTDPPVR RMSPRESSEI DFGRNRALLD QDDSLLSGSY
GDSRISARPP PPHQTTGRYP GAAAHFPHFQ PQRTHHADLD QEILLTKSVG GTLGAKAVDD
DKASASDFSS QAKFYLSRLL GTNTASGPLS QASSRDHGTT IQKTVSYDSV LRRPGLYDVF
APPGPIGIVV DTTKDGPAVH ALKTTSPMLG LIQPGDLIVG LDDQDTRSMT AATLTRLMAA
KAQEHERKIT LLTNEHVQTA YPLY