Gene PHATRDRAFT_50482 details

Gene Information

Locus tag: PHATRDRAFT_50482
Symbol:
ID: 7199322
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 180208
End bp: 184144
Gene Length: 3937 bp
Protein Length: 1188 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185442
Protein GI: 219130584
COG category:
COG ID:
TIGRFAM ID:
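
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which agrees with the 3937 bp gene length listed in the record. The function names `gene_length` and `coding_length` are hypothetical helpers introduced here for illustration. They are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases implied by a protein length: 3 bases per residue,
    plus one stop codon when include_stop is True."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(180208, 184144)  # Start bp, End bp from the record
cds = coding_length(1188)             # Protein Length from the record

print(length)        # 3937
print(cds)           # 3567
print(length - cds)  # 370 bases of the gene span not accounted for by codons
```

Under these assumptions, about 370 bases of the gene span are not coding sequence, which is consistent with the record marking the gene as spliced, though the record itself does not give intron coordinates.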


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCATCGA GGAGTAGATT TAACTGTCTG CAGCTTCACT AACCTTTATT GCACACCGCT 
TTGAGGCTCG TGTGCATATT GGAGGACATT TGCGAAATAA GTTTGCGGTA AGATAAAGAA
ATCGTCCAAG CTTTTGGATT ACCATATTTC AACACCGATC GTCCTTTGGC TAACTTCATC
TTCTACACGT TTACTACGGT AGCTCACGTT GAACTTTAGT TTTCTGACAG AGTTGGCCGG
TAAAAAAGGA TATTTGTAAT TACAAACAGA TTCAGCCCCT CTCTTCTTAA CTACTTTTCC
TAATTCGCTC AAAATGGATC CTGAAGAGCA AAAGATAAGC CGAGGCATTT TGCTATCCGA
AACTAGGGGT ATCTCTAACG AATACGGTGG CGATAGCACT CGCGAGGACC TCGCTAGAAT
CGACTCCAGC TCACCAGCGA CAGAGACACC GCCCGCCAAG AAGCCGAAGC GCGAATCTCC
GTGCCCTGAA AGTAACGGCA TCGCCAACGC CGAGAGCAGC TCCAAACCAA GCAGTACAGT
AGGCAACGAT GCTGCACCTA GCGTTGACTT TCCGCCAGAA GCAGGAGGTA CTCAGCCGGA
TTCTCAAGGT CGAGATGAGA TTGTTAGCTT GTATATTTCT CTCGAGCCAA TCTCCACAAA
AGTGCCACAA ATTCCAAAAT TGACCGAGTT TGAGGTCAAA CAACTGGAAG CAGTCTTAGA
GTTCAAGAAT AGTTCAGAGT GGCGAGATGA TTGGATGGGT AACCTCGCCT TTGCCGATCT
AGACGTGGGC AACCCAGCTA TTAGTAGTAA ATCACGGGAC AAACAACATA CTTTCCGCCA
GCCTTTGATC CAATGGGCGC ACAATGATCA ATCGAACGCC AAATACGTGT GGTTTTTGAT
TTGCCATGTC TATAACATTC CAAGTATTCC TCCGGCAGCC AGAAAAATTC TTGGGGCCGT
GGATTTACGC TCCTCTGTTA ACATGGAAAA AACTTTACGG CGTGTTATGT ATGATCCCGA
AGTCTTGCGC GAAGATGGTT GGACTACCGC CAAGTCAAAT GAGTACATAG GGGCGACCGG
TGGCCCACAC AACATTGGAG AGCAGATCTA TTGGGATGGT AGCAATGCCG TCGTAATTGC
CTACATTCAT GACCCAGGTA AGACCAAGGC AAAAAAGAAT GACAGAATTT AAATCAAGGT
ATTCTCAACT CAAGGATTGC GCGTTTCAGA TATTGGCGAC CTTTGGAAAG CCATTTGGGC
TGGCAACGAC GACGACGATG ATGTACTCAC GACCACCTTT GATCTCGAAG CTGAAGAGCT
ACTAGAAGCG AAGCGAAAAT GGCAACGACG GCAAAGCTCA AGATCTGGAG GAGGTTTATC
GAGCTCACGA CACCCGAAGA AGTCGAACGT AAGCGACGAT TTCAACGTTG CCGGGGTCGA
ATTGGGGATA GTTCTTGGAG CGAGTTACAG TAAAGGTGCC CGAAACGGCG TTTACTGGCC
GGCCCGTGTT ATGCACGCCT CTGAAAAGAT GGGCACGAAA TCTCAAACAA AAAGGCAAAG
TAGCAAAAAC AAGATCGATC TTGTCTTCCT TGCCCCCTAC TGGAATTCAC TTGAGCAGTC
TTTCGCGGCT AGGAAAGTGG AGGCGCTATC TGAAAATCGG AAATCATCGT TTCATTCGAA
TCCTTTGTTT CAATTTGAAA CTGTTGAAGC CACTGACGAT ATGATTAAGG AGTATCTTTA
TCGCCCCGAG TGTGAATTGG ATTTGCAACA GCTTCGCCTT TCGTTCCGAT TCACCGGTTT
ACCTAAGGGA GCATTCTCTC GTTTCGTGGA TGCTCATCGT CTCGCTCTAG GCTTGCAGAA
TTACGCCGTT CGGCATTTGA AGAAAAACGT ATCCGCCACT GATCGTGCTA CTGCTGGATT
GTTCGAGGCC CACCCACTAG CGGTGAGAGC GCCGATATAT CCTTCAATCG TCCTTGAGCT
TCCCTTCGCA TTTATCCTTT CGCAGTTACC AACTCTTTCG AGCCAGTCTG GTTTTGAGCA
CGAAAGACAC GAACCTGTGT TGGAACTGAA GACGATTGTC GACTCTATGA AGCCACCATC
ATGCTGGGGT AAGAACATCA TCGCAGCACT CCCTACGTCA GCAGACGAGG GTCGAAACAT
TCACGAGGGA ACCGGCGATA TCACTCCATG GAAGATTACT ACTATGTCTA ACGGGAAAAG
TTCGAAGAAT AGAGAGTATG AAATCGGGCA TTTTCTTTCA GGTTTGTTAG CTCTTCAGCA
GGCGTTTGAT GATCATACTT CATCGCCTGC AGTTCTTGGA GTCATACACG AGTTCGACAA
CCTTTTGGTA TTAGTGTCAC AGAAAAATAT TACAACCGCT GGAACTGAGT CTGAGCGCAC
TTCCCGGCTT AAAGATTTGG TAAGAAACTG GGCCATCTTA AAAGGGCATG GGGAAGAAGG
GCTGTCGATT GAGAAATCAA GCGGGGAAAG TTTAGTTGTC TCAGAGTGGC ACAGAGCAAC
TGAGCGTCTT TTCAAGTATA TTGTGGAATC TTTCTCTGCA AGCATTAGTC GACGAGGAAT
CTCGACTGTC TTTACCGACA CACGGTGCAA TGGGCACATT ACGTCAAACG ATTGCTTCGA
ACGTGCAGTG AGACTTCCGG CGGCTTTGAA AGGTGCAAAA CTTGCGGGCG CTGGAAGTGA
TGAAAACTGT CGACTTATCT CAGCAGTCGA CGAATCCTAC CTGAAGTATG TCGAACATAC
ACTTTTACCG AAAGCTCACG ACAGTGCATA CCTGAAACGT ATGCGCGGAC GATGCGCAGC
AGCGGTGAAT GAGACTGAAA TTCTTGTACT CACCGAAGAT TCGGAAGGCA ATGGAGGCAG
TGACACTCAC GGATCAAAAG GAACATGGGC GGCAGCGGTT ACAGCGGTGG CAGCGGCCGT
CGCTGCAGCG GATATGATAG TAGGTGGAGA GTCAACAAAC GCTTTCTGCG CTACGCGACC
CCCAGGGCAT CACGCAGGTA AGGGTCTGCA TCCTATGAAA GCAGTCTCCA ATGGCTTTTG
CGTTTTGAAT GCTGTTGCCT GTGCAGCTAT CCATGCGACT TCCTCAATAT TGGAAGGGGG
CCTCGGACTC AAGAGAGTGT GCATTATAGA TTTCGACGTG CACCATGGAA ACGGCACTCA
GGATATTCTC TGCTCGACTT TCAACCCACA TTTTCTCTAC GTCTCGATTC ATGCGGGAGG
TCCGCATGTA AATGGAGTAG CTATTGACGA CGATCCAGAT CATGAGCTAC ATGAACTAGC
AAGTAACCCA AAACAAGGCG GTGGCATTTA TCCGGGTCGC TGCGGTGACA CCTCTCCCCA
CAAAGGAGTA TTGAATATTC CGTTAGGCTC TAAAGTTACT GCCCATGCCG TAGGGGCAGC
TTTGTTGAGC ACTGTAACTC CTGCTGTCAA CAAATTCACA CCGGACCTCA TTATTCTATC
CGCCGGCTTC GATGCACACA AAAGTGATCC AATGTGCTTG GGAAGTTTAA ACGCTGAAGA
TTTTGGCCAC ATCACCGAGG TTTGCTGTCA ACTTGCATAC AAATCCTGCA GTGGTCGAGT
ATTAAGCGTA CTGGAGGGAG GCTACGGTGT TCCCTGCTGC CGACCACAGA AGAATGTATT
TATTCCTTCT CCCGGTCGAG GCGACCGGGA AATCGAGTCG ATTTCTCAAA AGCAAAAGGA
CGGACCTGAA TTGCCTCCAT CCACTCCGAG TGCTTTACAA ATACCCCGTC CACAGCCATC
GAGGTTATTG CAGTTGGGAG ATGACTTACC GGAATCAATG GACGATCAGG TTCCATTCGC
TCTACAGCGT CGGCTCGAGA AGTGCCATGC CGAGGGTTTC GTCGAATGCG TCAAGGAGCA
TGTCGCCTCG TTAATGCGAT GCAACAAACG CACGTAG
 
Protein sequence
MDPEEQKISR GILLSETRGI SNEYGGDSTR EDLARIDSSS PATETPPAKK PKRESPCPES 
NGIANAESSS KPSSTVGNDA APSVDFPPEA GGTQPDSQGR DEIVSLYISL EPISTKVPQI
PKLTEFEVKQ LEAVLEFKNS SEWRDDWMGN LAFADLDVGN PAISSKSRDK QHTFRQPLIQ
WAHNDQSNAK YVWFLICHVY NIPSIPPAAR KILGAVDLRS SVNMEKTLRR VMYDPEVLRE
DGWTTAKSNE YIGATGGPHN IGEQIYWDGS NAVVIAYIHD PGLRVSDIGD LWKAIWAGND
DDDDVLTTTF DLEAEELLEA KRKWQRRQSS RSGGGLSSSR HPKKSNVSDD FNVAGVELGI
VLGASYSKGA RNGVYWPARV MHASEKMGTK SQTKRQSSKN KIDLVFLAPY WNSLEQSFAA
RKVEALSENR KSSFHSNPLF QFETVEATDD MIKEYLYRPE CELDLQQLRL SFRFTGLPKG
AFSRFVDAHR LALGLQNYAV RHLKKNVSAT DRATAGLFEA HPLAVRAPIY PSIVLELPFA
FILSQLPTLS SQSGFEHERH EPVLELKTIV DSMKPPSCWG KNIIAALPTS ADEGRNIHEG
TGDITPWKIT TMSNGKSSKN REYEIGHFLS GLLALQQAFD DHTSSPAVLG VIHEFDNLLV
LVSQKNITTA GTESERTSRL KDLVRNWAIL KGHGEEGLSI EKSSGESLVV SEWHRATERL
FKYIVESFSA SISRRGISTV FTDTRCNGHI TSNDCFERAV RLPAALKGAK LAGAGSDENC
RLISAVDESY LKYVEHTLLP KAHDSAYLKR MRGRCAAAVN ETEILVLTED SEGNGGSDTH
GSKGTWAAAV TAVAAAVAAA DMIVGGESTN AFCATRPPGH HAGKGLHPMK AVSNGFCVLN
AVACAAIHAT SSILEGGLGL KRVCIIDFDV HHGNGTQDIL CSTFNPHFLY VSIHAGGPHV
NGVAIDDDPD HELHELASNP KQGGGIYPGR CGDTSPHKGV LNIPLGSKVT AHAVGAALLS
TVTPAVNKFT PDLIILSAGF DAHKSDPMCL GSLNAEDFGH ITEVCCQLAY KSCSGRVLSV
LEGGYGVPCC RPQKNVFIPS PGRGDREIES ISQKQKDGPE LPPSTPSALQ IPRPQPSRLL
QLGDDLPESM DDQVPFALQR RLEKCHAEGF VECVKEHVAS LMRCNKRT
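
Both sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that clean-up, assuming the text has been copied as-is from this page. The names `clean_sequence` and `gc_fraction` are hypothetical helpers, not functions from an existing library.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "TTGCCATCGA GGAGTAGATT TAACTGTCTG CAGCTTCACT AACCTTTATT GCACACCGCT "
dna = clean_sequence(first_line)

print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this one line only, not of the whole gene
```

Applied to the full gene and protein sequences, `len(clean_sequence(...))` can be compared against the Gene Length and Protein Length fields, and `gc_fraction` against the GC content field.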