Gene PHATRDRAFT_51136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_51136 
SymbolPEPCase_2 
ID7203602 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp403161 
End bp406016 
Gene Length2856 bp 
Protein Length877 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182829 
Protein GI219125106 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGACG CCGCCAGCAA GCTCACCGCG ACGGAAGCCC TGGGCGTGAC GCGCGTCTTT 
TCCATCATGC TCAATCTCGT CAACGCCGCC GAAGTCCAGC ACCGCAACCG ACAGATTCGG
GCACACGAGT CCACCAAGGA CCCCTCCGGT GGCCCTCTCC CCAAAACGGA AGATTCCATT
CGCGGAACCA TGGAGACGCT GTTGGAATCG AAACAGGCGA CACCGGAAGA AATATTTGCC
CAGCTGCAGA AGCAAAAAGT GGAAATCGTC CTGACGGCTC ATCCGACTCA AGTCCAGCGC
AAATCGCTTC TGCGCAAGTA CCGTCGCGTT TCGGAGATGC TCGCTTATTT GGAGCGACCC
GATTTGGATG GTTTTGAAAA GTCGTCCGCC CAAACGAGCT TGCAAACAAT CTTGAGCAGC
ATTTGGGGAG CTGACGAAAT TCGAAGACAA AAACCGACAC CACAACAAGA GGCCGCAGGG
GGTAACGCAA TATTGGAGTC GGTTTTGTGG GACGCGGTGC CAGCCTATCT GCGCAAATTG
GATCAACAGT GCCGACTTAC CCTGGGGCAG TCGCTGCCCG TGGACGTATG CCCCATCAAG
TTTGCTTCCT GGATCGGTGG GGATCGCGAT GGTATGTGGA CAACTCTTCA CCGAGGCAAA
GAGTGTAGTA TCTTTGATCC AAACAAATGT GCGCTTTCGA TACAGTAAAC TCTAACGACG
CGACTTTCGT TTGCTGCTCC GCGCAGGTAA CCCCAACGTG ACGCCCGAAG TTACCCGCGA
GGTTGTTCTG CAACAACGAT TGCGGGCTGC TCGTTTGCTT CTCAAGGACA TGTACGATTT
GATCTCCGAA TTGGCAATTT CTAGCCGCTT TTCGCCCGCC ATGGATGCCT TGGCAGATTC
CGTCAAGGAC TCGCAGCATA AGCGTGAAAA GTACCGTCGT GTGATTGGAC ACTTGATCAA
ACGTCTCGTC AAAACGGCCC GTGAATGTGA ATTAGAATTG TCGAAACTCA ACACCTCAGC
TAGTATGGTC AGTCAGACTC TCGTTGAGGA AGCAGTGGAT GGTTGGCAAG ACGTCGATGC
TCTTGACGAT GCGACTGATT TGATCAAGCC TTTGCGCATA ATGTACGATT CGTTGGTTGA
AACGGGCTTC GGTTTGGTGG CCGACGGTTT ATTGGTCGAT ATCATTCGTC GATTGTATGT
GTTTGGTATG TCCCTCGTGC CCTTGGATAT TCGCGAGGAG AGTACCAAGC ACACGGAAGC
GTTAGATGCC ATTACGCGTT GGTTGGGAAT TGGCTCCTAT AGTGAATGGA CCGAAGAGGC
TCGTCTCAGC TGGTTGACTT CTGAGCTTTC CAACAAACGT CCCTTGTACC GAATTCGCGA
ATTGCCCAAG CTGGGTTTCA ATGACAGTGT CTTGAAGACG CTCAACGTAT TCGGCACCAT
AGCTACCCTA CGACCATCTT GTTTGGGAGC CTACGTCATT AGTCAGGCGC AGACCGCAAG
TGATGTCTTG GCCGTCATGC TTTTGCAAAA GCAGTACGGT ATGACGGACA AGAACAGAAA
CATGATGCGT GTGGTTCCGT TGTTTGAGAC CTTGAATGAC TTGACCAACG CGCCCGACAA
ACTCGAACAG CTCTTCAGTA TTCCGCTTTA CGTCGGCGCC GTCAAAGGGA AACAGGAAGT
AATGGTCGGG TATAGTGACA GTGCCAAGGA TGCCGGACGT CTGGCTGCCT GCTGGGCGCA
GTACAACTCG CAAGAACGAA TGGTGAAGGT AGCGGCGAAG CACAACATTG AATTGACTTT
CTTCCACGGC AAAGGGGGTA CCGTAGGACG TGGCGGTAAC CCATCCGTCT ATCGTGCCAT
TATGAGCCAT CCGCCCAATA CCATTAATGG CCGTTTCCGG GTGACGGAAC AGGGTGAAAT
GATAACGCAA AACTTTGGAG CTCCGTCCAT TGCTGAACGA ACTTTGGACA TTTACACGGC
TGGCGTATGT CGCGAAGCTT TTTCTGAGCG CGTGGAACCG TCGCAAGCAT GGCGCGACCA
GATGCAACGG ATCTCCGATG TGAGTTGTGC CGAGTACCGC CACTTAGTCC GTGAGGAACC
GCGGTTTGTT CCCTACTTTC GCCAGGCGAC ACCGGAGTTG GAACTCGGAA GTTTGAACAT
AGGCAGTCGT CCGGCCAAAC GTAACCCGAA AGGCGGTATT GAAAGTCTCC GCGCGATTCC
GTGGACCTTT GCTTGGACGC AGACGCGCAC ACACTTATCG GCGTGGCTGG GAGTTGGCGC
TGGTCTCACA ACGACAGATC AAAGCGAATT GAAGACGCTT CGAGCAATGT ACATTGAATG
GCCTTGGTTT CGTGAAACTA TTGATCTAAT TGCCATGATT GTATCCAAGA CAGACTTTTC
CATATCCAAA AATTATGACG ATCAACTGGT GGAAAAGAAA GAAGGTTTGT TGAAGCTGGG
AGACGAGGTC AGGGAGAAAA TGGTGCAAAC TCGTCAAGCT GTTCTTGATG TGACCGAGTC
TACGGATGTT GCTGGGGCTC ACGTCGCCCT TATGCGAGGG TCGTCGACCA TTCGTCATCC
ATACGTCGAT CCGGTCAACG TTATTCAAGC CGAATTGCTC AAGCGATTGC GAGTAATGGA
CAAGAAAAAG TCTCTGTTGG CGGATGAAAT GGAAGAACAA GAAATTTTAA AGGATGCCCT
GATTATCAGT ATCAATGGCA TCGCTCAGGG AATGCGAAAC AGTGGATAAA GTGCTCCATA
ATATTCTCGG TGGCTCGGCA ACCGTTACAA TGAGTGGGGG TCTCTAGTGA GAAGAGTAGG
TTGTTCTGTA AGCTAACTTG ACTTTATGAT CGAATG
 
Protein sequence
MIDAASKLTA TEALGVTRVF SIMLNLVNAA EVQHRNRQIR AHESTKDPSG GPLPKTEDSI 
RGTMETLLES KQATPEEIFA QLQKQKVEIV LTAHPTQVQR KSLLRKYRRV SEMLAYLERP
DLDGFEKSSA QTSLQTILSS IWGADEIRRQ KPTPQQEAAG GNAILESVLW DAVPAYLRKL
DQQCRLTLGQ SLPVDVCPIK FASWIGGDRD GNPNVTPEVT REVVLQQRLR AARLLLKDMY
DLISELAISS RFSPAMDALA DSVKDSQHKR EKYRRVIGHL IKRLVKTARE CELELSKLNT
SASMVSQTLV EEAVDGWQDV DALDDATDLI KPLRIMYDSL VETGFGLVAD GLLVDIIRRL
YVFGMSLVPL DIREESTKHT EALDAITRWL GIGSYSEWTE EARLSWLTSE LSNKRPLYRI
RELPKLGFND SVLKTLNVFG TIATLRPSCL GAYVISQAQT ASDVLAVMLL QKQYGMTDKN
RNMMRVVPLF ETLNDLTNAP DKLEQLFSIP LYVGAVKGKQ EVMVGYSDSA KDAGRLAACW
AQYNSQERMV KVAAKHNIEL TFFHGKGGTV GRGGNPSVYR AIMSHPPNTI NGRFRVTEQG
EMITQNFGAP SIAERTLDIY TAGVCREAFS ERVEPSQAWR DQMQRISDVS CAEYRHLVRE
EPRFVPYFRQ ATPELELGSL NIGSRPAKRN PKGGIESLRA IPWTFAWTQT RTHLSAWLGV
GAGLTTTDQS ELKTLRAMYI EWPWFRETID LIAMIVSKTD FSISKNYDDQ LVEKKEGLLK
LGDEVREKMV QTRQAVLDVT ESTDVAGAHV ALMRGSSTIR HPYVDPVNVI QAELLKRLRV
MDKKKSLLAD EMEEQEILKD ALIISINGIA QGMRNSG