Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51136 |
Symbol | PEPCase_2 |
ID | 7203602 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 403161 |
End bp | 406016 |
Gene Length | 2856 bp |
Protein Length | 877 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182829 |
Protein GI | 219125106 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGACG CCGCCAGCAA GCTCACCGCG ACGGAAGCCC TGGGCGTGAC GCGCGTCTTT TCCATCATGC TCAATCTCGT CAACGCCGCC GAAGTCCAGC ACCGCAACCG ACAGATTCGG GCACACGAGT CCACCAAGGA CCCCTCCGGT GGCCCTCTCC CCAAAACGGA AGATTCCATT CGCGGAACCA TGGAGACGCT GTTGGAATCG AAACAGGCGA CACCGGAAGA AATATTTGCC CAGCTGCAGA AGCAAAAAGT GGAAATCGTC CTGACGGCTC ATCCGACTCA AGTCCAGCGC AAATCGCTTC TGCGCAAGTA CCGTCGCGTT TCGGAGATGC TCGCTTATTT GGAGCGACCC GATTTGGATG GTTTTGAAAA GTCGTCCGCC CAAACGAGCT TGCAAACAAT CTTGAGCAGC ATTTGGGGAG CTGACGAAAT TCGAAGACAA AAACCGACAC CACAACAAGA GGCCGCAGGG GGTAACGCAA TATTGGAGTC GGTTTTGTGG GACGCGGTGC CAGCCTATCT GCGCAAATTG GATCAACAGT GCCGACTTAC CCTGGGGCAG TCGCTGCCCG TGGACGTATG CCCCATCAAG TTTGCTTCCT GGATCGGTGG GGATCGCGAT GGTATGTGGA CAACTCTTCA CCGAGGCAAA GAGTGTAGTA TCTTTGATCC AAACAAATGT GCGCTTTCGA TACAGTAAAC TCTAACGACG CGACTTTCGT TTGCTGCTCC GCGCAGGTAA CCCCAACGTG ACGCCCGAAG TTACCCGCGA GGTTGTTCTG CAACAACGAT TGCGGGCTGC TCGTTTGCTT CTCAAGGACA TGTACGATTT GATCTCCGAA TTGGCAATTT CTAGCCGCTT TTCGCCCGCC ATGGATGCCT TGGCAGATTC CGTCAAGGAC TCGCAGCATA AGCGTGAAAA GTACCGTCGT GTGATTGGAC ACTTGATCAA ACGTCTCGTC AAAACGGCCC GTGAATGTGA ATTAGAATTG TCGAAACTCA ACACCTCAGC TAGTATGGTC AGTCAGACTC TCGTTGAGGA AGCAGTGGAT GGTTGGCAAG ACGTCGATGC TCTTGACGAT GCGACTGATT TGATCAAGCC TTTGCGCATA ATGTACGATT CGTTGGTTGA AACGGGCTTC GGTTTGGTGG CCGACGGTTT ATTGGTCGAT ATCATTCGTC GATTGTATGT GTTTGGTATG TCCCTCGTGC CCTTGGATAT TCGCGAGGAG AGTACCAAGC ACACGGAAGC GTTAGATGCC ATTACGCGTT GGTTGGGAAT TGGCTCCTAT AGTGAATGGA CCGAAGAGGC TCGTCTCAGC TGGTTGACTT CTGAGCTTTC CAACAAACGT CCCTTGTACC GAATTCGCGA ATTGCCCAAG CTGGGTTTCA ATGACAGTGT CTTGAAGACG CTCAACGTAT TCGGCACCAT AGCTACCCTA CGACCATCTT GTTTGGGAGC CTACGTCATT AGTCAGGCGC AGACCGCAAG TGATGTCTTG GCCGTCATGC TTTTGCAAAA GCAGTACGGT ATGACGGACA AGAACAGAAA CATGATGCGT GTGGTTCCGT TGTTTGAGAC CTTGAATGAC TTGACCAACG CGCCCGACAA ACTCGAACAG CTCTTCAGTA TTCCGCTTTA CGTCGGCGCC GTCAAAGGGA AACAGGAAGT AATGGTCGGG TATAGTGACA GTGCCAAGGA TGCCGGACGT CTGGCTGCCT GCTGGGCGCA GTACAACTCG CAAGAACGAA TGGTGAAGGT AGCGGCGAAG CACAACATTG AATTGACTTT CTTCCACGGC AAAGGGGGTA CCGTAGGACG TGGCGGTAAC CCATCCGTCT ATCGTGCCAT TATGAGCCAT CCGCCCAATA CCATTAATGG CCGTTTCCGG GTGACGGAAC AGGGTGAAAT GATAACGCAA AACTTTGGAG CTCCGTCCAT TGCTGAACGA ACTTTGGACA TTTACACGGC TGGCGTATGT CGCGAAGCTT TTTCTGAGCG CGTGGAACCG TCGCAAGCAT GGCGCGACCA GATGCAACGG ATCTCCGATG TGAGTTGTGC CGAGTACCGC CACTTAGTCC GTGAGGAACC GCGGTTTGTT CCCTACTTTC GCCAGGCGAC ACCGGAGTTG GAACTCGGAA GTTTGAACAT AGGCAGTCGT CCGGCCAAAC GTAACCCGAA AGGCGGTATT GAAAGTCTCC GCGCGATTCC GTGGACCTTT GCTTGGACGC AGACGCGCAC ACACTTATCG GCGTGGCTGG GAGTTGGCGC TGGTCTCACA ACGACAGATC AAAGCGAATT GAAGACGCTT CGAGCAATGT ACATTGAATG GCCTTGGTTT CGTGAAACTA TTGATCTAAT TGCCATGATT GTATCCAAGA CAGACTTTTC CATATCCAAA AATTATGACG ATCAACTGGT GGAAAAGAAA GAAGGTTTGT TGAAGCTGGG AGACGAGGTC AGGGAGAAAA TGGTGCAAAC TCGTCAAGCT GTTCTTGATG TGACCGAGTC TACGGATGTT GCTGGGGCTC ACGTCGCCCT TATGCGAGGG TCGTCGACCA TTCGTCATCC ATACGTCGAT CCGGTCAACG TTATTCAAGC CGAATTGCTC AAGCGATTGC GAGTAATGGA CAAGAAAAAG TCTCTGTTGG CGGATGAAAT GGAAGAACAA GAAATTTTAA AGGATGCCCT GATTATCAGT ATCAATGGCA TCGCTCAGGG AATGCGAAAC AGTGGATAAA GTGCTCCATA ATATTCTCGG TGGCTCGGCA ACCGTTACAA TGAGTGGGGG TCTCTAGTGA GAAGAGTAGG TTGTTCTGTA AGCTAACTTG ACTTTATGAT CGAATG
|
Protein sequence | MIDAASKLTA TEALGVTRVF SIMLNLVNAA EVQHRNRQIR AHESTKDPSG GPLPKTEDSI RGTMETLLES KQATPEEIFA QLQKQKVEIV LTAHPTQVQR KSLLRKYRRV SEMLAYLERP DLDGFEKSSA QTSLQTILSS IWGADEIRRQ KPTPQQEAAG GNAILESVLW DAVPAYLRKL DQQCRLTLGQ SLPVDVCPIK FASWIGGDRD GNPNVTPEVT REVVLQQRLR AARLLLKDMY DLISELAISS RFSPAMDALA DSVKDSQHKR EKYRRVIGHL IKRLVKTARE CELELSKLNT SASMVSQTLV EEAVDGWQDV DALDDATDLI KPLRIMYDSL VETGFGLVAD GLLVDIIRRL YVFGMSLVPL DIREESTKHT EALDAITRWL GIGSYSEWTE EARLSWLTSE LSNKRPLYRI RELPKLGFND SVLKTLNVFG TIATLRPSCL GAYVISQAQT ASDVLAVMLL QKQYGMTDKN RNMMRVVPLF ETLNDLTNAP DKLEQLFSIP LYVGAVKGKQ EVMVGYSDSA KDAGRLAACW AQYNSQERMV KVAAKHNIEL TFFHGKGGTV GRGGNPSVYR AIMSHPPNTI NGRFRVTEQG EMITQNFGAP SIAERTLDIY TAGVCREAFS ERVEPSQAWR DQMQRISDVS CAEYRHLVRE EPRFVPYFRQ ATPELELGSL NIGSRPAKRN PKGGIESLRA IPWTFAWTQT RTHLSAWLGV GAGLTTTDQS ELKTLRAMYI EWPWFRETID LIAMIVSKTD FSISKNYDDQ LVEKKEGLLK LGDEVREKMV QTRQAVLDVT ESTDVAGAHV ALMRGSSTIR HPYVDPVNVI QAELLKRLRV MDKKKSLLAD EMEEQEILKD ALIISINGIA QGMRNSG
|
| |