Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_27976 |
Symbol | PEPCase_1 |
ID | 7201832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 553748 |
End bp | 556971 |
Gene Length | 3224 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181027 |
Protein GI | 219120583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.205513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTGCGTTCT GTTCCTTTCT GCAAAGAATC CTTGCGTACG TGAGTTGCGA CTCACATCAA GCATCAGTGA AACGAAGTGA CACAAGTCGA TTCATCGTAA GAATGTTGTC GTCTTCCTGC CGTCGAAGTT TCCTTGCGGC GAAGACTCGG TTGCGCTCAT GCGTGACCAC GTCGTTGTCG ACGGGTTGTC CGTGGAGTGC CATTTCCAGC GGATCCACAA GTCGCCATAT CGATCGGTTT TTTTCGACCC ACAGTTCCTT CGATGAACCC AACCCGTCCT TGTTTGGTGC TTCTCCATTG CAAGCGTCGA CGGTATCCAG CGATGCTACT TCGATCCCTT CCAACGAAGC CGATCGCGAT ATTCAATTGC GAGCAGACAT TAAAGTCATG GGTAGTTTAC TGGGACGAAT CATTCAAACG CACGAAGGCG CGGAGGTACT GGAAAAGGTC GAAACCATGC GCGGCTTGGC CAAGACCTGG CGCGATCAAG GGGCAGGCCG CGATCCCAGT ACGAAGCAAG CCGCTGACCA AACCTTTCAA AACCTCGCCG CGTACGCCAA GAGCTTCACC GATGCGGAAC TCTTTACCGT TAGTCGGGCC TTTACGCACT TTTTGGCCAT TGCGAATGCG GCCGAATCGC ATCATCGTGG ACGGCGTCTG AAGCAATCAC GCCTTCTTTC GGACGAGTCG TCGGGAGCGC TCTATCCCAA GCCGGACAGT GTTGGAGGGG TTCTTCCTTC TCTGCTCGCT CAGGGACACG ATGCGGACGC GATCTACGAC GCCCTCACGT CGCAAACCAC CGAGCTTGTT TTGACAGCCC ATCCGACTGA AGTTAATCGG CGCACTATTC TCAACAAGAA GCGCAGGATC CAGCGCATTC TCACCATGGC CGATCAACAG CGTCAGCTTG GTGCCTCTAG CGTCTTTGAG CAAGCCGAAC TCAATGACGC CTTGTATCGG GAGATCTCCA GTATTTGGCT ATCTGATGAA GTCTCTCGTA TCAAACCATC TCCAGAAACG GAAGCTGAAA AAGGAACGCT CGTGTTGGAA ACGGTGTTGT GGGAGGCCGT ACCGACCTTT TTGCGTAAAT TGGACGCCAC CACGCGCGAG TTCCTCGGTA AGCCTTTGCC TCTCGATTCG TCCCCGATTC GGTTTGCATC CTGGATGGGC GGAGATCGGG ACGGGAATCC TAACGTGAAA CCGGATACGA CCCGGCAAGT GTGCTTGCGT AATCGTCAAA AGGCAGCCAC CCTTTTTGCC CGTAATTTGC GAACGCTCGA GGCGGAGTTA TCCTTGACCA CGTGCAGCCG CGAAGTCCGG GAAGTGGTAG GCGCTGCCCG AGAACCTTAC CGTATATTCT TACAGCCCAT GATTCGGAAA ATGGAAGCGA CCACTGATTG GGCCGCCCAA GAGTTGGCGA TTTTGCAAAA ACGTCGTAGC GGTGACAAGA GTGCCTCGGG TATTGCATCT GTCGCTAGCA CCAACGTGGA AGGCATCTAC CTTGATCAGG AAGAATTCAG GGCGGAACTG CTCACAATCT ACCGCTCTCT ACAAGAAACA GGAAACGAAG TGGCTGCCAG CGGCATTTTG ACAGATATTA TTCGGAATCT TTCCTCCTTT GGGTTGACGC TCATTCCTTT GGACGTCCGC CAGGAAAGCG ACCGCCACGA AGAAGCCCTA GACGCTATTA CCCGGTACCT CGGATTAGGT AGTTACATAC AGTGGGATGA ACAGACGCGC GTTAGCTGGT TGACAACTCA AATTTCGTCC AAACGCCCAT TGCTTCGAGC AGGAGTCTGG TACGAACATC CGGACTACTT CTCGCCAACC GCAATTGATA CACTAGAAAT CTCTCGAATG ATTGCCGAAC AGCACGAAGG GAGTTTGGGG GCCTACGTCA TTAGTCAAGC GACCAGTGCA AGCGATGTCC TTGCCGTGCT CTTGCTGCAA TTGGATGCTG GTGTCAAAAA GCCTCTTCGT GTCGCGCCTC TTTTTGAAAC TTTGGACGAT CTGAACGGCG CCGCTGATAC AATGCGACAG CTGTTCAGTC TTCCTGCGTA CATGGGTACC ATAGGTGGGA AGCAAGAGGT CATGATAGGT TACTCCGATT CCGCGAAAGA TGCCGGTCGA ATGGCGGCAA CGTGGGCTCA ATATGAGACA CAAGAGACGT TGGCCAAGCT CGCCAAAGAA TTTGGAGTCG ACATGACGTT TTTCCACGGG AAGGGTGGTA CCGTTGGCCG TGGTGGTAAT CCGCAAACCT TCACAGCCAT TATGGCACAT GCGCCGAAAA CGATCAACGG GCACTTCCGC GTAACCGAAC AAGGTGAAAT GATCAGCCAG AACTTTGGAT ACGCAGATCG CGCCGAACGT ACAATGGATA TTTACACTGC TGCGGTCCTG GCCGAGAAGC TGAGTGAACG ACCGAAGGTC AAAGACGAAT GGAGAAGTAT GATGAAGATC TTGTCGGATA TTAGCTGCGA AGCCTACCGC CAGGTCGTAC GCAAAGATGA GCGCTTCGTA CCCTACTTTC GCTCCGCTAC CCCCGAACTA GAACTCTCGA ACCTCAACAT TGGATCGCGG CCCGCCAAAC GAAAGGCGAC GGGAGGTGTC GAAAGTCTTC GCGCTATTCC TTGGAACTTT GCTTGGACCC AGACTCGATT CAATCTACCC ACGTGGTTGG GCGTTGGCGA TGCCATCGGA CAACTGCTAA AGAGCGATAG AGCTCCTTTA CTCCGGGAAC TCTATCGTGA AGCGCGCGCC TTTCAAACCA TGGTGGATTT GGTCGAAATG GTCCTGGCAA AATCCGAGCC CGCGATTGCC GCTCACTACG ACAGTGTTCT CGTCAAGGAC CCCAAGGCGA AAGAACTAGG TAAGGAGGTT CGTCAACTTC ACATGGCGAC GGAAGAGGCA ATTCTAGATT TGACGGAACA CAAAAAGTTG GGCGAAAACA ACGCGGTGCT TCAGCGTGCT CTGGTTGTGC GCAATCCCTA CGTAGATTGC CTGAATATTT TGCAAGTCGA GACCTTGGAT AGGCTCCGGC AAGTGGAAGA AGGGAAGGAA GATAAGGTCT TGAAGGACGC GCTCCTCACG ACCATTACAG GGGTTGCCAA TGGAATGGGC AACACTGGTT AAAATACTTG CATGTCTGAT CTATGGAATG ATTGTAAAGG ATTACATTTA CTGTTAGCGG CCAAAAAAAG GTGGACCTCC TACGGAGCTA GTAT
|
Protein sequence | MLSSSCRRSF LAAKTRLRSC VTTSLSTGCP WSAISSGSTS RHIDRFFSTH SSFDEPNPSL FGASPLQAST VSSDATSIPS NEADRDIQLR ADIKVMGSLL GRIIQTHEGA EVLEKVETMR GLAKTWRDQG AGRDPSTKQA ADQTFQNLAA YAKSFTDAEL FTVSRAFTHF LAIANAAESH HRGRRLKQSR LLSDESSGAL YPKPDSVGGV LPSLLAQGHD ADAIYDALTS QTTELVLTAH PTEVNRRTIL NKKRRIQRIL TMADQQRQLG ASSVFEQAEL NDALYREISS IWLSDEVSRI KPSPETEAEK GTLVLETVLW EAVPTFLRKL DATTREFLGK PLPLDSSPIR FASWMGGDRD GNPNVKPDTT RQVCLRNRQK AATLFARNLR TLEAELSLTT CSREVREVVG AAREPYRIFL QPMIRKMEAT TDWAAQELAI LQKRRSGDKS ASGIASVAST NVEGIYLDQE EFRAELLTIY RSLQETGNEV AASGILTDII RNLSSFGLTL IPLDVRQESD RHEEALDAIT RYLGLGSYIQ WDEQTRVSWL TTQISSKRPL LRAGVWYEHP DYFSPTAIDT LEISRMIAEQ HEGSLGAYVI SQATSASDVL AVLLLQLDAG VKKPLRVAPL FETLDDLNGA ADTMRQLFSL PAYMGTIGGK QEVMIGYSDS AKDAGRMAAT WAQYETQETL AKLAKEFGVD MTFFHGKGGT VGRGGNPQTF TAIMAHAPKT INGHFRVTEQ GEMISQNFGY ADRAERTMDI YTAAVLAEKL SERPKVKDEW RSMMKILSDI SCEAYRQVVR KDERFVPYFR SATPELELSN LNIGSRPAKR KATGGVESLR AIPWNFAWTQ TRFNLPTWLG VGDAIGQLLK SDRAPLLREL YREARAFQTM VDLVEMVLAK SEPAIAAHYD SVLVKDPKAK ELGKEVRQLH MATEEAILDL TEHKKLGENN AVLQRALVVR NPYVDCLNIL QVETLDRLRQ VEEGKEDKVL KDALLTTITG VANGMGNTG
|
| |