Gene PHATRDRAFT_27976 details

Gene Information

Locus tag: PHATRDRAFT_27976
Symbol: PEPCase_1
ID: 7201832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 553748
End bp: 556971
Gene Length: 3224 bp
Protein Length: 1009 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181027
Protein GI: 219120583
COG category:
COG ID:
TIGRFAM ID:
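
The Start bp, End bp and Gene Length values agree if the coordinates are read as 1-based and inclusive, so that the length is end minus start plus one. The Python sketch below shows that arithmetic. The function name span_length is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the values in this record, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # Coordinates copied from the Start bp and End bp fields of this record.
    print(span_length(553748, 556971))  # 3224
```

Under a 0-based, half-open convention the same span would be written with a start of 553747, so it is worth checking which convention a downstream tool expects.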


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.205513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTGCGTTCT GTTCCTTTCT GCAAAGAATC CTTGCGTACG TGAGTTGCGA CTCACATCAA 
GCATCAGTGA AACGAAGTGA CACAAGTCGA TTCATCGTAA GAATGTTGTC GTCTTCCTGC
CGTCGAAGTT TCCTTGCGGC GAAGACTCGG TTGCGCTCAT GCGTGACCAC GTCGTTGTCG
ACGGGTTGTC CGTGGAGTGC CATTTCCAGC GGATCCACAA GTCGCCATAT CGATCGGTTT
TTTTCGACCC ACAGTTCCTT CGATGAACCC AACCCGTCCT TGTTTGGTGC TTCTCCATTG
CAAGCGTCGA CGGTATCCAG CGATGCTACT TCGATCCCTT CCAACGAAGC CGATCGCGAT
ATTCAATTGC GAGCAGACAT TAAAGTCATG GGTAGTTTAC TGGGACGAAT CATTCAAACG
CACGAAGGCG CGGAGGTACT GGAAAAGGTC GAAACCATGC GCGGCTTGGC CAAGACCTGG
CGCGATCAAG GGGCAGGCCG CGATCCCAGT ACGAAGCAAG CCGCTGACCA AACCTTTCAA
AACCTCGCCG CGTACGCCAA GAGCTTCACC GATGCGGAAC TCTTTACCGT TAGTCGGGCC
TTTACGCACT TTTTGGCCAT TGCGAATGCG GCCGAATCGC ATCATCGTGG ACGGCGTCTG
AAGCAATCAC GCCTTCTTTC GGACGAGTCG TCGGGAGCGC TCTATCCCAA GCCGGACAGT
GTTGGAGGGG TTCTTCCTTC TCTGCTCGCT CAGGGACACG ATGCGGACGC GATCTACGAC
GCCCTCACGT CGCAAACCAC CGAGCTTGTT TTGACAGCCC ATCCGACTGA AGTTAATCGG
CGCACTATTC TCAACAAGAA GCGCAGGATC CAGCGCATTC TCACCATGGC CGATCAACAG
CGTCAGCTTG GTGCCTCTAG CGTCTTTGAG CAAGCCGAAC TCAATGACGC CTTGTATCGG
GAGATCTCCA GTATTTGGCT ATCTGATGAA GTCTCTCGTA TCAAACCATC TCCAGAAACG
GAAGCTGAAA AAGGAACGCT CGTGTTGGAA ACGGTGTTGT GGGAGGCCGT ACCGACCTTT
TTGCGTAAAT TGGACGCCAC CACGCGCGAG TTCCTCGGTA AGCCTTTGCC TCTCGATTCG
TCCCCGATTC GGTTTGCATC CTGGATGGGC GGAGATCGGG ACGGGAATCC TAACGTGAAA
CCGGATACGA CCCGGCAAGT GTGCTTGCGT AATCGTCAAA AGGCAGCCAC CCTTTTTGCC
CGTAATTTGC GAACGCTCGA GGCGGAGTTA TCCTTGACCA CGTGCAGCCG CGAAGTCCGG
GAAGTGGTAG GCGCTGCCCG AGAACCTTAC CGTATATTCT TACAGCCCAT GATTCGGAAA
ATGGAAGCGA CCACTGATTG GGCCGCCCAA GAGTTGGCGA TTTTGCAAAA ACGTCGTAGC
GGTGACAAGA GTGCCTCGGG TATTGCATCT GTCGCTAGCA CCAACGTGGA AGGCATCTAC
CTTGATCAGG AAGAATTCAG GGCGGAACTG CTCACAATCT ACCGCTCTCT ACAAGAAACA
GGAAACGAAG TGGCTGCCAG CGGCATTTTG ACAGATATTA TTCGGAATCT TTCCTCCTTT
GGGTTGACGC TCATTCCTTT GGACGTCCGC CAGGAAAGCG ACCGCCACGA AGAAGCCCTA
GACGCTATTA CCCGGTACCT CGGATTAGGT AGTTACATAC AGTGGGATGA ACAGACGCGC
GTTAGCTGGT TGACAACTCA AATTTCGTCC AAACGCCCAT TGCTTCGAGC AGGAGTCTGG
TACGAACATC CGGACTACTT CTCGCCAACC GCAATTGATA CACTAGAAAT CTCTCGAATG
ATTGCCGAAC AGCACGAAGG GAGTTTGGGG GCCTACGTCA TTAGTCAAGC GACCAGTGCA
AGCGATGTCC TTGCCGTGCT CTTGCTGCAA TTGGATGCTG GTGTCAAAAA GCCTCTTCGT
GTCGCGCCTC TTTTTGAAAC TTTGGACGAT CTGAACGGCG CCGCTGATAC AATGCGACAG
CTGTTCAGTC TTCCTGCGTA CATGGGTACC ATAGGTGGGA AGCAAGAGGT CATGATAGGT
TACTCCGATT CCGCGAAAGA TGCCGGTCGA ATGGCGGCAA CGTGGGCTCA ATATGAGACA
CAAGAGACGT TGGCCAAGCT CGCCAAAGAA TTTGGAGTCG ACATGACGTT TTTCCACGGG
AAGGGTGGTA CCGTTGGCCG TGGTGGTAAT CCGCAAACCT TCACAGCCAT TATGGCACAT
GCGCCGAAAA CGATCAACGG GCACTTCCGC GTAACCGAAC AAGGTGAAAT GATCAGCCAG
AACTTTGGAT ACGCAGATCG CGCCGAACGT ACAATGGATA TTTACACTGC TGCGGTCCTG
GCCGAGAAGC TGAGTGAACG ACCGAAGGTC AAAGACGAAT GGAGAAGTAT GATGAAGATC
TTGTCGGATA TTAGCTGCGA AGCCTACCGC CAGGTCGTAC GCAAAGATGA GCGCTTCGTA
CCCTACTTTC GCTCCGCTAC CCCCGAACTA GAACTCTCGA ACCTCAACAT TGGATCGCGG
CCCGCCAAAC GAAAGGCGAC GGGAGGTGTC GAAAGTCTTC GCGCTATTCC TTGGAACTTT
GCTTGGACCC AGACTCGATT CAATCTACCC ACGTGGTTGG GCGTTGGCGA TGCCATCGGA
CAACTGCTAA AGAGCGATAG AGCTCCTTTA CTCCGGGAAC TCTATCGTGA AGCGCGCGCC
TTTCAAACCA TGGTGGATTT GGTCGAAATG GTCCTGGCAA AATCCGAGCC CGCGATTGCC
GCTCACTACG ACAGTGTTCT CGTCAAGGAC CCCAAGGCGA AAGAACTAGG TAAGGAGGTT
CGTCAACTTC ACATGGCGAC GGAAGAGGCA ATTCTAGATT TGACGGAACA CAAAAAGTTG
GGCGAAAACA ACGCGGTGCT TCAGCGTGCT CTGGTTGTGC GCAATCCCTA CGTAGATTGC
CTGAATATTT TGCAAGTCGA GACCTTGGAT AGGCTCCGGC AAGTGGAAGA AGGGAAGGAA
GATAAGGTCT TGAAGGACGC GCTCCTCACG ACCATTACAG GGGTTGCCAA TGGAATGGGC
AACACTGGTT AAAATACTTG CATGTCTGAT CTATGGAATG ATTGTAAAGG ATTACATTTA
CTGTTAGCGG CCAAAAAAAG GTGGACCTCC TACGGAGCTA GTAT
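
The sequence above is printed in groups of ten bases, so the spaces and line breaks need to be removed before it can be measured. As a rough sketch, the Python below strips that grouping and computes a GC percentage of the kind reported in the GC content field. The helper names clean_sequence and gc_percent are hypothetical, and the input shown is a toy string, not this gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in sequence if base in ("G", "C"))
    return 100.0 * gc / len(sequence)


if __name__ == "__main__":
    # Toy input for illustration only; paste the full gene sequence block here instead.
    toy_block = "GGCC AATT\nGCGC ATAT"
    seq = clean_sequence(toy_block)
    print(len(seq), gc_percent(seq))  # 16 50.0
```

Applied to the full block, len(seq) can be compared with the Gene Length field and the rounded percentage with the GC content field.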
 
Protein sequence
MLSSSCRRSF LAAKTRLRSC VTTSLSTGCP WSAISSGSTS RHIDRFFSTH SSFDEPNPSL 
FGASPLQAST VSSDATSIPS NEADRDIQLR ADIKVMGSLL GRIIQTHEGA EVLEKVETMR
GLAKTWRDQG AGRDPSTKQA ADQTFQNLAA YAKSFTDAEL FTVSRAFTHF LAIANAAESH
HRGRRLKQSR LLSDESSGAL YPKPDSVGGV LPSLLAQGHD ADAIYDALTS QTTELVLTAH
PTEVNRRTIL NKKRRIQRIL TMADQQRQLG ASSVFEQAEL NDALYREISS IWLSDEVSRI
KPSPETEAEK GTLVLETVLW EAVPTFLRKL DATTREFLGK PLPLDSSPIR FASWMGGDRD
GNPNVKPDTT RQVCLRNRQK AATLFARNLR TLEAELSLTT CSREVREVVG AAREPYRIFL
QPMIRKMEAT TDWAAQELAI LQKRRSGDKS ASGIASVAST NVEGIYLDQE EFRAELLTIY
RSLQETGNEV AASGILTDII RNLSSFGLTL IPLDVRQESD RHEEALDAIT RYLGLGSYIQ
WDEQTRVSWL TTQISSKRPL LRAGVWYEHP DYFSPTAIDT LEISRMIAEQ HEGSLGAYVI
SQATSASDVL AVLLLQLDAG VKKPLRVAPL FETLDDLNGA ADTMRQLFSL PAYMGTIGGK
QEVMIGYSDS AKDAGRMAAT WAQYETQETL AKLAKEFGVD MTFFHGKGGT VGRGGNPQTF
TAIMAHAPKT INGHFRVTEQ GEMISQNFGY ADRAERTMDI YTAAVLAEKL SERPKVKDEW
RSMMKILSDI SCEAYRQVVR KDERFVPYFR SATPELELSN LNIGSRPAKR KATGGVESLR
AIPWNFAWTQ TRFNLPTWLG VGDAIGQLLK SDRAPLLREL YREARAFQTM VDLVEMVLAK
SEPAIAAHYD SVLVKDPKAK ELGKEVRQLH MATEEAILDL TEHKKLGENN AVLQRALVVR
NPYVDCLNIL QVETLDRLRQ VEEGKEDKVL KDALLTTITG VANGMGNTG
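
The Protein Length and Gene Length fields can be cross-checked with simple codon arithmetic. The sketch below assumes an unspliced gene (the record lists Is gene spliced as No), three nucleotides per residue, and one stop codon. The function name cds_length_nt is hypothetical. Any difference between the result and the gene length would then correspond to bases outside the coding region, if the record counts them in the gene span.

```python
def cds_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    if protein_length_aa < 0:
        raise ValueError("protein_length_aa must be non-negative")
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    gene_length_bp = 3224     # Gene Length field
    protein_length_aa = 1009  # Protein Length field
    cds = cds_length_nt(protein_length_aa)
    # Coding nucleotides, then bases of the gene span not accounted for by the coding region.
    print(cds, gene_length_bp - cds)  # 3030 194
```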