Gene PHATR_54178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_54178 
Symbol 
ID7204223 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp752089 
End bp755096 
Gene Length3008 bp 
Protein Length863 aa 
Translation table 
GC content52% 
IMG OID 
Productglycoprotein precursor 
Protein accessionXP_002186122 
Protein GI219113077 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTCA GTTTTAATCT CGGGGTGACG GCTTTTTTCA CCTACCCTCA CCCAGATGAT 
GTAGAAACCG AATCCCTTGG TTTCTCAGAG CGATCGATGG ATGAGGATAT GTTTCATGAG
GCCGAAAAAG GAGCTGACGA TATTTGGGCA TTATCGAACG CAAATGAGGA CGAAATAAGG
TGGGTTTCTG GCTTACATTG TCTTTGATTT ACAGTTAGTA AACTTCCTTC ATCCTCTCAC
GATAATCCCT TACTTTTAGA AGCGGGGTTG GAAGTTTCGG GAATTTGGCG ATGAAGGGGT
TTGGTGCATT GGGGAATGCT GAACCATTAG GGGAGCATGA GAACTTTTTC GATATGATCG
TAGATAATGA CAACAATGGC GCTTCGGGAG ATCCACCCGA CTTGGCTCCT TCTACTGGGT
CAGATATGGC TGGGCGAGAC GTATCTCAAA ATACAATGTT TGTTGCTAAC GATGGTGTAA
AAAAACTTGC CTCAGCTGTA GATGATGTCG CCAAAAGGCT CCTTGAGACA AACAACGACT
CCGACGATGC TCCAGTAGCG TCAAGGCCTG ATCTATGGAA TGAAATAAGC AACAGTTTTC
GCTTGGAATC TTTGAACACA AGTTTCAACG GAAGTTTTGC AAACATAGCA AGTGAGAGTC
TTCAAGAGTC ACGAAACTTT GTGGGCGCGT ATTGGATGGG AAGTGATCAG TAAGTATGGC
TGTTTGAAAC TGACTGTGAG TCTACGATGA ATTATTCATC TTCGTCCCTA ATGAATTTTT
CACCGTGTTC GTGCAGGCAT AGCGCGATTA TAGCCCATAT GGCTACAGAG GCAGCCAAAG
TCGCGGCAAA GCAAACCGCT TCTGGACTAA CAACTGCGGT GTTGACGACG AATAGCGCCG
GTGTATCTAT TTTTTTGAAG ATAATAGTTG GTGTGAGCTT CTTCTTCTTT TCGGCTGGGA
TCATTTTTAC CGGCGTATCG CTTCGCGACA GAATAGGACC AGAGGTTGTG GCCCAAGGCC
TTAAGATACC TACAAATTTG ACCGAAAAGT TGAACAAGAC ATTGCCCTCA CGGCCAACAA
GATCTCCAAC GATGTACCCT GCTGAGACCT ATTTTCCAAC ATCTTCACTT GGACCTTCCG
TGCAGCAGGT GGATTCACGC TTTCCATCTC CTTCTATCAC CTCCCAACCC TCTATTTCAG
GGGTTCCTAC TCAAAGTCAG TCGACTACAA ACAATTATAC TACATTAATG CCTACTCCAA
GCACCAATCC AGAAAACAAT CAGGTAGTAG GTACAAACTT ACCAACGGTA ACGATGAGTC
CGCTCGGACT TCAGCCAACT CGGCTGCCCA CTGTCCGCCA TTCATCGGCA GTGGAGATGG
GAAGTCTTCC TCCAACAGGA CGACTCGTCT CGAATTCAAA CAGCCCGTCG ACATTAGAGC
CGATAGCGAC CGAATCAACA CCAATGAATC CAAGCAATGC TCCCTCTGAC GAAGCGGCTG
AAGCGATAGT ACCAACGACT TTGCCAGCCA TGAAGCCCTC TTCCTTACCA ATAGGCGAGA
AAACCCTGGT GCCAAATGCA AGACCAAATA CGGTAAAAAC CCACGTTCCA GCAGCATTCA
CAAGCACCGT TCCCTCAACT TCAACGCCCG GGGAAAACCT ACTATCCAAT GACAATCCAA
ACAATACGCC TTCTCCGTCA TTGCAACCCA ACAAGTCATC ACCCGGCGAG CAAAGCAAAT
CTCCATCAAA TCTCATTAAC ACATTGAGTC CTTCCCAAAG AGATCAACCT ATTTCGGTAC
CAAACGCGCA GGACAGCCGG ACACCCTCTA TCGTGACCCC CAGCGGGTCG CCAAACGATT
CCTCGTCAAA CAATACCCGA CTCGGGTCCT AGCTCCAGCT CTGCACCCAA CGCAGCCGCA
AATGAGCCAT CTAGCATGCC GAGCCAGTCG CCATTACAAA GTCCGACTGA AAATCCTTCC
GCCGCCCCAG ACTCGGGGCC TAGCTCCAGC TCTGCACCCA ACGCAGCCGC AAATGAGCCA
TCTAGCACGC CGAGCCAGTC GCCATTACAA AGTCCGACTG AAAGTCCTTC CGCCGCCCCA
GACTCGGGGC CTAGCTCCAG CTCTGCACCC AACGCAGCCG CAAATGAGCC ATCTAGCACG
CCGAGCCAGT CGCCATTACA AAGTCCGACT GAAAGTCCTT CCGCCGCCCC AGACTCAGGG
CCTAGCTCCA GCTCTGCACC CAACGCAGCC GCAAATGAGC CATCTAGCAC GCCGAGCCAG
TCGCCATTAC AAAGTCCGAC TGAAAGTCCT TCCGCCGCCC CAGACTCAGG GCCTAGCTCC
AGCTCTGCAC CCAACGCAGC CGCAAATGAG CCATCTAGCA CGCCGAGCCA GTCGCCATTA
CAAAGTCCGA CTGAAAGTCC TTCCGCCGCC CCAGACTCAG GGCCTAGCTC CAGCTCTGCA
CCCAACGCAG CCGCAAATGA GCCATCTAGC ACGCCGAGCC AGTCGCCATT ACAAAGTCCG
ACTGAAAGTC CTTCCGCCGC CCCAGACTCA GGGCCTAGCT CCAGCTCTGC ACCCAACGCA
GCCGCAAATG AGCCATCTAG CATGCCGAGC CAGTCGCCAT TTCAAAGTCC GAGTGAAAGT
CCTTCCGCCG CCCCAGACTC AGGGCCTAGC TCCAGCTCTG CACCCAACGC AGCCGCAAAT
GAGCCATCTA GCATGCCGAG CCAGTCGCCA TTACAAAGTC CGAGTGAAAG TCCTTCCGCC
GCCCCAGACT TAGGTCCTAG CTCCAGCTCT GCGCCCAATG CAGCCGCAAA TGAGCCATCT
AGCACGCCGA GCCAGTCGCC ATTACAAAGT CCGAGTGAAA GTCCTTCCGC CGCCCCAGAC
TTAGGGCCTA GCTCCAGCTC TGCGCCCAAT GCAGCCGCAA ATGAGCCATC TAGCACGCCG
AGCCAGTCGC CATTACAAAG TCCGAGTGAA AGTCCTTCCG CCGCCCCAGA CTCAGGGCCT
AGCTCCAG
 
Protein sequence
MTVSFNLGVT AFFTYPHPDD VETESLGFSE RSMDEDMFHE AEKGADDIWA LSNANEDEIR 
SGVGSFGNLA MKGFGALGNA EPLGEHENFF DMIVDNDNNG ASGDPPDLAP STGSDMAGRD
VSQNTMFVAN DGVKKLASAV DDVAKRLLET NNDSDDAPVA SRPDLWNEIS NSFRLESLNT
SFNGSFANIA SESLQESRNF VGAYWMGSDQ HSAIIAHMAT EAAKVAAKQT ASGLTTAVLT
TNSAGVSIFL KIIVGVSFFF FSAGIIFTGV SLRDRIGPEV VAQGLKIPTN LTEKLNKTLP
SRPTRSPTMY PAETYFPTSS LGPSVQQVDS RFPSPSITSQ PSISGVPTQS QSTTNNYTTL
MPTPSTNPEN NQVVGTNLPT VTMSPLGLQP TRLPTVRHSS AVEMGSLPPT GRLVSNSNSP
STLEPIATES TPMNPSNAPS DEAAEAIVPT TLPAMKPSSL PIGEKTLVPN ARPNTVKTHV
PAAFTSTVPS TSTPGENLLS NDNPNNTPSP SLQPNKSSPG EQSKSPSNLI NTLSPSQRDQ
PISVPNAQDS RTPSIVTPSG SPNDSSSSSA PNAAANEPSS MPSQSPLQSP TENPSAAPDS
GPSSSSAPNA AANEPSSTPS QSPLQSPTES PSAAPDSGPS SSSAPNAAAN EPSSTPSQSP
LQSPTESPSA APDSGPSSSS APNAAANEPS STPSQSPLQS PTESPSAAPD SGPSSSSAPN
AAANEPSSTP SQSPLQSPTE SPSAAPDSGP SSSSAPNAAA NEPSSTPSQS PLQSPTESPS
AAPDSGPSSS SAPNAAANEP SSMPSQSPFQ SPSESPSAAP DLGPSSSSAP NAAANEPSST
PSQSPLQSPS ESPSAAPDSG PSS