Gene PHATR_18469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_18469 
Symbol 
ID7203970 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp209917 
End bp211915 
Gene Length1999 bp 
Protein Length547 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186017 
Protein GI219112867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0427076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAATAGACG CGCACTTCAC CAACGTGAGC TTTCATTGGG TACCTGTTCA ATGTTATACC 
GCTGGTGTTG GTGTGGTTGA CAGTTTTTCG GGAAGTGCGT TTTTCCAAGC GGACGAACAC
ATGAGCCAAA ACCAAGGAGT ACGGAGAACG TTTTCGTCTC GATCGAATTC GAACGCTGGC
GGACAAGACA GCAATCTTTT AGCAAAGACG GACGCTGTGG ACGAAGGGCA TCTATCCAAA
GGATTTTCGG CAGCCTCGAC ACTCTCGGCA CGAAACAATA GTGTGAATAT GCCTCTCCCA
ACCTTGCCGG CCTTAGCCGA CAGCTTTGAA CCGGCTCAAT GCGACGGAAA GAACCCTGCC
ACTTCGCAGA TAACAGGACG AGCCGGTCCC TCGTGGTTTG TCATTCAAGT AACTTTATTG
GCCAGTTTGG GTGGTATACT CTTTGGATAC GATTTGGGTG TTATATCTGG AGCCCTTCCA
CAATTGACAT CCTACTTTGA CCTACAAAGC GCGCAACAAG AAATCGTAGT TTCCGTCTTG
TACGTTGGTG GAGGATTCGG CGCCGCCTTG GGAGGAGCTC TCTGCGATAC CTACGGACGC
AAAAGTACTA TTCTTGTCAC GGATGTATTG TTCCTGTTGG GGGCAATAAT ACTGTACGCG
GCAGCCTCGT ACGGGATGAT AATTTGTGGG AGAATTGTTG TGGGATTCGC CATTGCTGTC
TCAGGGATTG CCGATGTTTC GTACCTTCAT GAAATTGCAC CAATACAGTA CCGTGGCTCG
ATTGTTTCCG TTAACGAAGC TTGCATTGCA CTGGGGTTCT TACTTGCATT TGGTGTCGGT
GGATGGATGT CTCGAGAGGA AACCAATAAC GAGGGATGGC GAGGTATGTT TGGGATAAGT
GGTGTGGTTG CCTTTATCCA GCTCATCGGA ATGTGGACCA TGCCGGAATC GCCCACATGG
CTAAAGGATC GGGGATTACA TCGAGAAAGT GAGGCAGTGT TGCGGCGCAT TTATCCGGAA
CCATTTGTTT CCAATTTCGT CCATAGTGAC GCCAATTCTG GTCCCGAAAA TAAGGTTGTT
TCCAGCATAA TGTACGAAAC ACTTTCACCC AAACCAAGGA AACCCTCCGT GCATCCATCA
GTGTCCTCTC TTGAAGCAGG AACTTCGCCT TCACTATCGG TCAATGCCGG CTTCTTGGCC
AAATCGACAT ATGCTTGTCG GTACTCTCAT TACCTATGCA CCCAATTGAA AGCATTCGCC
GTGACTTCCA TGCATACCTA CCGAAGGCAA GTCTACATTG CCCTTTTCTT AGCAGTCTGT
CAACAGCTAT GTGGCCAAAC GAACGTGCTT AGCTATGCAC CCTTGATCTT CGCTGGAGGA
AACGCATCGA AAAGCGGAGA TTTCGTCCGA GGATGGGCGA CCCTTTCGAT AGGCATCGTC
AAATTTGCCG TTTCGTGTGT AGTCATCTGG AAAGTTGATG CGCTGGGACG ACGACATTTA
CTACTAGCTG GATTGGGAGT GGTCGCAGTG GGTTTGCTTT TCTTGAGTAT TGCTTTTCGC
GGAGCCGAAG TCTCTGACAA GCCGGTAAAA GGCGGCGATG AGCCGACCAC TACCTTGATT
GACGAAGGCG ACCGTGCCTT CTCACTCGCC TTACCAGGCG TATTGCTTGT TGTAACAGGA
TATTCCATGT CCTTTGGGCC GCTCACATGG CTGCTAACAT CGGAACTATT TCCGACCGAT
ATTCGAGGAC GAGCTTTAGG AGCAAGTACG ATCATCACTT ACTTTTGTGC ATGGGTCGTG
ACGAGCACTT TTTTATCCGC GCAAGAATGG CTGGGTGCTA GCACCGTCTT CACGATGTAT
TTTCTCGTTA CAGTGGCAGG ATTCCTCTTT GCGATAAAGG CAATTCCGGA TACCGGCGAG
AAAAGCACCA GAGAGATCGA TGACAGTTTG GATCAAATGG CGTGGTGGCG TCCGCGGAGA
AATGACGTCT CGCGCACTC
 
Protein sequence
MSQNQGVRRT FSSRSNSNAG GQDSNLLAKT DAVDEGHLSK GFSAASTLSA RNNSVNMPLP 
TLPALADSFE PAQCDGKNPA TSQITGRAGP SWFVIQVTLL ASLGGILFGY DLGVISGALP
QLTSYFDLQS AQQEIVVSVL YVGGGFGAAL GGALCDTYGR KSTILVTDVL FLLGAIILYA
AASYGMIICG RIVVGFAIAV SGIADVSYLH EIAPIQYRGS IVSVNEACIA LGFLLAFGVG
GWMSREETNN EGWRGMFGIS GVVAFIQLIG MWTMPESPTW LKDRGLHRES EAVLRRIYPE
PFVSNFVHSD ANSAFAVTSM HTYRRQVYIA LFLAVCQQLC GQTNVLSYAP LIFAGGNASK
SGDFVRGWAT LSIGIVKFAV SCVVIWKVDA LGRRHLLLAG LGVVAVGLLF LSGDEPTTTL
IDEGDRAFSL ALPGVLLVVT GYSMSFGPLT WLLTSELFPT DIRGRALGAS TIITYFCAWV
VTSTFLSAQE WLGASTVFTM YFLVTVAGFL FAIKAIPDTG EKSTREIDDS LDQMAWWRPR
RNDVSRT