Gene PHATR_46850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46850 
Symbol 
ID7204700 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp620020 
End bp622396 
Gene Length2377 bp 
Protein Length745 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185748 
Protein GI219121033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00101234 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGCGG TCGCATTGAC TGCGCTCGTC GTACTGACAA CGACCGAAGC CTTTCAGACA 
CCTCGCTTTC ACTCGCTCGA CGTTACGAGT AAGCTATTTT CCGCTCGAGT ACAGGATGCG
AGTTCGGTGA TGCAAGATGT ACGGGCAGAG CTGGCCAAGA ACGAAGACGC CAATCTCATG
CTGCAAGCTT TGCGGGGACA AAACTTGAAC GACGACGACT CCGCTGTCGC TGGACTCCAG
ATGCGTCTGG TGGATATTGC ACCCAAAGAA AACGAGGCTC TACCCTTTGA CTACAACCCC
CAGGCACTGA AAGAGTTCTT TTCGAAGCGA CCTCTTGCTG TTGCTACTCG AATTCTGCAG
TTGCTGTCGG TAGGAGGAAT CTTCGCGTTC AACACCATTT TTGATCAGCT TCTGGGCCGT
GTCAAGAACA ATCCGGATTT GGAAGTTCAA CGAGCGGCGG AACTTCGCGA TCTGATTACT
TCTCTGGGTC CCTTCTTTAT TAAGATTGGG CAAGCGTTGA GTATCAGACC TGATGTGTTG
AGCCCACGTT CAATGGTCGA ATTGCAGAAG CTATGCGACA AGGTCCCCTC GTTCGATTCA
ACGATAGCCT TTGCAACAAT AGAAGCAGAG TTGGGACGTC CGGTGGAAGA CATATTTTCT
GAGATCACCC CTGAACCGGT GGCGGCCGCT AGTCTTGGAC AGGTATACAA AGCTGTTTTG
CGGGATACTG GCGAAACAGT GGCCGTGAAA GTCCAAAGGC CTTCGGTTTT GGAAACCGTT
TCTCTGGATT TGTATCTTGC CCGCGAACTC GGTATACTTG CTCGGAACAT TCCCGCACTG
ACAGATAGGT TAGACGCTGT CGGTTTACTG GATGAGTTCG CGTTTCGCTT CTATCAAGAG
CTAGACTACA ATCTAGAATG TGAAAACGGT ATTCGTATTG AGAAAGAAAT GCGTGTTTTG
CCGATGGTAG TGATTCCCAG AAACTACCCG CAGTACACAG CGCGACGAGT TCACGTGGCG
GAGTGGATTG AGGGTGAAAA GCTTTCACAA AGCAAGGCCG ACGATGTCGG AGCTTTGGTC
AACTTGGGTG TAATTACCTA CCTGACACAG CTTCTCGACT CGGGCTTCTT TCATGCTGAC
CCTCATCCTG GTACGTTCTC TGAAAAGTTT CTGTATACAT CTGTTCATCA GTTTTCTCAT
TTCCAAATTC TTGTATAGGG AATATGATGC GCACTACTGA TGGTAAACTA GCGATTCTCG
ATTTTGGGCT AATGACTGAG GTCACAGACG ATCAAAAGTA CGGGATGGTA GAAGCTATCG
CTCATCTTCT CAATCGTGAC TACACAGAAA TTGGACAAGA TTTCATCAAT CTCGACTTTA
TTCCCAAAGG GACCGATACA ACTCCTATTG TCCCTGCATT GACGAAAGTG TTTGATGTTG
CTCTGGCAGG TGGCGGGGCT AAGAGTATCA ACTTTCAAGA ATTGTCGGCC GATTTGGCAC
AGATTACATT CGATTACCCA TTCCGCATTC CTCCATACTT TGCATTGGTC ATCCGGGCCA
TTTCTGTTTT GGAAGGAATT GCTTTAGTCG GAAACCCAAA CTTTGCCATT ATTGATGAGG
CCTACCCATA TATTGCTCGG CGCTTAATGA CTGATCGATC ACCGCGTCTC CGTGCTGCTT
TGCGCTACAT GATATATGGT CGTGAAAATG AGTTTGATGC AGAAAATGTT ATTGATTTAC
TGCAGGCAGT AGAAAAATTC TCTGCTGTAA GGAATCAAGG TGATGGAACA GCTTACAAGG
TGGACGGTGT TCGAGGATCG AAGGCGGTTG GTTCTGCTGG AGATTTTCGT GGATCACAGC
AGGTGGATAC TAGCGACCGA AACACAAATA TCGACGGCGG ACGATTTCGT ATATCGTCGG
AAATGGGTGT GAATGATGTC GGCGAGCTGG CAACGGCTGA CTCCCGAGAT CCACTCCAGG
TGGTGGAAGC CAAAAATGAT GAAAGAACGG TTCGGGAAGG CTTAAGATTC TTCTTTAGTC
CGGAAGGTGA GCCCTTCCGG GAATTCATGC TGGAAGAAGT TGTCACAGTG GTTGACGCAT
CAGGACGTCA GGCCGTACAG GAATTGTTTC GTCGCGTAGG TCTCGGAAAC GTGCCGGTTC
CTGCTTTTTT CCGCAGACTC AGTCCGGAGC TTACCGATGC CGACCGTCGT ATAGTCCAAC
AGATTGGCAA ACTAGTACAG TTTCTTTTGG GGGACTTTGA AGGTACCGTG AACAATTCTG
ACACGAGCGC CCGGGTCCGT AAGCTTATCC CGGTGATACG GGAGTACGCC CCACAGTTAC
GGGATTTTGG GACTTTGTTG GTAGCTCGGT TAACTGA
 
Protein sequence
MYAVALTALV VLTTTEAFQT PRFHSLDVTS KLFSARVQDA SSVMQDVRAE LAKNEDANLM 
LQALRGQNLN DDDSAVAGLQ MRLVDIAPKE NEALPFDYNP QALKEFFSKR PLAVATRILQ
LLSVGGIFAF NTIFDQLLGR VKNNPDLEVQ RAAELRDLIT SLGPFFIKIG QALSIRPDVL
SPRSMVELQK LCDKVPSFDS TIAFATIEAE LGRPVEDIFS EITPEPVAAA SLGQVYKAVL
RDTGETVAVK VQRPSVLETV SLDLYLAREL GILARNIPAL TDRLDAVGLL DEFAFRFYQE
LDYNLECENG IRIEKEMRVL PMVVIPRNYP QYTARRVHVA EWIEGEKLSQ SKADDVGALV
NLGVITYLTQ LLDSGFFHAD PHPAILDFGL MTEVTDDQKY GMVEAIAHLL NRDYTEIGQD
FINLDFIPKG TDTTPIVPAL TKVFDVALAG GGAKSINFQE LSADLAQITF DYPFRIPPYF
ALVIRAISVL EGIALVGNPN FAIIDEAYPY IARRLMTDRS PRLRAALRYM IYGRENEFDA
ENVIDLLQAV EKFSAVRNQG DGTAYKVDGV RGSKAVGSAG DFRGSQQVDT SDRNTNIDGG
RFRISSEMGV NDVGELATAD SRDPLQVVEA KNDERTVREG LRFFFSPEGE PFREFMLEEV
VTVVDASGRQ AVQELFRRVG LGNVPVPAFF RRLSPELTDA DRRIVQQIGK LVQFLLGDFE
GTVNNSDTSA RVLTGFWDFV GSSVN