Gene PHATRDRAFT_48886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48886 
Symbol 
ID7194962 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp581463 
End bp584387 
Gene Length2925 bp 
Protein Length813 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183513 
Protein GI219126540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.515683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCTCCATCA GTTTCGGATT CCCGCTGACA ATGTCTTGAC TGTGAAACGA TCTCTTTTCA 
TTTCCATAAT CACATATCAT CCGAACAAAG TGTAAGCAAA GCGATGCCGG ATACAATTAG
ACCGTCTCTC AATGGGAGAA TTTCATCATT CAAACAGTTG CTCGTGATGA TTTTACTAAC
AGTGAATGTG GGCCAAGCCG AAGAAGTCTC CTTCGCCAAC ACCACGGAAG TCGAAATGGA
GCTCGAACCG GCTGAAGCCG TCCTCTTTCC ATGGTTTGTC CAGGCATTAG GAGTTCTCAC
ATTCTTCCTT CTCAGTCGAT ACGTCAAGTG GCTTCCCTAC ACTGCCGTGT TGTTTCTTCT
TGGAACATTC ATGGGGTTGG CGACGGCCAA ATTTCAAAAT GACAATCGAC TGTCCCAGTC
CATTCTAGAG TTTTGGATAC CCATTGACAG CGAACTGCTT TTACTGGTTT TCTTGCCGGG
TTTGATTTTC AAGGACGCGT CGTCCTTAAA TGTGCACTTG TTTCAAGTTT CGATCGTACA
GTGTTTTGTG TTTGCCTTCC CAATGGTACT GGGAGGAGCC GTCTTGACTG CTTTGGTGGC
GTACTATATA TTTCCCTATG GCTGGTCTTT TGCCTTGGCC ATGACTTTTG GAAGCATTCT
GAGTGCAACG GTACGTATCG AGAATCGCCG ACCTGCTTGG CACAGAATAA AATCGCCATT
CAATCCGTGG TGAAGAACAC GTCCAAAAAG GAAGCTTATT TTTTCTGTAT TTGTCGCAGG
ATCCAGTCGC AGTGGCTGCT TTACTTGACT CCGTAGGCGC TCCACCACGC CTCAAACAGC
ATATTAGCGG CGAATCGCTG CTAAACGACG GAAGTGCTCT CGTATTTTTT GCACTTTTTG
CTGAAGTTTT CTATACCGAA CTCGGCGTTG AAGGTCTGGG AACCGACTAC AACTGGGGTT
CCGGAACGGC TAAGTTTCTG CGTATGAGTG GCGGTGCGTG CGCAGCTGGA CTATTTTTTG
GATTCGGCTT GATTTTGCTC CTATCAATTT TGGATAGACG ACTCAATCGA GAAGAAAATA
TTGTACAAAC TGCCGCTACC ATTACCGTGG CATACTTGTG CTACTATACA GCCGATGTGG
TGTGGAGTAC CAGCGGTGTC TTGGCCACAG TCGTGTGCGG TATTACGTAT CGGGCTTTTG
GAGATGCCTT GATCAATGAC AATCAGCTAA TTTGTGATTT CTGGGGCTTG GTCGAGCACT
TGTTGAATAC TGTCTTATTC GCGCTAGGTG GATTAGTATG GGGTAGTGTG ATCGCTAACG
CAGAAGAACG TGAAGGAGAA TTTACTGGAA GAGATTGGGG TGAGTAGTAC TGATCTGCCA
TTGATGCCTG ACTTGTTGAT ACAACAGGCT AACAGCAGAC TTTGTTTTCC AAGGCTATTT
GATTATATTG TACATTTTGT TGATTATAAT TCGATTCGCT CTCTTTATCG GCGCGTATCC
GCTCATTTCC AGGATTGGGC TCAAATCAAG CAAGCCTGAA ATGATCTTTC AGGCCTTTGG
AGGCCTTCGC GGTGCTGTGG GAATCAGTTT GGCAATCGTT TTAGACAATA CAGTGCGCGA
GGCGGCTGAA GAGGGAGACT TCAAGTATGT TGGTCAGACC AACAAAGTGT TTGGATTTGT
TGGTGGAATT GCTTTTATGA CGCTCTGTAT TAATGCCACT GTAGCTGGTC CTCTGTTACG
GCGATTAGGC CTGGCTGACA CAACAGCCAT TCGAAAAAAG ATAATTGAAA GCTACAAGCT
CCACTTGCGC TACGAAACAA TTGAAGAGCT AATCCGTTTG CTAGCTCAGC CTCGCTTTGC
CAAGATCAAT TTTGCCCTGA TTCGGGACCA TGTTGATTGG TTGAAAGATC TTAGACAGGA
CGAAGTGTTA AAGGCCTACA AGGATTACCG GAATACTCAT AATCACGAGA AAAACTATCG
TGATCCAAAC TTGTCTAAGG TTTCACCGTA TTTGGAAGAC AACGAAAAAG ATCTGGAAAA
ACAAATGGCA GAACATCAGA AAGAAAACAT CGGCATATCA AGTGACAAAA ACTCTACCAA
AGTGAGGATA GCCAAGGTCA CATCCAGTAT GTCGTTGATA GAACTTCGCA CGGTATTTCT
AGAAATTTTA CGTAGCGCAT ACGCTAGACA AGTGGAACTC GGTGAGCTCT ACAACCGTCA
GTTTCTCGCC TTCTCTTTGG AACAATCAAT TGATTTTGCC CTCGACTCTG TCTCAAATGG
ATCCGAGTTG AATGATTGGG AGTATGTGAG CGTCGTAAAA GCACCTTGGT CAACATCGGT
TTACACTTCG AAAGGAATGA AGTATTTTCG AAAATGTTTC GGAGCTTTTG TCCTGCAAGA
TGTGAAGTAC GAAATGATGC GACTGAACGT GGAGCGATGT CTGGCATTTC TACATGCACA
CGATACTGCG CAGAGGCTAT TGAGTCAGCA GTTCTTGGAT GAGCAATTCT CGGAAGAAGA
GTCAAAGGTT ATTGCCGAGT CAAGACGTCA GTGTGTAGAG GCTGTCAAGC TATTAAAGTC
GTACCACATG AGAGATGTTG AGATGATTGT GTCGCACAAC TTGTGCACGG TTCTCTTGTA
CAATTCATCT CGTTGCGTGG AAAAGCTGCA TAGAAAAGGT CTTCTCAAGG GCACAGAAGC
CGAGACTATA CTGGAGAAGA TTCAAGAATC GCTTCAGCGT GTTTACGCTT GCAGAGAGAG
GGACCATCCA GGGGAGCTCC CTGTAGACAG CGATCTAATG TCGGAGAAAG ATGTTGACGA
AATGGCACCG GCTTCTGGTT AAGGATCGCT GGTCCAGATT TGCTGCTGCT AGTGTGGTCA
TATAATCTTG TTACTCGTGG ATCATCAAGC GTACCGCCTC CAATG
 
Protein sequence
MELEPAEAVL FPWFVQALGV LTFFLLSRYV KWLPYTAVLF LLGTFMGLAT AKFQNDNRLS 
QSILEFWIPI DSELLLLVFL PGLIFKDASS LNVHLFQVSI VQCFVFAFPM VLGGAVLTAL
VAYYIFPYGW SFALAMTFGS ILSATDPVAV AALLDSVGAP PRLKQHISGE SLLNDGSALV
FFALFAEVFY TELGVEGLGT DYNWGSGTAK FLRMSGGACA AGLFFGFGLI LLLSILDRRL
NREENIVQTA ATITVAYLCY YTADVVWSTS GVLATVVCGI TYRAFGDALI NDNQLICDFW
GLVEHLLNTV LFALGGLVWG SVIANAEERE GEFTGRDWAD FVFQGYLIIL YILLIIIRFA
LFIGAYPLIS RIGLKSSKPE MIFQAFGGLR GAVGISLAIV LDNTVREAAE EGDFKYVGQT
NKVFGFVGGI AFMTLCINAT VAGPLLRRLG LADTTAIRKK IIESYKLHLR YETIEELIRL
LAQPRFAKIN FALIRDHVDW LKDLRQDEVL KAYKDYRNTH NHEKNYRDPN LSKVSPYLED
NEKDLEKQMA EHQKENIGIS SDKNSTKVRI AKVTSSMSLI ELRTVFLEIL RSAYARQVEL
GELYNRQFLA FSLEQSIDFA LDSVSNGSEL NDWEYVSVVK APWSTSVYTS KGMKYFRKCF
GAFVLQDVKY EMMRLNVERC LAFLHAHDTA QRLLSQQFLD EQFSEEESKV IAESRRQCVE
AVKLLKSYHM RDVEMIVSHN LCTVLLYNSS RCVEKLHRKG LLKGTEAETI LEKIQESLQR
VYACRERDHP GELPVDSDLM SEKDVDEMAP ASG