Gene PHATRDRAFT_50510 details

Gene Information

Locus tag: PHATRDRAFT_50510
Symbol:
ID: 7199288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 266979
End bp: 268922
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002185459
Protein GI: 219130619
COG category:
COG ID:
TIGRFAM ID:
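
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 266979
end_bp = 268922

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon, which is
# not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1944 647
```

Under these assumptions the computed values agree with the reported Gene Length (1944 bp) and Protein Length (647 aa).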


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTCGA CTCAAGAACA AGGGGAGGCC GTGCTAGCTA GAAGTATGCT GGAGAAAGCC 
ATCGTCACGA GCGTATGTCG ATTTTGCTTG GTGGGCGCCG CGCTATTGTC TGCTCAATAC
TGCCAGGCGG CGGCAGTGGA GGCATCGAGT CTCGAGCATC TCTACCGACA AGCTCGGCAC
GTACGAACAA CCGAGGGTCA CGCCGCCGCC GTGCCACTCT ACCAATCTAT AGTGCGAGAA
TACAATCCTT CGGACCGCAC GGCAGCCTCC CGTATAGCTG CCTGTTCCGT TTCCGGAGTG
CACCATGAAA ATCTGTGCCG CGTGCCTGAT GCTGACAAAC TTTTCGCGCT ACGATCGCTA
CTGTACCGTT CGCGATTTAC AGCGCGGGCG ATCGCGTCTC TCTTTGGTAT CACAAGCTCC
GATCATAAAC TCTTTCACGC TTCTTGTCCC ATCTATTTGA CACCGTCATC GGCCGGTGTT
ACAACTGTAC CGTCGTTGGA TTGTGGTACC AACGATGCCA ATGTATTGCC CATCAAGAGT
CTGGCTACTC TCTTTCTCTT GGGATTGGCG GTTCCACGGG AAGCACTCGC ACTTGCTCTG
ACTCCCGAGG GTGTCCAGAT ACTGCAAGAC TTGCATCTGA TTGCTCCTTG TGAAATAGAT
AGCAACTTAC TGGTTCCGTA CGCTCAAATC TTTCCCATCG ACTTGCAGAG CGACCGAACC
TTATATATTG TCACAGATTG GCACCCGCGC GTTCTCTCGA CCACGAAGGT CGGTACCTGC
TCGGACGTCA GCAACGCCGT CATGTATATC GGACCCGATA GTTTGGCACT AGTACAGCAT
TGGCTACAAA GTTCTCGTAT TCCCTCCTGT GGGAGCTTGC TAGACCTCTG CACGGGGTCG
GGAGTGCAAG CCTTGGCTGC CTTGACAATG GAAAAGGCGA ATCAAGCGGT ATGCGTTGAT
CTCAACCCCC GTGCGCTGCA AATGACAAGG CTCAATGCTA TTTTGAACGA CTTGGATACT
AAGGTGCAAT GTGTGTTGGG TGATTTGACC TCGGACGTTG GAAGGATATA CACTAACAGT
GAAGGCAGCC ATGATCTCGC CATTGACGAC AAAGCCCAAC CTTTATTGGA TGTACTCCGT
CGAATTTCCC CTCGATTTGA TCTAATCACG GCGAATCCAC CCTTTTTGCC CGTACCACCA
GAAATCACAC AGGCACGGCA CGGTCTGTTT TCTGCGGGAG GTCCCTCCGG CGAGGCCGTG
CTCGCTAGTA TCGTGCAACT GAGTTCCTCC TTGCTCTCCA ACACCGGCTT TCTCGCCATT
GTGTCCGAGT TTTTCTTGAA AAGTGTTGAG GCTGCTTACG TGGCACCTTG CAGTGTCAGT
AGTAGGGGTA GCGAAGAATC AGCGAACGAG CCGGGAGTAC CGTCCTCTCC CGAGGTCTTG
CTGGATTCCA ATTTTGATTT ACTCCCCCAG CACGATGCAC TGGACCGTCC TGCTGACGAG
CTCCTGTCTC GCATAGAGTC TTGGTGGAAC AACAATCAGC ACGATCCTAA GCATGACCTC
TCCAGCGTAG TCGCGTCATC TACCACGAGT ATCGCTACCA CTTGTAGTAG TACCAGGGCT
CGTGGACTCC TCTTGACGAA CGAATATCCC ATTACTGCCG ACCTGTATGC CGAGCGACGG
GCCGACAATG CGGAAGAATT TGCCATTTGG CAACGCCACT TGCAAAGTTT GGAAATTGGT
GCTTGCTCCC CGGGCTTTTT CATTCTGCAA AAGATGCACC CCGCGGTCGA CGGAGAAGTA
CCGGCATCGT CGTGCTTTGC GCACCAAACC GTGCCGCAAA CGTCCTGGGG GTCCCTGTGG
ACGCCGTCAA ACCCGCAAGC CGTGGCGTAC ACAAACCGAG TTCTGTCTGA TTTTTTTACG
AGGCCAAATT TGGAAGAGGG GTAA
 
Protein sequence
MKSTQEQGEA VLARSMLEKA IVTSVCRFCL VGAALLSAQY CQAAAVEASS LEHLYRQARH 
VRTTEGHAAA VPLYQSIVRE YNPSDRTAAS RIAACSVSGV HHENLCRVPD ADKLFALRSL
LYRSRFTARA IASLFGITSS DHKLFHASCP IYLTPSSAGV TTVPSLDCGT NDANVLPIKS
LATLFLLGLA VPREALALAL TPEGVQILQD LHLIAPCEID SNLLVPYAQI FPIDLQSDRT
LYIVTDWHPR VLSTTKVGTC SDVSNAVMYI GPDSLALVQH WLQSSRIPSC GSLLDLCTGS
GVQALAALTM EKANQAVCVD LNPRALQMTR LNAILNDLDT KVQCVLGDLT SDVGRIYTNS
EGSHDLAIDD KAQPLLDVLR RISPRFDLIT ANPPFLPVPP EITQARHGLF SAGGPSGEAV
LASIVQLSSS LLSNTGFLAI VSEFFLKSVE AAYVAPCSVS SRGSEESANE PGVPSSPEVL
LDSNFDLLPQ HDALDRPADE LLSRIESWWN NNQHDPKHDL SSVVASSTTS IATTCSSTRA
RGLLLTNEYP ITADLYAERR ADNAEEFAIW QRHLQSLEIG ACSPGFFILQ KMHPAVDGEV
PASSCFAHQT VPQTSWGSLW TPSNPQAVAY TNRVLSDFFT RPNLEEG
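
The sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence. It is not part of the original record. Because the Translation table field is empty, the standard genetic code (NCBI table 1) is an assumption here, and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), in TCAG order.
# Assumption: the record leaves "Translation table" blank, so the
# standard code is used here for illustration only.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
first_blocks = clean("ATGAAGTCGA CTCAAGAACA AGGGGAGGCC")
print(translate(first_blocks))  # prints: MKSTQEQGEA
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content.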