Gene PHATRDRAFT_12095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_12095 
SymbolFru5_1 
ID7200689 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp194726 
End bp196806 
Gene Length2081 bp 
Protein Length521 aa 
Translation table 
GC content50% 
IMG OID 
Productfrustulin 5 
Protein accessionXP_002179600 
Protein GI219117616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCTA GCGCTGGCGG TATCTCGACG GATGATGACG GCATCGATCA CGTACAGGCG 
GACGCTCTCG CCTCCGCCGG TACCATTTGT ACCTTTTCAG CCTCGCGCAA AGGCGTCGAC
GCTCGGTCCC GAGATATTAA TGTTCAAAAC TTCACCCTTC AACACATGGG AGCCGTGCTG
CTGGATGAAA CCGAGATTGT TCTTAATCAC GGCAATCGCT ACGGTTTGGT AGGACGCAAC
GGATGTGGCA AGTCAACGCT TTTGCGAGCG CTGGGAGCCC GAGCCATTCC GATTCCGCGC
GGTATTGATA TCTTCTTTTT GTCTGAAGAA GTGGAGCCAT CTGATACTAT GACGGCACTC
GACGCGGTCA TGGCAGTTGA CGAAGAACGA TTGCGGCTCG AACAACAGGC CGACGAACTC
AATCACTTGC TCGCGGCACT GGCTGACGCC AGTGTGAACG ACAGCGGTAA CAACGGTGTG
AGTAGAGAGG ATAGCGAGGA CGATAAGACT CCGGAAGAGC AACAAGAAGA TGTCATGGAA
GTGCTGAATG CGGTATATGA GCGTTTGGAC GCGCTGGACG CGAGTACTGC GGAGGTCCGT
GCGCGTTCTA TTCTAAAGGG CTTGGGCTTT ACACACGAAA TGCAGTCCAA GTTGACCAAG
GACTTTTCCG GAGGATGGCG TATGCGTGTT TCCTTAGCGC GGGCCTTGTT CATTCAACCG
GTATGCCTGC TACTCGACGA GCCCACGAAT CACTTGGATA TGGAGGCTGT TATTTGGTTG
GAGGATTATC TTTCGAAATG GAACCGTATT CTGTTGCTGG TTTCACATTC GCAAGATTTT
CTCAACAACG TTTGTTCGCA CATGATTCAC TTTACGAATA GAAAGCGATT GCAATACTAC
GATGGCAACT ACGACCAGTT CATCAAAACG AAGTCGGAAA AGGAAGAGAA TCAGCTCAAA
CAGTTCAAAT GGGAACAGGA ACAGATGAAG TCCATGAAAG AGTACATTGC GCGATTTGGC
CACGGAACGT CCAAAAACGC TAAACAGGCG CAGTCGAAAG AAAAAGTTTT GCAAAAGATG
ATCCGTGGGG GTTTGACCGA CAAACCAGAA GAAGAGAAAC CGCTTAATTT CAAATTCACT
GATCCGGGAC ATTTACCACC ACCTGTTTTG GCCTTTCACA ACGTTTCTTT CGGTTATCCA
AATTGTGAAA AGCTCTACAC CAACGTTAAT TTTGGTGTGG ATTTGGATTC TCGAGTGGCC
TTAGTCGGTC CGAACGGAGC TGGAAAAGTG CGTCTGGGTG GAAGCTGGTC CTGAGCATCG
CTCGCTGATA TATTACGCCT ACTAATACGA TTGTGCTTCT TCCTTTCACA GACTACGCTA
ATGAAACTCA TGTCCAGCGA ATTGCAGCCA TCCATGGGTG ACATTCGACC CCACGGACAT
TTGAAGCTCG GTCGATTTAC GCAGCATTTT GTGGACGTTT TGGATTTAGA CATGACGCCG
CTCGAGTTTT TCGAGAGCAA GTATCCCAAC GACCCGCGGG AAGAACAGCG CAAATATTTG
GGTCGTTTTG GAGTGTCGGG GCCGATGCAG GTGCAGAAAA TGAGGGAACT GTCCGACGGA
CAAAAGTCTC GCGTCGTCTT TGCCAAGGTA GGATCGGATA CGGCATGAGC GGGACGCCGA
GCAGTTCGAC GAATTCGGGA TGACGCATAT CTCTCACCTT TACTTATTTA TTTGGCTCAG
CTTGGTCGCG ACGTTCCGCA CATTTTGCTG CTCGATGAGC CGACGAACCA CTTGGATATG
GTACGCTGTG GGAGCTTGGA ATTGGGCAAA GGCGATTCCA AACGCATTCC AGCAACACTC
ACGGTCGGTC TATCGTTCTC GACATTTTAC AGGAAAGCAT TGACGCGTTG GCTAAGGCAG
TGAATGAATT CCAGGGTGGT TTGGTATTGG TGTCGCACGA TATGCGTTTG ATTGGTCAGG
TGGCCAAGGA GATTTGGATA TGCGACAACA AAACGATAGC GATTCACCGG GGAGACATTC
AGTCGTTCAA AATGGACATG CGCGCTGCCA TGGGCATTGA T
 
Protein sequence
SRDINVQNFT LQHMGAVLLD ETEIVLNHGN RYGLVGRNGC GKSTLLRALG ARAIPIPRGI 
DIFFLSEEVE PSDTMTALDA VMADSEDDKT PEEQQEDVME VLNAVYERLD ALDASTAEVR
ARSILKGLGF THEMQSKLTK DFSGGWRMRV SLARALFIQP VCLLLDEPTN HLDMEAVIWL
EDYLSKWNRI LLLVSHSQDF LNNVCSHMIH FTNRKRLQYY DGNYDQFIKT KSEKEENQLK
QFKWEQEQMK SMKEYIARFG HGTSKNAKQA QSKEKVLQKM IRGGLTDKPE EEKPLNFKFT
DPGHLPPPVL AFHNVSFGYP NCEKLYTNVN FGVDLDSRVA LVGPNGAGKT TLMKLMSSEL
QPSMGDIRPH GHLKLGRFTQ HFVDVLDLDM TPLEFFESKY PNDPREEQRK YLGRFGVSGP
MQVQKMRELS DGQKSRVVFA KLGRDVPHIL LLDEPTNHLD MESIDALAKA VNEFQGGLVL
VSHDMRLIGQ VAKEIWICDN KTIAIHRGDI QSFKMDMRAA M