Gene PHATRDRAFT_48267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48267 
Symbol 
ID7203370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp684636 
End bp686836 
Gene Length2201 bp 
Protein Length570 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182584 
Protein GI219124593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000901628 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTTGCGCAG GGTTGCTTCC TTGCTTGTTT GCACGTCCTA AAATCAATCT CTCTAGAGTA 
GCATTCATTG CTTACAGTTG CTTTGTTAGT AGAGAGTTTC CTTGGTTGCG ATCGTGCGAG
TGCCAATCCT GCCGTACAAT CCCAAACCAA ACTGTACCAA TTCACCTTCG TTCCAACTCC
AACGCTTCGA CAGACCCATC CGTCCCTACC AACATGACGG CGGCGAACAG CGTTCAACCG
CAGAGCGATC ACGCTCCGAG CGAAGAGCGT CGAAAGCTAC TCAACGAGGA ATTAGCGGTA
CCACTGGTGT CGAATTTCTT CCCCATTGAA GGCTACTACA CCGCGGCGCA AAAGGTTTTC
GATAGCTTCC AGGATGCCTT TGAACACCGA CAGATCGACA ACGCCTTTGT CTACGGGAAA
CGATACTGTC TCTTTGTAGT AGACGCTATT CCACAACACA ATTACTTCAA CGCTACAAAA
ATCAAAAAGA TGCAGAACCA GCATCATCGG CAGGTCGACC TCGTCATTGA CCAACTCGAC
GTCGTGGCGA CGTGGATGGA CGAGTCGGAA ATGGAGCGCC AAACGCGAGA GCGAGAAGAA
GCCAAACGAC GACGACAGCT CGCTATTCAA AGAGCCAAAG CGGAAACTGT GCGCTACCAA
CAACAGGAGC AGGAACGCTA TCGCCAATTG CAGCTACGGT TCGATCAACA CAAGACACAC
AAAGAGAGTC TCAATGAGGA TCCGGAGCAC GTACAGGCAT CGGCGATGGA AAAGTTGGAA
AAGCTTCGAG CTCTCCAAAA TGGTGTCGAT GTGGCGGCAC GCATTCCCCA AGATCCTTCC
GGCGAAGAAG CGGGCAAACC TGGATCACGA TACCGCCTTT TATCGGATTC CGAAGAAGAC
CACGCCGAGC AACAGCAAGG CGATCAAAGG AATCCACCCT ACGATACAAT CATTAGTGGG
ACCGTTCTGC CACCTCCTTT ACCCCTTCCA AGTGCACCGC CATCCTACGA TGCCATAGTT
ACATCTCGGT CGTCCCGCAA CTTTTTGGGC CCGGCAGTGC CCTCGGAACC ATTCCCCAAA
TCGACATTTT TGAACGGCAA CAAGTTTGTG GACGAAACGA CGGTAGCATT GCCCGCAACC
CCCGAAACAC CAGCACGCCG CCAGCGCGTT CCTATGCGAG AGCTTCAGCA CCGGTACAAG
CAAACATACG TAAAATACCA ACAGGCGGGG AAAATCAAGG TCTCCGGTAT CAACACTTAT
CAAGGGCGGT TAATCGAATC GACCAACGGA TGTACAGTCA TTTCAGCTTT AGTCGCCGCG
CATCAATTAT CCTCTCGATC GGGGGCAGTC ACGGACGCAA CCGTCATCAA CGTGATCGAT
CGGCAATGTG GACCACTCTT ACGTGAAATA CGGGGCAAAC TTGGCTTGGG TGGTCACGCT
CTGATTATTC CATCTGATGT ACACGATCAT CTTGTCGATC ACAACATATT GTCACAAGAG
ACATTCGTTG GTGCCGCCGG CGGCAACATA TTGGACGAAG GCCACATGAA CGAGTTTCTC
AAACTTTTGC AAGGAGATAG CGCACAGCAT GCCAAAGCTG CCGCCACACT ATTTTTTCGT
GAGCACGTCA TTTCAATTGT TAAATCACAG CATGGCAAAG CCATTAGTGG TAGCCTTAGT
AACCAGGGCT TGTGCTGTTA CGAATTAATC GATTCAATGC CGGGAATGTT TGATGGTGGA
CGAGGCATGG CGACTCGTAC ACGTTGTACG GATATGGATT CTTTGCAAGT TCTATTGCGG
TGGTACGCCT CGCGAAAATT TTCTGACTCC AACTGTTCCT ACATTGACAA AACCATTTGG
GATGACAGCA TGGCTGACTT TGATCCCCGT GTCTTTCAAG GATTTGTTTG GGCGGCAGCC
TCTTGAACGA ATAAGTTTTG CAAAAGATGT CTCGGCAAAA GATATACGTG ACTTTTTTCC
ATTGTTCGGC TATCCGCGAA TGCGCGGCCC TTGCTTGTAC AACAATCCGC ATCAGAATTG
AAATTAGAGC CTACCAGAAA CGAAGCTCCA AGTGGTACAT TGATAATAAA ACCATCTTCA
ACGGAACAAA AAGGAAACTG CTGGAAAACT TTGTTTCGCC GTCATTCAGG AGTCCAGAAT
CTTGGGAAAT TTGCGGAGTT CAGTTTTTCT ATCTTGATCT C
 
Protein sequence
MTAANSVQPQ SDHAPSEERR KLLNEELAVP LVSNFFPIEG YYTAAQKVFD SFQDAFEHRQ 
IDNAFVYGKR YCLFVVDAIP QHNYFNATKI KKMQNQHHRQ VDLVIDQLDV VATWMDESEM
ERQTREREEA KRRRQLAIQR AKAETVRYQQ QEQERYRQLQ LRFDQHKTHK ESLNEDPEHV
QASAMEKLEK LRALQNGVDV AARIPQDPSG EEAGKPGSRY RLLSDSEEDH AEQQQGDQRN
PPYDTIISGT VLPPPLPLPS APPSYDAIVT SRSSRNFLGP AVPSEPFPKS TFLNGNKFVD
ETTVALPATP ETPARRQRVP MRELQHRYKQ TYVKYQQAGK IKVSGINTYQ GRLIESTNGC
TVISALVAAH QLSSRSGAVT DATVINVIDR QCGPLLREIR GKLGLGGHAL IIPSDVHDHL
VDHNILSQET FVGAAGGNIL DEGHMNEFLK LLQGDSAQHA KAAATLFFRE HVISIVKSQH
GKAISGSLSN QGLCCYELID SMPGMFDGGR GMATRTRCTD MDSLQVLLRW YASRKFSDSN
CSYIDKTIWD DSMADFDPRV FQGFVWAAAS