Gene PHATRDRAFT_32848 details


Gene Information

Locus tag: PHATRDRAFT_32848
Symbol:
ID: 7197476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 930098
End bp: 932155
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002178036
Protein GI: 219112569
COG category:
COG ID:
TIGRFAM ID:
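
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; both assumptions are consistent with the listed values, but the record does not state them.

```python
# Values copied from the Gene Information fields above.
start_bp = 930098
end_bp = 932155

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes exactly one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2058 685
```

Both results agree with the Gene Length and Protein Length fields.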


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.270203
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCATC TGCTAGCATT GATGGAGGAC GAGCCTCTGG AAGACAACAG CAAAAGTTCC 
CACTCAAAGA TACCAGAACA TATCGTCCGA CAAGCCCGCA ATGTTTCCCT AGAACAAAAG
ACCACACCCG CGCTAGCAAC TGACATTCAG GATGCCGTGC ATCTTGAACC GCAGCACCAA
TTGTCTCTGA GTGTGGACGG TCGAGTTGGT ATTCGTATGA TCAAGCGAAA GATCTCGGGG
TCGCAGTTGT TGGACATTCT AACAGCACAT CCTTACCAAT CACCGGCAAG CTTGGCGGCC
ATGAGTCTCG AGTATTTGAA TCGTTTGCTG CTAGAGCCGG CCTCAATAAT AGACCAAGCA
ACTGTTTGCG GGCGTACCAA CGTGATGACG GTCGGCATCG TGTTTTCCAA CTCGGGTACA
CGAGTTGCCT CTTCGGGAAA TGCTTATTGT GTTGTTACTC TCGGATCACT AAATACGGGT
CCAGCTGTTT CCGTATTGTT GTTTGGTTCC GCTTACGGTA AGCACTGCCG TAGCTGTATC
CCTGGTAAAG TGGTGGCCTT GGTCAATCCT CGCTTAATTC CTGCTAAAGG TGCAGCCCAG
GGAGACACCT CCATTTCGTT TTCTGTCAAC GAGGAACGTC AGCTTTTGGA CGTCGCTGAT
GCCCGTGACT ACGGAACTTG CAAGGCTGCG GTCCGGGGGA AAAACGATAA CGGTCATTGG
GTTGCTGGTG GTAAATTTTG CGGTCACTTC GTGGATAAGC GCATCAGTGA GTATTGCAAT
CCACACCGGA AACAGGCCAA CGTCAAGACG GGCACAGCCA ACCACACCTC CACATCCGCC
CTCCAAAACC TCCGAAATCA AGCCGTCGCT TTCCCTAGAA TTCAGACTAG AGTGATGATA
CCACCTGGTG GTGTTAAACC CTTTCAGACA CCAAAGCAAC AAACGAAATT GCGATCCGCC
CAAATGATGT CTGACTTTCT AGCTCAGTCT ACCGCCCCTG GGGCGGGATG CTTACTTCCT
TTCCAGCAGC CAACACTATC ACGCAGCCAA CCTGCTATGA ACAAAAATAC TATTTTGAAT
CCGAAACAGT CAGCTACTAA GTCAGTCGAA ATTCCAGGGA GAGGATTGCT CAATCCGTAC
GCCAAAGGAG CATCTTCCAC CGCATCGTCG CACGCGAGGG GTGGGTTTTC TCCACCCAAT
TCCGTACGTA TCCAGGACAC CACCATACGA AAACCGTTTA CAGTCAACCG GGTTAGCCCG
CCGGCTTCAA CCTCGCACTC GGTCACAGAA GATTGGTTAC AAAAAGCTTC CAAGAAGCGC
ACTCCACTTG GCAACGCACA TAACCAAAAA CGCCAGCGCA GATTTGTCAA TACCGACACA
AGAAACTTCA ACGGGTCCGT ACCAGTTCCT AAGCCCTCAC AGATGTTTCA GACTGCACGC
ATAACCAAGC GAGTACCTAT GATACAGACG AGCGATGTCA AGGAAAAGGC AAGGGCGGCC
CAGGTTCTGT CGCAACAACA GATCTTGGCG TGTCGACTGC GGGAACAGGA TGGCGGAGGT
AGCCAGAGTT CCGTTTTTAA AGCGTCTCGC CCATTATCAG GGGCCGATTG CTCAGCTGTA
AGACAGCAAG ACCGAAGGGA GGAATTTTAT GCGTTGCTTG ATGACATCGA CATTCAAAAT
GCTTCCGCCG CTACAAGCCA ATTCGCGGAC GAGGTCAGTG CAGAAGAATA TGCCAGAAGC
CGGCGCGTCG TTACTGAACT CGAGGAACAA GAAGGAAAGA AACAGAGCAA GGTGGCCAAG
TCGAAGACCC TCGGTGACAA GGATAAGACA GCTATTCGAA AAGAATGGTA CTGTCAGCAA
TGCCGAAAAT CGTCCCCGTT CAAACCAGCT GGATGTGTAC GCCGAGGACA CAGCGTTGCC
ACGAAGCGAG AGATTGTTCA AGCCAAATCT ACATCTGAAC GACGCCTGGA TTTGGCTAGC
AAGGATGCCA ATGATGGCGG CTTAACTCTA GGCAGTGGTA TTGAATGGTC CGCAAATAGA
TGGAGTCGCT TCAACTAA
 
Protein sequence
MDHLLALMED EPLEDNSKSS HSKIPEHIVR QARNVSLEQK TTPALATDIQ DAVHLEPQHQ 
LSLSVDGRVG IRMIKRKISG SQLLDILTAH PYQSPASLAA MSLEYLNRLL LEPASIIDQA
TVCGRTNVMT VGIVFSNSGT RVASSGNAYC VVTLGSLNTG PAVSVLLFGS AYGKHCRSCI
PGKVVALVNP RLIPAKGAAQ GDTSISFSVN EERQLLDVAD ARDYGTCKAA VRGKNDNGHW
VAGGKFCGHF VDKRISEYCN PHRKQANVKT GTANHTSTSA LQNLRNQAVA FPRIQTRVMI
PPGGVKPFQT PKQQTKLRSA QMMSDFLAQS TAPGAGCLLP FQQPTLSRSQ PAMNKNTILN
PKQSATKSVE IPGRGLLNPY AKGASSTASS HARGGFSPPN SVRIQDTTIR KPFTVNRVSP
PASTSHSVTE DWLQKASKKR TPLGNAHNQK RQRRFVNTDT RNFNGSVPVP KPSQMFQTAR
ITKRVPMIQT SDVKEKARAA QVLSQQQILA CRLREQDGGG SQSSVFKASR PLSGADCSAV
RQQDRREEFY ALLDDIDIQN ASAATSQFAD EVSAEEYARS RRVVTELEEQ EGKKQSKVAK
SKTLGDKDKT AIRKEWYCQQ CRKSSPFKPA GCVRRGHSVA TKREIVQAKS TSERRLDLAS
KDANDGGLTL GSGIEWSANR WSRFN
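
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only. The Translation table field in this record is blank, so the standard genetic code is an assumption here, and the helper names (`clean`, `translate`) are hypothetical. Only the first 30 nucleotides and the final row of the gene sequence are used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the record leaves "Translation table" blank, so the
# standard code is used here for illustration.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides and the final row of the gene sequence above.
first_bases = "ATGGATCATC TGCTAGCATT GATGGAGGAC"
last_row = "TGGAGTCGCT TCAACTAA"

print(translate(first_bases))  # MDHLLALMED
print(translate(last_row))     # WSRFN
```

The two outputs match the first ten and the last five residues of the protein sequence. The final row is in frame because the 2040 nucleotides preceding it are a multiple of three.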