Gene PHATRDRAFT_31981 details


Gene Information

Locus tag: PHATRDRAFT_31981
Symbol:
ID: 7196522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1395648
End bp: 1396856
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002176777
Protein GI: 219110050
COG category:
COG ID:
TIGRFAM ID:
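
The coordinate and length fields of this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not fields of the database.

```python
# Values copied from the gene record.
start_bp = 1395648
end_bp = 1396856

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```

Both results agree with the reported gene length (1209 bp) and protein length (402 aa).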


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCGC TTGGCATCGT GTCGTCCTCC CTGCTGACAT TGCGGAGCAG CCACGGACTC 
GTGCTGGTTC CATTGCCACT GCGACCGTCA TATAGCTCAA GCCACCGCTA TCAAGATAGA
CATGCAACAA CGTCCATGTC ATCCTATCGA GGGGCGCCAG AAGACATTTA TGGAGCTGTT
CATCGAAAAG AATACGAAAT GAAGAAGGTC AAGGCCCAGC ACATGTCTAC GACGGATCCT
GTTCGAATGG CGATGGGGTA CGCACAGGAA TCGGTCTCGC CGATGAAGCT CGCCAAAGCC
TTACGAAGAG TTTACGAGGA TCCATCCAAT CCAGCCAATC CCGACCATGT TCCTCTATCT
GATGAAGAAA AGAAGCGCGC TCAGACACTG CAACAAGTTG GAATGGCCGA TTTGGGAATG
CGACGAGGTA GTTTCATTGT CGACATTAAG CGCAAGTCGT TGAGTCGACC AGGCGAAGTT
TTTTGTAATT ATGATGATGC TGGTATGGTA GCAGAGGCTA TGGTACGCTT GGGAGCCGAC
GCCGTCTTTG TAAACACCGA CTATCAGGCC TACGGCGGTG ACATGACGGA ATTGAAATCG
GCTGTCAAGG CAGTTCGCGC CGTTTCAAAA TCGGCGGCGG TCGTGATGAA AGATATTGTA
GTGGATGAGA TTCAATTGGG ACTCGCGAAA GAGGCCGGTG CTGACGGAAT CGTTCTTATG
TCATCAGTGT TGGGGCCTAC GCTCGAGAAT TTCTTGAACC TGGCAACCAT GATTGGTTTG
GAGACGATCG TTGAGTGTCA TACACACGAT GAAGTACAGA GAGCCATCGA CATCCTGGCA
CCCAATATTT TGGTCAACAA CTACGATCGA GTTGCACAGG AACTTCACCC AGAGCAGGCA
ATTAAGCTTG CAGGTATGTT CCCTGGCTCT GGTGGACCCA TTATTTGTTT GGCTGCGGGA
GAGATCGAAA CCACCGATCA AATGAAGCGC CATTTGGCGG TTGGGTACGA CGGAGTTGTG
GTCGGTAAAG CAGTCATGGG AAGTCCGGCA GCTCCCGAGT TCATTCGAGC GGTTCGGGAT
CGAACACTGC TACCAGCCGA ATTCTCGGCT TGGGGTTTAG AAGACGTGGA GTTTGACATG
GACGGAAATG TCATGTCTGG ACCCAAGCGC GGCACTCCTC AAGATGGTGA TGCCGACGTC
TACCAATAA
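
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a rough sketch, the GC content field could be recomputed as follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported 52% is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, used only as a short demonstration.
first_line = "ATGACGGCGC TTGGCATCGT GTCGTCCTCC CTGCTGACAT TGCGGAGCAG CCACGGACTC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 1209 bp sequence to `gc_percent` gives the value to compare with the GC content field.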
 
Protein sequence
MTALGIVSSS LLTLRSSHGL VLVPLPLRPS YSSSHRYQDR HATTSMSSYR GAPEDIYGAV 
HRKEYEMKKV KAQHMSTTDP VRMAMGYAQE SVSPMKLAKA LRRVYEDPSN PANPDHVPLS
DEEKKRAQTL QQVGMADLGM RRGSFIVDIK RKSLSRPGEV FCNYDDAGMV AEAMVRLGAD
AVFVNTDYQA YGGDMTELKS AVKAVRAVSK SAAVVMKDIV VDEIQLGLAK EAGADGIVLM
SSVLGPTLEN FLNLATMIGL ETIVECHTHD EVQRAIDILA PNILVNNYDR VAQELHPEQA
IKLAGMFPGS GGPIICLAAG EIETTDQMKR HLAVGYDGVV VGKAVMGSPA APEFIRAVRD
RTLLPAEFSA WGLEDVEFDM DGNVMSGPKR GTPQDGDADV YQ
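
The protein sequence can be related to the gene sequence by translating it codon by codon. The Translation table field of this record is blank, so the sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code (assumed: NCBI table 1), listed
# with the first codon position varying slowest, in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGACGGCGC TTGGCATCGT GTCGTCCTCC"))  # MTALGIVSSS
print(translate("TACCAATAA"))                          # YQ
```

These two fragments reproduce the first ten and last two residues of the protein sequence listed above.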