Gene PHATRDRAFT_44665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44665 
Symbol 
ID7197651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1203508 
End bp1205076 
Gene Length1569 bp 
Protein Length522 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178384 
Protein GI219115177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTGC AGATTTTCAA CGAAACAAAA GCGTCGACGC GGCTCCATTT TGTCGCCACG 
TTTGGTCTAG CCAAAGCCCT TGCCAACGCC ATGAGTGGTA GTCTGGCTGA TACGCTTGGC
CGGAAACCAG TACTGATTTT GGGCTGCTTG GTAGGCTTGC CCGTAATGCC CTACGTTATT
GTGGCGAATT CTTGGAGCGG CGTTACACTC ATGAATGTTC TCTTTGGACT GTCACAAGGG
TTGCTAGGCT CTTCGCTTTT CTTTTTATTA ATTGATCTAA TGGGATCACG TCGTAGGGGA
ATCGCTGTGG GTATGGGAGA ATCAACAATT TACGTTTCGA CTGCCATTGT AAACGTTCTT
GCAGGGCGAC TTGCTTCAGT GTACGGGTTT CGACCTGTTC CTTTTTATGT GGCAACCTCA
ATAAGCGTGT TGGGACTTTT GTCGACCATA CCCCTGCAAG ATACTTTGGA TCAAGTGAGG
GCCGAACAAA CAGAATCCGA ACGGATCAAT CGGAAGAAGT ATGCTCGGTT ACTGAGTACC
CAGCCCAGTA AGCTGGACGA AGAGCGAATA GACACTCCAA ATTGCACTGA TCGTAGCCGT
CTCGATACCA CCAATTGCGG CATCGGCGAG ACTTATGGTA GTGTAGATGA AGGTTACGTT
TCCCCGTGGC GGATGTCGAC GCACACGCTG GATACGACTG ATAAGGAAGC GGTGAAGGAA
AATGACGGTG ACATGGAGAC TCAACCGATG ACCGCATTTT CATTTGCTAC TTCTTGCTAT
GTGTCAACCA GCCGCAGGGA TCAGCGGACC AGTTCTGGCC CTCTACGTTC TGTTAAGATG
TTGAAGTCGC CAGAGACGAG CTTAAGCTCA CTTCTTTTAA AGAATCGGAG CTACGTCGCT
CTTTGCTTCG GAGGCATGTC GTTGAACTTC AAAGACGGTT TTTGTTGGGG TTCATTCCCG
GTCTTCTTCA AACATGAACA TGGCTTGAGT GACGTGCGAA CTGATTGGTT GATTGCTATC
TATCCTTTAT GTTGGGGCAG TGCCCAAGCT TTTACTGGGG CTTTATCTGA TCGTTTTGGT
CGCAAATCAT TCCTGGTAGC AGGGGTAGGA TGCTGTGCGG TTTCAATGGT CATCTATGTA
CTTCCTTCGT ATTGCTGGGG TGTAGCGGCA GGTTCCAAGC ACTTTCATGT TTGGGTGGCC
GCGGATGTGT TGTTAGGATT TGGGACCGCA TTGGCATATC CGGCACTACA AGCTGGTGCG
GCCGATGAGG TAGATCCTGC CTACCGCGGA CTCGCACTCG GATTCTACCG CTTCAGTAGA
GATATGGGCT ACGTACTTGG AGCGATCGTT TGTGGCCCTC TTACGGATGC GATTGGCTAC
GAGGACACAT TTCTGGTGAA TGGAACCGTA CTTTGCCTTG CTTTTATATT ACTGCTTGTG
TTCTATTCAG ACGATCAGTC TGAACAAACC TTTGAATTCA GTACTGCCAC AGATGCAACT
TTTAAGCCTT CCCCTTTTAC CGCAGGGAAA TCACGAATAC TAACCGCCGG GATTTCAGAT
TCATGGTGA
 
Protein sequence
MAVQIFNETK ASTRLHFVAT FGLAKALANA MSGSLADTLG RKPVLILGCL VGLPVMPYVI 
VANSWSGVTL MNVLFGLSQG LLGSSLFFLL IDLMGSRRRG IAVGMGESTI YVSTAIVNVL
AGRLASVYGF RPVPFYVATS ISVLGLLSTI PLQDTLDQVR AEQTESERIN RKKYARLLST
QPSKLDEERI DTPNCTDRSR LDTTNCGIGE TYGSVDEGYV SPWRMSTHTL DTTDKEAVKE
NDGDMETQPM TAFSFATSCY VSTSRRDQRT SSGPLRSVKM LKSPETSLSS LLLKNRSYVA
LCFGGMSLNF KDGFCWGSFP VFFKHEHGLS DVRTDWLIAI YPLCWGSAQA FTGALSDRFG
RKSFLVAGVG CCAVSMVIYV LPSYCWGVAA GSKHFHVWVA ADVLLGFGTA LAYPALQAGA
ADEVDPAYRG LALGFYRFSR DMGYVLGAIV CGPLTDAIGY EDTFLVNGTV LCLAFILLLV
FYSDDQSEQT FEFSTATDAT FKPSPFTAGK SRILTAGISD SW