Gene PHATRDRAFT_41624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41624 
Symbol 
ID7199453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011700 
Strand
Start bp143290 
End bp145080 
Gene Length1791 bp 
Protein Length596 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185569 
Protein GI219130854 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.263212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTTC AGCGGTGGCG GAACGGGTGG GGGAGGGTAG GATGGTGCAT TCTGTGGTAC 
GGCGCGCTGC CGATCTACGG GACGGATATT CCCTCGGAGG AGGGTTCCCC CGGGTCGGAT
CCCACGACGA CCTCAACAAG AACGACGCCC ACCCCCAGTA CTGCCATTCC CAACGACTAT
GCCTGGCTCA ATCTCGGGTC GTCCGTTCCG TTCTTTTCGC AACTATTCGA CAATCCCGAT
CCTTCCGATG ATCCCTTTGA TCGCCTTGCC GGTTCCCACG AATCGGAAGA TTCCGATCTC
TTTCGGAAGC TCGCTTCCCT CTTTCGATCG ACACCATCTC TGAGTGTGTT TCCGAACGAC
AACAGCAACA ACAATTACGA CAAGGTACAC GACGAGAGTC AGAAATTCGT CAACCAAATT
GTGAACACCT TTAGTTCTCC CGGGCAAGCA CAAACTCGCC AACAAATCTT CAAATTGTTC
CAGCAAAGCA TCATCGCCAT GCAGGAACAA ATGCAGAAAA CCTTTGGAGA CATGCAGAAC
GAACTTTGGG AGCGTTTCAA TCTACTGCAA CTCACCTATT TTCTACAGCA CGAAGAATCC
GTCAAGAATC CTGTTTGGAA ACGACGCCAG CATCGATTTA TGACGCCACT GCCCGTCAGC
GAGGCCGTAC AACTGGGCGA TGGACTCTTT CTCTCCCATC TCGCTTACGT GGACGACTGT
CACCTCATTC AGGAATATCT GCGGAGCTTC CATAACGGCG CCTTTGTCCT ACGCAACTGC
ACCACCTCCA GTCAACCCAA CCAACCGGCC CACTTTTTGG CCGTCCGTAA GATCGCCAGC
CCCTCCCAGA CAACCACAAA ATCCTTCCTG ACGCCTGAGT GGAAATGGCT CATTCATCAG
TACTTGCCCT TCTTCTTTCG TCCCGAACCG CTCGAAGTCG TCCTCACGAT CCGCGGCACC
AAAGAAATAG GTGACTTCTT GTCCGACGCC ATGTTAGCCG CCGCGAAGCA TCGGAATGGT
AAGGCACACG ACGGTATACT CAAATCAACC CAATGGATGC TGAAAACATA CACGGACGAT
TTGCAACAAT TGTTAAAAGA TTCTCAACGG GATAGAATGA ATCTATGGTT GGTCGGACAT
TCTCTAGGAG GCGGAACGGC GGCCTTAATG GCCATTGAGT TGTTCGAAAC GCAAGACGGA
TGGGTTCAGC CCCACGCATT GGGCTTTGGT ACGCCCTCCT TGGTCTCAGC GGAACTATCC
CGCAAGTACA AACCAATTGT TAAGACCGTC ATTAACGATG CCGATGCCGT GCCCCGCATG
TCGGGTGCGT CCATTGCCAA CGCTTGGCTA CGAGTCGTCC GTTTTAACTG GACCGATGCC
TTCTTGCAAG ACTTTGATCA AACCGTTGGT TTTTTGAGCG ATTACTTCTT AAAATTCAAG
ATGAATACCG ATTGGATCAA TGAGTGCTCG CGTGACATTC GACAGATACT CATCACTTGG
CTTGATCAAA AGGTTTCTCC GGGCTTGAAG AACTTGCCTA AAGCGACCGA AAAAGAATCC
GAACTCATTC CTCCCGGAAA CTGTGTCCAC TTATACCGTG ATGGAACTTC CTGGCAAGGA
GCATACACCA GCTGTTACTT CTTTCAAGAG TTGGAAGTGG TTCGGCACTT GGTCGATGAT
CATTTGATTC CGGGCGGGTA CTACCGTGGC CTTTTGAATT ATGTTCGCGA GCAGAAACAG
AATGTTTCAT GGAAATTCGA CCATGACCTG CTCACACTCC CGGTGGGCTA G
 
Protein sequence
MTVQRWRNGW GRVGWCILWY GALPIYGTDI PSEEGSPGSD PTTTSTRTTP TPSTAIPNDY 
AWLNLGSSVP FFSQLFDNPD PSDDPFDRLA GSHESEDSDL FRKLASLFRS TPSLSVFPND
NSNNNYDKVH DESQKFVNQI VNTFSSPGQA QTRQQIFKLF QQSIIAMQEQ MQKTFGDMQN
ELWERFNLLQ LTYFLQHEES VKNPVWKRRQ HRFMTPLPVS EAVQLGDGLF LSHLAYVDDC
HLIQEYLRSF HNGAFVLRNC TTSSQPNQPA HFLAVRKIAS PSQTTTKSFL TPEWKWLIHQ
YLPFFFRPEP LEVVLTIRGT KEIGDFLSDA MLAAAKHRNG KAHDGILKST QWMLKTYTDD
LQQLLKDSQR DRMNLWLVGH SLGGGTAALM AIELFETQDG WVQPHALGFG TPSLVSAELS
RKYKPIVKTV INDADAVPRM SGASIANAWL RVVRFNWTDA FLQDFDQTVG FLSDYFLKFK
MNTDWINECS RDIRQILITW LDQKVSPGLK NLPKATEKES ELIPPGNCVH LYRDGTSWQG
AYTSCYFFQE LEVVRHLVDD HLIPGGYYRG LLNYVREQKQ NVSWKFDHDL LTLPVG