Gene PHATRDRAFT_25067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_25067 
Symbol 
ID7196961 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2423284 
End bp2425574 
Gene Length2291 bp 
Protein Length566 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176976 
Protein GI219110449 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACAGTCGTGC GCATACTACC AGGGGATCCA CATACCTACA CAAAGTGACA CACACACACT 
CTCGTACCTT GCTGTTTGTG AATCTCCCCC ACTGGAACGA CTGTTTGGCA AAATCAACAA
CAAATCCCCT CGCATCGACT GGAACAAGTG AAAACGATCT GCCCGAGTGC AATTCCGAGA
ATATCTCAGT ATCAGCATAC CGCCAAATCG TTTGTAGTTC TACAGCGACT TTGTTTTTTT
GGCTTTCTCG AGACACTTGA GGAAACTCGT CGATTTGTAG AACGAGTCTT TTTTCCTCTC
GGCGTCGCAA TCACAAACTG GACTTCAATG GGAAACGAAC CATCGAAAAA GGCAGGTCGA
AATTCATCAG CACCGTCGAC TTCGACAGCC ACCACAACAA AGACGAACAC CTCAAAGGGT
AAGTCGAACG CGACATAAAG CGTCCACGAT TCGAAAGTGT ATGAGTGCGC ATTGTAGTTG
CTGTTTCGAT AAATGTTGCA CGGATTGACC TCACCTCGCC ATTTCCACCG TCTATCTCTG
ATTTACTTGG CATGGTCAGA TAAGCACAAG CCGGCTCACG GCTCTAAGCA TGTCAGTATA
AACCCGGGGG ACATAAAGAA CGAAACCTCT ACTCTCCCGG CCGGTAACAA AACCGCCAAA
CCAAAATTGC GCATGGACGA ATCCGCGCAT ACCTTTGCCC CCAGCACGGC GAGTGCCTCG
GGTCACTACC GCCGGGGATC GTCGCCCGTC ATGATTACCG ACGCCCTTTC CGATGTTCGC
GTGAACTATC ACATTGAACC CAAGGAACTA GGACACGGTC ATTACGGGGT GGTGCGAAAA
TGTATGCACC GTGATTCCGG AGAATGGTAC GCAATCAAGA GTATTCGAAA ATCGAAAGTG
TCCAAAATTG AGGTATTGAA ACGAGAAATT GCTATCCTGA AAGAAGTCCA ACACCCGCAC
ATAATCGAGC TGCACGAAGT TTACGAAGAC GAACGTTATC TGCATTTGAT TACGGAAATT
TGCACCGGTG GGGAACTCTT TGATCGGATT ATTGCCAAAA CGCAATCCGC CGAAGGACAC
TTTTCGGAAC ACGATGCGGC CGTCCTGGTG CGAGACATTC TCGACGCTAT TCGCTACTGC
CACGACGAAA AGGGCATTGT TCATCGCGAT TTAAAACCGG AAAATTTCCT CTTTCTCACG
GAAGCAGAGG ATGCACCCGT CAAGATTATT GATTTTGGGT TGTCCCGGCA CGAGACAGAC
ATGGGTATCA TGCAAACCAA GGTAGGGACA CCCTACTACG TCGCGCCAGA AGTTTTGAGA
CGGGAGTACA CCAATTCCTG TGATATTTGG TCGATCGGTG TCATTACGTA CATCTTACTG
TGCGGCTACC CACCCTTTTA TGGTGAATCC GACACGCAAA TATTTGAATC GGTCAAAGTG
GGCAAGTTTG ACTTTCCGTC ACCCGAATGG GACGAAATCA GCCAGTCGGC GAAAGATTTC
GTGCTGATTA TGCTCAAGAA GAGTCCCATG GATCGGTACG GAAAGGTGTC CAGTGACAGA
CATCTGCCTT GGCATCCGCA TGTTATCGTC ACTCACCCTT GTATTACTCA TCTTGCCTCC
TCCTCGTATC ATTAGACCTA CGGCTGCCGC TGCCCTTAAG CATCGATGGC TCAAGGAACA
GCTCGGACGC AAGGAACTGG CCACCTCTAG CATTTCTCAT GCAAGCGTTC GGACGGGAGA
GTTTACCAAG TATTTGGCGA TGAAAAAGTT GAGAAAGGCG GCTCTCGGTT ATATTGCGTC
GAACCTGACA CAAACCGAGG TGGGACATTT GGCGGAATTG TTCAAAACCA TGGACAAAAA
CGATGACGGT CACGTTTCAC TAGCCGAACT AGATGAAGCT ATTGCTAAGG GAAGCTTCAA
TAAGGAAATT CGAGACGATC TCAGGGAGAT GCGGCACGAA TTGACCTTGT CGGACGAAGA
GACTATTGAT TACCGAGACT TTTTGGCTGC AACCATGGAT CGCAGTCTAG CAATGCGCGA
GGAGAATATG AAAATGGCTT TTGAGCATTT CAAGCGTTCT GACGCCGACT ATCTGACTCT
GGAAGATTTT GCCGATTTCT TTGGTGGAGA AGCGCACGCT AAGGAGATCT TGAGTCTGTT
GGATGCCAAC GGAGATGGGA AGGTATCGTT CGATGACTTT CGAAGAGTTA TTGCCGAAAG
CATGGAGGAC GACGAAGATG AAACTGAGAA TGGGGAAGTC ATTGGGTAAC AGTAAATGTA
ACGTAACGCA C
 
Protein sequence
MGNEPSKKAG RNSSAPSTST ATTTKTNTSK DKHKPAHGSK HVSINPGDIK NETSTLPAGN 
KTAKPKLRMD ESAHTFAPST ASASGHYRRG SSPVMITDAL SDVRVNYHIE PKELGHGHYG
VVRKCMHRDS GEWYAIKSIR KSKVSKIEVL KREIAILKEV QHPHIIELHE VYEDERYLHL
ITEICTGGEL FDRIIAKTQS AEGHFSEHDA AVLVRDILDA IRYCHDEKGI VHRDLKPENF
LFLTEAEDAP VKIIDFGLSR HETDMGIMQT KVGTPYYVAP EVLRREYTNS CDIWSIGVIT
YILLCGYPPF YGESDTQIFE SVKVGKFDFP SPEWDEISQS AKDFVLIMLK KSPMDRPTAA
AALKHRWLKE QLGRKELATS SISHASVRTG EFTKYLAMKK LRKAALGYIA SNLTQTEVGH
LAELFKTMDK NDDGHVSLAE LDEAIAKGSF NKEIRDDLRE MRHELTLSDE ETIDYRDFLA
ATMDRSLAMR EENMKMAFEH FKRSDADYLT LEDFADFFGG EAHAKEILSL LDANGDGKVS
FDDFRRVIAE SMEDDEDETE NGEVIG