Gene PHATRDRAFT_34625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34625 
Symbol 
ID7199696 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp1072931 
End bp1076361 
Gene Length3431 bp 
Protein Length675 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178907 
Protein GI219116224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000047159 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTCC GGAAACATAG CAATCGTACA AATGCACCCT TTATTGTGAG TCCACGTGGC 
CACGAAATAA GGCTGACAAT ATTTCTGTGC GCACCTCCTT AACTCCATGT GCATAGTTTT
GATTCACACT TGCGACATCC ATGTTAGGCG AATCATGTCC ATGAGACATG TCGCAAGGAG
GCATACCGTA ACATTCGTCA TAATGCCCAA CCTGTGTTTA CCATGCACTG TCAAACAGCG
TTACTATCCT ACGTACGTAC ATTGGTTGTA CTATGTGAGA GATTAACAGT CTTGACAGCC
CGTCAAACCA CTGCTGCCAA CTGCACAGTG TTTTTGTGGT AAACATTTTT CTTGAGTTTT
ACTTAAGATT GTTTCCAAAT TTGTACGAGG TGTACATTTT TGGTAACAAA CCGCTGACAT
TCTTTTGTGG TGTATAAGTG CCAGAGGTAT CACTTAAGGT GGTATTTATT GGCTTGTGCG
ACACATCGTT TGGAAGCCGC ATCCCCATCC CCTCACGTAA ACTTGTACTG AGAACCCACT
TGTAACCCTA ACGTTGGCTC AGCTCCAGGT TTTCGTAAGT AAGTTTTCGT TGCCTCCCTA
CTTCCTCGGA ATTGCTGGTG CCGAAAACCC CTTGGAGTCT TCAGGGTTTG TTGGCAAGTC
GTTCCGTGAT TACGGGTTGC TCTGGTTGGG TGAGGTAGAA ATACCAATAG GCCCAAAAGA
CTCCAACCCT GATTCTGGTT GGGACGGGCT TACTACCCGG CGTAAAATAG TACAGGGGGC
CAATCAAACC CTCGACAGTT TGCGTACGCG TGCGAGCGTA ACAGACACTA CCTACGGAGG
TAGAATTGAT TGTACCAGTG CTAGCTTTGT TCACTGGTGT GGGGTCATTT AACGAAGTCC
GCAATAACAA AACAGCAAAG GACCCAAACA CAAACGCTGT TTAATGGTTT GAGAACCTCA
AGCATTCTCA TATTGCTCTG GCTACCTAGC CCATCATGGT GTTGACCAGT GAGGATGTCT
ACGCGCACTT CTTAGACAAT GTGTTCTTGC TCTCCCAGGA GCATCCGATT CGACTATTTC
TTATGCAACA AGGTTTTGAA TCTATGGAGG ACCTCTTTGG CATATTTGAA GATGATCTCA
GTACCTTTGG ATATTTTCGC ACTGCTACTC TTAGATTCAA CAAGAACCCT CAATGGTCCC
TTTTGTCCCT TGCACACCGT CAGATTCTTC GACACTTCCT GCATTGGCAG GCATCTCTTT
GGCATCAAAA GGGAAGCAGC TTGAAGTATC TGGAGCTTGT TACATTGACC GACCAGGATT
TCGCTCAGTA CCGACAATCA ACATTGGAAG AGATACTTCC CGATGACACC AACAAAGTTC
ATGCTTGCTC TAGCATGCCA CCTGTTCCTA GCAAGACGGT CCTCATGCAC ATTCTGGACA
ATGTGTTTGT GCTCTCCCAA GATCACCCAA TCCGTTTATG TTTTTTGCAG CAGAGCATTG
AATCTATGGA GGATTTTTTC AGTTTTCTTG AAGATGGCAT TGATGCCCTC ACCTTTTCAC
CCACACCTGC TGACAAAGGC AACTCACTTC CACAACGGAT GCCCATGAAG CTTGGACACT
GTTGGCTTCT GCAGGCTTTC TTTGACTGGC AAGTATTGCT TGAATGGGAA AAGGGGAGTT
TTTTGGAGAA TTCGGAACTT GCTGCATTGA CCAAAGCGGA TTTTACTCGT TACCGACGAT
CTGCAATCAA GAAAGCCTCA ACTGCATCCC TTATGCCATC AGCTTCTATC CTTGGTTTCA
CCAGGAAGGG CCTTACACCA CAGGATGGGG AGGGTTGTTC TAAGCCCATC AACTTCGCCG
AGTCGCATAG AATCGAAAGC AAAAATTCCT GTGTCTTAAA GGGTTTACCT GATTCCACTC
CTGACAACCT CATTGGAGAA ACTTCGCCAA CCAATAGTCA GGATGATGGG GAGCAATTTC
GTGAACAACA AGCTTTTACC AAAAGTCAGA ATAATGGGGA GCAAGTTTGT GAATGGCAAC
CTTTTTCAAC TGATAGTCAT GATGATGGGG AGCAATCCTA TGGATGCACT GTACGAAAAT
TTCTGGGTTG TGATAAACCA TGCAACATGC AATTTTCGGT GGATATCAGT GATTGCAAGT
GTGATGAAAG CTTTGCCTGT ACCGAAAGTG ATGCCAAGTA TGATGAAAAC CTTGTTTCTA
TCGAAAATCT AGACAAACAC GAAACCAATC TAGATGAAAA GTGGCACAAG AGGGACCGAG
AATGGTGTTT CAAGGCATCT CCACCCATGT TAGGAATCTA CAAAGGTCCT AGATACAATG
GGTTTGTTCA ACAGGAAACT AGGGATTCTA CCTGTGAACC TCTAGATGTC TGTCACGGTA
TAGTACCATG CGACATGTCT TCTAATGATG TCCGTCAGTT GGATGTCAAC ACGGTGAACG
GCAGTTGGAG TGTTCTTGAG AATGAAGTGA GTCGTATGGG AGTTAGTTCG GTCTTATTGT
ATGGAGTCCC TAGATACAAT GATGTATCGA GCGTTGGAGA GAACGCGACA GAACGTGTCG
TGTTCTGACG GTGGCAACGG AGTGGGACAG ATAATACTTA TTGCCTGCTT AACGGATGTA
AATGGATGCG CTCCATTTTC CACGGCATGC ATGTTATGGA AGGTATTCTC GGTGTCACTA
TAAGTATATG TGAGCGCGGG TCATGGTATA GGAATACTAT GACCGAAGAA AGATATTTGT
GTACGGTTGA GTTTGTTTCT CAGTTTTGTT GTTTTGTTGG CAAATCGGAG AAGCAACGAG
AACTAGTTTA TAGCACTAAT TCTTCAATCA TTGTTTTATA AGTTCTTTGA TCCGTACTGC
TAGTTTACGC ACAGGTCCTC GTATCCGCTG CCTAACCCGA GGGGAGCGTC GCCAACGAGT
AGAGTTTTAC AGTCCCTAAG CCCCTTAGGT GTATCGCCGG TCGTTTGTCA TAACGCCCTC
TCTGTTTACC AAGCTCTGTC CCTAACAGCG CTTCTGTCCG AACGGTGCAC ACTGGTGTAT
ACCGTAACAG ATTAGCAGTC TTGGTTTCTC GTCAAATCAC TGCTGCTCGT CGCACAGAAT
TTGGTAAACC CACCTTGAGT CCGCCTCAAG GAAGTGTTGA GCTTGGTACC AACCGTACCT
TGTTTGACAA CACTGCTTAC ATTGCTTGTT GTTGCATAAA GTGTCAGAGG TGTCTACGGG
GACATTTATC GGCTTGTGCT TCGCACCTTT GTCAAGCTGC CACCACCACC CTCCTGACCA
TTGGTATTGA AAACCCATTA GTACTTCAGG GTTTGTCAGT AAGTCGTCTT GGCCTCCCTA
CTTCCTCGGA ATAACCGGCA CTGGAAATCG TCTAGGTTTT CTTCTGGATT TGCCAGTGGG
TTGTTTCGTG A
 
Protein sequence
MRLRKHSNRT NAPFIANHVH ETCRKEAYRN IRHNAQPVFT MHCQTALLSY LQVFVSKFSL 
PPYFLGIAGA ENPLESSGFV GKSFRDYGLL WLGEVEIPIG PKDSNPDSGW DGLTTRRKIV
QGANQTLDSL RTRASPIMVL TSEDVYAHFL DNVFLLSQEH PIRLFLMQQG FESMEDLFGI
FEDDLSTFGY FRTATLRFNK NPQWSLLSLA HRQILRHFLH WQASLWHQKG SSLKYLELVT
LTDQDFAQYR QSTLEEILPD DTNKVHACSS MPPVPSKTVL MHILDNVFVL SQDHPIRLCF
LQQSIESMED FFSFLEDGID ALTFSPTPAD KGNSLPQRMP MKLGHCWLLQ AFFDWQVLLE
WEKGSFLENS ELAALTKADF TRYRRSAIKK ASTASLMPSA SILGFTRKGL TPQDGEGCSK
PINFAESHRI ESKNSCVLKG LPDSTPDNLI GETSPTNSQD DGEQFREQQA FTKSQNNGEQ
VCEWQPFSTD SHDDGEQSYG CTVRKFLGCD KPCNMQFSVD ISDCKCDESF ACTESDAKYD
ENLVSIENLD KHETNLDEKW HKRDREWCFK ASPPILAVLV SRQITAARRT EFGKPTLSPP
QGSVELGTNR TLFDNTAYIA CCCIKCQRCL RGHLSACASH LCQAATTTLL TIGIENPLVL
QGLSVFFWIC QWVVS