Gene PHATRDRAFT_34520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34520 
Symbol 
ID7199637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp817144 
End bp819618 
Gene Length2475 bp 
Protein Length824 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179069 
Protein GI219116548 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGC CGCGACTACG CAGTGTGTGG GGCGTCGGCT GCCTACTTTT CGTCGGGGGG 
TCGGCCGCGG AAGCGACCCG TCGTTCGTGG ACGCAACGGC AGCCTTTCCG ATGGATCGTG
AACAATCCTT CCAAGCTCAA CTTCACATCA TCACCCGCCA TGGGTGTCCC GAGTCCCCGT
TCCAAGTCAT CGACGGTCGT ACCCACCCGG GTTCCCGCGA ATAGTTCCTC ACGGGTCCCG
ACTCGATGGC CGACGACGCG ATTCGGAGGG TCCACCAGCG CGCCGACGGG AAGTACGACA
AGAGGACTTA CGCTGCGTCC GAGTATGGCA CCTATGGCAC TGCCGACGAC GCATGGGACA
CTCACGGGTC CAACGGAAAC ACCCTCGGGT CTTCTGAGAA CGAGTCCAAT GTGGCCCTCC
CCGGTACCTG TTCATAGTCC AAGACGAGAC AGATCGCCAA TCACAACACG GATGCTGGGA
ATTCCTACAT CGACGAGTCC GGTCACTCCA ACCGCTGTCT CTATATTTCC TTCTCGACCC
GCCGGAAGTT CATCACGGAA ACCGGTAGAG GAGAATAGCA ACGACGCAAC CGGTAGGCCA
TCGCTAGGAC CATCGCGTAC GCTTTCCATA CCAATGGCGT TCTCCACTGA TCTACAGCTG
GTGCAATACA ACTTTAGCGG TCCGCAAAAC ATTGTGGAAA CTCTAGAATT TGCTTGGCAA
GGTTATTTGA CCGCCATTTT GAGTCGCTAC TATCGAGACC GTGAAGGGGT CCAGTTTACC
GGTGTTGACC TAGACGTACG GCAAGGCCGG CAACGCAGGT TCTTGTGGCA GTCGAACATC
AGACCCACGA GACGCTTACA GAAACTGGTG GGCAACGCCA CGATTTTATC GTTCGAAGCT
AACGGTACCG CCGTTTTGCT GGTGGATGCC AGTACACAGG ACGCAAATTC GATTGTTGCT
TCCACCAATT CCTTTCTCCG CACAGCTGTC ACGATGGAGA ACTTGCAACA AGCACTGGTG
GACGTTAACG AAAACCTGGT GACAGTGTCG AGTGTGAGTG TACCAAACGC CACTGGCGTG
CCCGAGCCCG ATCGAGATGA TGGACCCACT ACCGTTGAGA CTGTATTCGG GTTGTTCATT
GCGGCTGCGG CAATGTTGGG CTTGGCCTAT ACATGCCGCG TCATTTGCCA AAATCACAAG
GAAAGGCAGG CTCGTGCTAA GAAATTGATG GCCCGTCCCA TGGTATTACC GAATGCCCCG
CCGTCGCTAG CTTACCAGCC CAGACCGATG CCAAGGCAAA CTTTGGTCAC GCCCAATCAG
AGTGGGAACA ATGATGACAC CTTGAGTATC CCAGGAATTC CCAGTACGGA AACCAGTGAT
GGTGACCGTT TCGCCAGAGA GCTGCAAGAA GCAGCTTCGT TGGACCGGGC GGTTTGGGAA
GAAAAGCAGT ATGACAACTC AAACGGAGTG ACCGCTCCTT TTACTAGAGT GCCGGAGTCC
ACAGGGAAGC TACAAGTGTC CTCGTCATTC CCGTATGGGG ACGAAGCCGT CGGCAACTTC
GAGTCCATGG TGCACCAGCA AGGAGGCTTC GAATTGACTC CACAAGTGGG TATGCTTGGG
TCAAACGCTC GATCCGCGCC CCCAATGAAT GGAGACGACA TTCTAGATGT GCCCGACTTC
GAGGCGTTTG GGGATACAAA CACGGGTCCG GAGCTGAGAC AATTTAGCGC ATCGGATCGC
ATGCTCTCCG GAGGAAACCA AAAAACCCGC GGGTTGCAAA TGTTCAGTTT CACCGATCGC
CCAGGAGATG CAACAGTGGG TGATTCCACA ATCACATCCA GAGACCCATC TCTTACCGCT
ATTCCAGAAT CTCCGTCACT GTCATCATGG TCGCCAAAAG ACAATGAAGA TGACGAGGAT
ACCAATGTCT CGGACATATC TCATACCAAT GCCATGCTGC AAGAAGTTGA GCGTTTGTCG
ATGTTTGTTA GACAATACGA AAAGGAAAAG GAAGCTAGAA AAAGCAGCCA ATTTTTAGTC
GATACGTCTA GTACGGGAAA TGCTCCAGTA CAAAGAACAC AGACAAGAGC TTTCACGGAA
ATTGAGAGTT TAAAAGACGT AAGCTTTTCT CCAGGGGACG AAGATTCCAA GCGAAGACTC
GGTATCGGTC AATACAGCGT ACAAGAAAAA ATTCCTGGAC CTCTAGTGGA TGACGATGGC
GAGCCGCGGG GGACCTTGGA GACAGCAAAT GCTTTGCAGT ACCCCAGCCT CGCTGCGGCA
ACTTTGGGAA ATACAGGTAC TCCTTTGGAG CACAGAGACG GATCAATCGA AAAGGGAAGT
CTGCCAGGAC TCCGCACAGC AGTCCAGCAA GAGCGTCGTT TTGGTTTGCC TCGACCTAAC
GTCCGAAGCC GTTCGGGCAG GTTTTCAACT GATAGAACGG GTCGAACACA AGCGGAGAAG
AGACCCTCGC CTTAA
 
Protein sequence
MKRPRLRSVW GVGCLLFVGG SAAEATRRSW TQRQPFRWIV NNPSKLNFTS SPAMGVPSPR 
SKSSTVVPTR VPANSSSRVP TRWPTTRFGG STSAPTGSTT RGLTLRPSMA PMALPTTHGT
LTGPTETPSG LLRTSPMWPS PVPVHSPRRD RSPITTRMLG IPTSTSPVTP TAVSIFPSRP
AGSSSRKPVE ENSNDATGRP SLGPSRTLSI PMAFSTDLQL VQYNFSGPQN IVETLEFAWQ
GYLTAILSRY YRDREGVQFT GVDLDVRQGR QRRFLWQSNI RPTRRLQKLV GNATILSFEA
NGTAVLLVDA STQDANSIVA STNSFLRTAV TMENLQQALV DVNENLVTVS SVSVPNATGV
PEPDRDDGPT TVETVFGLFI AAAAMLGLAY TCRVICQNHK ERQARAKKLM ARPMVLPNAP
PSLAYQPRPM PRQTLVTPNQ SGNNDDTLSI PGIPSTETSD GDRFARELQE AASLDRAVWE
EKQYDNSNGV TAPFTRVPES TGKLQVSSSF PYGDEAVGNF ESMVHQQGGF ELTPQVGMLG
SNARSAPPMN GDDILDVPDF EAFGDTNTGP ELRQFSASDR MLSGGNQKTR GLQMFSFTDR
PGDATVGDST ITSRDPSLTA IPESPSLSSW SPKDNEDDED TNVSDISHTN AMLQEVERLS
MFVRQYEKEK EARKSSQFLV DTSSTGNAPV QRTQTRAFTE IESLKDVSFS PGDEDSKRRL
GIGQYSVQEK IPGPLVDDDG EPRGTLETAN ALQYPSLAAA TLGNTGTPLE HRDGSIEKGS
LPGLRTAVQQ ERRFGLPRPN VRSRSGRFST DRTGRTQAEK RPSP