Gene PHATRDRAFT_39557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39557 
Symbol 
ID7195234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp121910 
End bp123541 
Gene Length1632 bp 
Protein Length543 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183554 
Protein GI219126627 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCGT CGACCCCGTT CGCGTCCGTC GCACGGTCGC ACCACGCCCG ACGTCGGAGT 
GATCCGTCTC GTCGTTCCGG ACTGCACTGC TTAAGGCTGG TACTTCCGCT GGCTATCCTC
ACACTCGGCA GCTGCTCCTT GTGGATTCTG CTCGCGGACT CCAAGTCAAT CAACAGCAAT
CATCCCGATC CTTCCGAAAT CATTGCTCTC GCAGCCACGT TGGAAACCAC GCAACAACGC
CACACGCGTG TAGTCACCTC CGGAGTAACG GCTACTCCCA ATCTTCTACC CCCTTCCCTA
CCCGGGACCT TGGCCGACTT TCGTCACCAA CCCCGGGTGC CCTTGGAACC TCCCGTTCGA
CTCTGGAACC AATCCAAACA TGATTACGAT CTGGTACACG TCATTTTGAC ACGCTTCCAA
CAACACCAGG CCGATTTGGA GCATTTGGGA CGCGCTCGTT GGGAGCTCTT TCGGACCTTT
TGTGCACCGT CCCTGCGCGC ACAGACCAAC CAACAATTCC TTTGGATTCT ACGGGTCGAT
CCCGATCTCT CACCAGCATT GAAACGAGAT TTGTTACGCA CGGTGGATGG CATGGACAAC
GTACTCGTTG TGGCATCGAA CGGATCACAG GAAGATGGTC TCCGCAATCC ACACGGCAAT
CGCGATATTA CCCAACCCAA CGTGAGCAAC AACAGTAGTA ACAACACTAC TGGTAGTACT
TCCGTTTGGT ACGGCAGTGT GGAAACCTTT CGATCCTACC AGGAAGCGAG TCAAACACGC
ATGGTATTGG AAACCAATCT GGATGCCGAT GACGGACTGG CAGTGTCCTT TGTGGAAACG
CTCCAACGTC AAGCGGACGC CACTTTTCAC ACGACGAGTC ACACCGGGAC CACGGTGGAG
AGCGACACCG GTAGCTCGTT CGACCCAAAC ATTGCGTGGC GCATATATTG CGTGAATCAT
CACGTCACCT GGCAATTCTG GGCACCGTGG AGGAAGACCA ACGACGATAC CAACAACGAT
AGGAGTACCG TGACGGAACG AGGCAGTCTA GAAGCCCAAC ACGACTTGGA CATTTGTGTG
ACACCAGGTC TAACTTGGGC TTCTCGACCA CGCACCCCTC AAACCTTTAA GTACATGCGT
TGGCATTGGC GTATTCGTGG GACTCTACCC CGGTGTTCCG ACGACCATGA AAACAACGAA
AACAACAAAA ACAACCGTAC CACGGCACTT TCGGGATGCT GGTCCTACGT GCGTCCGGTC
CAAACGGTAG AAGGGACGTC CAGCGTCTCG CAATCCGTTC TCTCCTCAGT CGACTTTTCA
CCCTTGGCCA TCCGGGCACG GACTCCTACC AGCGCCGGTA TGAATGACGT TGCAACGGTT
GGCGGGACCG TCTCCACATC AACCGCTCGA GCCAAATTGC AACGCCAACA ACAACAAGAT
GACTTGCTTT GGCGAGACAA CGTGGCCGAA ACGATTTTTG GTGGAAACAC TACGACTATC
GTTGAGGCTC GAGAAAACAT GCAGACCCAC TTGGTAGACA TTGTGCAAGA CGCCCTGCAA
GGTCAATGCA CCAAGGGACA TTCGTGTCAC AACAAGAGCA AACTTGCTCT GACCCGATTG
TTGGAACAGT GA
 
Protein sequence
MAASTPFASV ARSHHARRRS DPSRRSGLHC LRLVLPLAIL TLGSCSLWIL LADSKSINSN 
HPDPSEIIAL AATLETTQQR HTRVVTSGVT ATPNLLPPSL PGTLADFRHQ PRVPLEPPVR
LWNQSKHDYD LVHVILTRFQ QHQADLEHLG RARWELFRTF CAPSLRAQTN QQFLWILRVD
PDLSPALKRD LLRTVDGMDN VLVVASNGSQ EDGLRNPHGN RDITQPNVSN NSSNNTTGST
SVWYGSVETF RSYQEASQTR MVLETNLDAD DGLAVSFVET LQRQADATFH TTSHTGTTVE
SDTGSSFDPN IAWRIYCVNH HVTWQFWAPW RKTNDDTNND RSTVTERGSL EAQHDLDICV
TPGLTWASRP RTPQTFKYMR WHWRIRGTLP RCSDDHENNE NNKNNRTTAL SGCWSYVRPV
QTVEGTSSVS QSVLSSVDFS PLAIRARTPT SAGMNDVATV GGTVSTSTAR AKLQRQQQQD
DLLWRDNVAE TIFGGNTTTI VEARENMQTH LVDIVQDALQ GQCTKGHSCH NKSKLALTRL
LEQ