Gene PHATRDRAFT_38840 details


Gene Information

Locus tag: PHATRDRAFT_38840
Symbol: TOP6B
ID: 7203590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 323515
End bp: 325563
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table:
GC content: 49%
IMG OID:
Product: type II DNA topoisomerase 6 subunit
Protein accession: XP_002182945
Protein GI: 219125348
COG category:
COG ID:
TIGRFAM ID:
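
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the CDS length includes the terminal stop codon. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of an unspliced CDS record.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    # Inclusive span covered by the gene on the replicon.
    span = end_bp - start_bp + 1
    # Number of complete codons and any leftover bases.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # The final codon is the stop codon and is not translated.
        "protein_matches_cds": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields above.
result = check_gene_record(323515, 325563, 2049, 682)
print(result)
```

With the values from this record, 325563 - 323515 + 1 = 2049 bp, and 2049 / 3 = 683 codons, of which 682 encode amino acids and one is the stop codon.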


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.833809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA CCAACGGCGT GGCCTCCAAG GCTTCGGCCG GCGAAGTTCA GGTACAGAAA 
AGCCCGGCCG AGTTTTTTGC CGAGAATCAG GCCATTGCTG GCTTTGACAA TTTGGGAAAA
TCTTTGTACA CGAGCTTACG CGAGCTCGTC GAGAACAGTC TGGATGCGTG TGAAAGCATC
CATGAATTAC CCGAGATCTC GATAGAAATC AAAGAATACA CGCAGGAAGA GTTTAACGCA
CTAGATGTTA TGAAGACCAG TCCACGCAAA AAACGAGATA TGCAGCTCTT TGAAACGAAG
AAAAAGGATA AGAAACCTCC GAAAGAGGCA ACCGATGTCG ACGAAATCAC CACCGTCGAA
AATTCAGGGA AAAAACGTAA GAAAGCGCCA CAAGACGCCT ATTTCAAGCT CACCGTCAAA
GACAATGGTT GCGGTATGTC CCACTCGGCT ATTCCAAACC TATTGGGTCG AGTCCTCAGT
GGAAGCAAAT ACGGTGTGCG GCAGACTCGG GGAAAGTTTG GGCTAGGGGC AAAAATGGCC
TTGATCTGGG CCAAAAAGAG TACGGGACAA CCTATTCGGA TCATGACGTC GCATCGGCCA
GATGGGGGCG TCGCACCAGA ACGAGCGTCC GCATGTGTGC TCGATATCGA CATATATAAG
AACGCTCCCC GAATTTTAGA ACACACAACT CGCAAAAATA CGGACGGCTG GATGGGTACT
GAAATGTCCG TGCTGATTGC GGGAAATTGG ACAACCTACA AGTCTCGAAT TGTCCAATAT
CTGCAGCAGC TTGCTATCAT AACTCCCTAC GCCGTGTTGG AGCTTGCCTA CATCAATATC
TCTGATTCAA AACGCAATCT ACATGTGCGG TACGACCGCC GTTCAGACCA AATGCCACCA
CCGGCAAAAA CTATCAAACA TCATCCCGCA TCGGTCAACA ATCTAGTCAT TCAACAACTC
ATACAATTCA CCAAAACAAG AACCCTTTTT AAATTTCTGT CCACCGAACT GTCGACCGTC
AGTCCACCGT TGGCCCGTCG ACTAGTTACC GAGCTCGGCT TTGACGAAAG CATGCCTCCT
TCGTCGCTTG AAGACAAAGA AATTACTCGG CTTGTTCAGC TACTTCGCCA AGTCCAGCTC
TTCAAAGCTC CTGATGGGTC CTGTCTAAGT CCACTTGGTG AATATAATCT GAATCTCGGC
ATTCGCAAGG TTCTGGAACC TGATCTGATT GCCACTTCCC GTGATAAGCC AGGGGCCTAC
GAAGGCCACC CTTTTCTGGT AGAAGCAGCA GTTTCCTTAG GTGGAAAAGA GGTCAAGGAA
GGGATTACGG TGATTCGATT CGCGAACAGG ATACCGTTGC TCTTTGAAGG AGGAGCCGAT
GTTGCAACTA GGGTTGCGAA TACCAAAATC CGATGGTCCA ACTACAAGAT GGATTACAAA
CGCGATCGGA TTGGCGTTTT TGTGTCCATT GTATCAACCA AAGTTCCCTT CAAGGGAACT
TCAAAGGAAT ACATCGGTGA TGACGCGACG GAAATTCAAC AATCAGTGAA GCGCGCGTTA
CAATCCTGTT GTCAACAATT ACGAGGTTAC TTGGCAAAGC GATCCGCGTT GAAAGATGCA
CAGACACGGA AATCTCGAAT GGCTAAATAC GTTCCTGATG TGGGACGATC CTTGTTCGGT
ATCCTAGATA GCATGCGAGC GCGCCAGGCG GAACTTTCGC TGCCAGAAGC ACCGCCTAGC
CAATCCCCAA CTAAGCGCTT GCGCTTGGAT CGGAGCGCGG CTCAGCTAAT GATCGAAAGG
CTGAATAGAG GGGAGGTGAC GGAGCAATCA CTGGCAGCAA AACTGACGGA AAGCATTGAT
GATCAACTAA ATGTTCAAGA GGAAGGAACT GACGATAAAG GAAGCGCGCC AACACAGGAA
GCTCAACCCC TCTACTTGGT GCCGTTGTAC AATCTCGACG ATGATTCCAA CGACATTTCA
CATCCCCTGT TTACATTCCG GCCCATAATG CCGATTCTGC GAATGCCTGC AATGCAAGTT
CAAGAATAA
 
Protein sequence
MAKTNGVASK ASAGEVQVQK SPAEFFAENQ AIAGFDNLGK SLYTSLRELV ENSLDACESI 
HELPEISIEI KEYTQEEFNA LDVMKTSPRK KRDMQLFETK KKDKKPPKEA TDVDEITTVE
NSGKKRKKAP QDAYFKLTVK DNGCGMSHSA IPNLLGRVLS GSKYGVRQTR GKFGLGAKMA
LIWAKKSTGQ PIRIMTSHRP DGGVAPERAS ACVLDIDIYK NAPRILEHTT RKNTDGWMGT
EMSVLIAGNW TTYKSRIVQY LQQLAIITPY AVLELAYINI SDSKRNLHVR YDRRSDQMPP
PAKTIKHHPA SVNNLVIQQL IQFTKTRTLF KFLSTELSTV SPPLARRLVT ELGFDESMPP
SSLEDKEITR LVQLLRQVQL FKAPDGSCLS PLGEYNLNLG IRKVLEPDLI ATSRDKPGAY
EGHPFLVEAA VSLGGKEVKE GITVIRFANR IPLLFEGGAD VATRVANTKI RWSNYKMDYK
RDRIGVFVSI VSTKVPFKGT SKEYIGDDAT EIQQSVKRAL QSCCQQLRGY LAKRSALKDA
QTRKSRMAKY VPDVGRSLFG ILDSMRARQA ELSLPEAPPS QSPTKRLRLD RSAAQLMIER
LNRGEVTEQS LAAKLTESID DQLNVQEEGT DDKGSAPTQE AQPLYLVPLY NLDDDSNDIS
HPLFTFRPIM PILRMPAMQV QE
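
Both sequences above are printed in blocks of 10 residues, six blocks per line, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. It is an illustration only: the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input.

```python
def join_blocks(text):
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(text.split())


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(1 for base in dna if base in "GC") / len(dna)


# First line of the gene sequence listed above, copied verbatim.
first_line = "ATGGCAAAAA CCAACGGCGT GGCCTCCAAG GCTTCGGCCG GCGAAGTTCA GGTACAGAAA"
seq = join_blocks(first_line)

print(len(seq))                    # number of bases on this line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Applying `join_blocks` to the full gene sequence and taking `len` of the result gives a value that can be compared with the Gene Length field. The same function applied to the protein sequence can be compared with the Protein Length field.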