Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38840 |
Symbol | TOP6B |
ID | 7203590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 323515 |
End bp | 325563 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | type II DNA topoisomerase 6 subunit |
Protein accession | XP_002182945 |
Protein GI | 219125348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.833809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA CCAACGGCGT GGCCTCCAAG GCTTCGGCCG GCGAAGTTCA GGTACAGAAA AGCCCGGCCG AGTTTTTTGC CGAGAATCAG GCCATTGCTG GCTTTGACAA TTTGGGAAAA TCTTTGTACA CGAGCTTACG CGAGCTCGTC GAGAACAGTC TGGATGCGTG TGAAAGCATC CATGAATTAC CCGAGATCTC GATAGAAATC AAAGAATACA CGCAGGAAGA GTTTAACGCA CTAGATGTTA TGAAGACCAG TCCACGCAAA AAACGAGATA TGCAGCTCTT TGAAACGAAG AAAAAGGATA AGAAACCTCC GAAAGAGGCA ACCGATGTCG ACGAAATCAC CACCGTCGAA AATTCAGGGA AAAAACGTAA GAAAGCGCCA CAAGACGCCT ATTTCAAGCT CACCGTCAAA GACAATGGTT GCGGTATGTC CCACTCGGCT ATTCCAAACC TATTGGGTCG AGTCCTCAGT GGAAGCAAAT ACGGTGTGCG GCAGACTCGG GGAAAGTTTG GGCTAGGGGC AAAAATGGCC TTGATCTGGG CCAAAAAGAG TACGGGACAA CCTATTCGGA TCATGACGTC GCATCGGCCA GATGGGGGCG TCGCACCAGA ACGAGCGTCC GCATGTGTGC TCGATATCGA CATATATAAG AACGCTCCCC GAATTTTAGA ACACACAACT CGCAAAAATA CGGACGGCTG GATGGGTACT GAAATGTCCG TGCTGATTGC GGGAAATTGG ACAACCTACA AGTCTCGAAT TGTCCAATAT CTGCAGCAGC TTGCTATCAT AACTCCCTAC GCCGTGTTGG AGCTTGCCTA CATCAATATC TCTGATTCAA AACGCAATCT ACATGTGCGG TACGACCGCC GTTCAGACCA AATGCCACCA CCGGCAAAAA CTATCAAACA TCATCCCGCA TCGGTCAACA ATCTAGTCAT TCAACAACTC ATACAATTCA CCAAAACAAG AACCCTTTTT AAATTTCTGT CCACCGAACT GTCGACCGTC AGTCCACCGT TGGCCCGTCG ACTAGTTACC GAGCTCGGCT TTGACGAAAG CATGCCTCCT TCGTCGCTTG AAGACAAAGA AATTACTCGG CTTGTTCAGC TACTTCGCCA AGTCCAGCTC TTCAAAGCTC CTGATGGGTC CTGTCTAAGT CCACTTGGTG AATATAATCT GAATCTCGGC ATTCGCAAGG TTCTGGAACC TGATCTGATT GCCACTTCCC GTGATAAGCC AGGGGCCTAC GAAGGCCACC CTTTTCTGGT AGAAGCAGCA GTTTCCTTAG GTGGAAAAGA GGTCAAGGAA GGGATTACGG TGATTCGATT CGCGAACAGG ATACCGTTGC TCTTTGAAGG AGGAGCCGAT GTTGCAACTA GGGTTGCGAA TACCAAAATC CGATGGTCCA ACTACAAGAT GGATTACAAA CGCGATCGGA TTGGCGTTTT TGTGTCCATT GTATCAACCA AAGTTCCCTT CAAGGGAACT TCAAAGGAAT ACATCGGTGA TGACGCGACG GAAATTCAAC AATCAGTGAA GCGCGCGTTA CAATCCTGTT GTCAACAATT ACGAGGTTAC TTGGCAAAGC GATCCGCGTT GAAAGATGCA CAGACACGGA AATCTCGAAT GGCTAAATAC GTTCCTGATG TGGGACGATC CTTGTTCGGT ATCCTAGATA GCATGCGAGC GCGCCAGGCG GAACTTTCGC TGCCAGAAGC ACCGCCTAGC CAATCCCCAA CTAAGCGCTT GCGCTTGGAT CGGAGCGCGG CTCAGCTAAT GATCGAAAGG CTGAATAGAG GGGAGGTGAC GGAGCAATCA CTGGCAGCAA AACTGACGGA AAGCATTGAT GATCAACTAA ATGTTCAAGA GGAAGGAACT GACGATAAAG GAAGCGCGCC AACACAGGAA GCTCAACCCC TCTACTTGGT GCCGTTGTAC AATCTCGACG ATGATTCCAA CGACATTTCA CATCCCCTGT TTACATTCCG GCCCATAATG CCGATTCTGC GAATGCCTGC AATGCAAGTT CAAGAATAA
|
Protein sequence | MAKTNGVASK ASAGEVQVQK SPAEFFAENQ AIAGFDNLGK SLYTSLRELV ENSLDACESI HELPEISIEI KEYTQEEFNA LDVMKTSPRK KRDMQLFETK KKDKKPPKEA TDVDEITTVE NSGKKRKKAP QDAYFKLTVK DNGCGMSHSA IPNLLGRVLS GSKYGVRQTR GKFGLGAKMA LIWAKKSTGQ PIRIMTSHRP DGGVAPERAS ACVLDIDIYK NAPRILEHTT RKNTDGWMGT EMSVLIAGNW TTYKSRIVQY LQQLAIITPY AVLELAYINI SDSKRNLHVR YDRRSDQMPP PAKTIKHHPA SVNNLVIQQL IQFTKTRTLF KFLSTELSTV SPPLARRLVT ELGFDESMPP SSLEDKEITR LVQLLRQVQL FKAPDGSCLS PLGEYNLNLG IRKVLEPDLI ATSRDKPGAY EGHPFLVEAA VSLGGKEVKE GITVIRFANR IPLLFEGGAD VATRVANTKI RWSNYKMDYK RDRIGVFVSI VSTKVPFKGT SKEYIGDDAT EIQQSVKRAL QSCCQQLRGY LAKRSALKDA QTRKSRMAKY VPDVGRSLFG ILDSMRARQA ELSLPEAPPS QSPTKRLRLD RSAAQLMIER LNRGEVTEQS LAAKLTESID DQLNVQEEGT DDKGSAPTQE AQPLYLVPLY NLDDDSNDIS HPLFTFRPIM PILRMPAMQV QE
|
| |