Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54417 |
Symbol | |
ID | 7200582 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 335998 |
End bp | 341040 |
Gene Length | 5043 bp |
Protein Length | 1455 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | cohesin |
Protein accession | XP_002179627 |
Protein GI | 219117673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.125351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCGATCGAT AGCACCATGG GGAACGTCCG TCGCTCCAAT CGGACTCGCA AGGCCACGTA TACCGTATAC GACGATGCCA AGGCTCAACG CGATTCAATC GATTCCACCG AATCACGGTA CGTCCGAAAC AACGAGAGAT GATTGATACG AGCGTTAACG ATTGCACGGT ACCCTTGTGC CGCATTTTCA TTCCTGATCC TGGTACTGTC GTGCACCGTA CCCTTGTGCT CGTACTCGTC ACTCACATAT CGTGGTGTCC TTGTCTTGAC AAATTGTTAG AAAAAAGAGG CGCTTGACTA TTGACGACGA CTCCGTGGAC GAAGACGCCG AATCCCGGAC GGCACTTTCT GTCGCGGCAG AGCCGGAAGA CGTGTCCGAT AACGACGATA ACGACAACGT AGAGTCCTCC TCGGACGAAG AAGAAGCGGA CGCGGGCCCT CCACAGGAAT CCGGAACGAC GAAATACTCG TCACGTTCCC GTGCTACTTC CTCGCAACCC CGCCAAATCC GAGCCCATCG GGCTGTCGCC AAACCTTCCA CAAAGCGCCG TGGCGGTGCC GCGTCCATGG CAGCGATCAC CACGACGAGT CGTCGAGCAG CACAGACGGT ACTGCAAGGC TTGGCGGGAA AAGTCCTGGA TCCTGTAGTC GAAACGCCCG AGACTTCCTT ACTCGCGGCC TTGCTACAGC AAGCCCACGA CAATCCCACG CACCGTCGTT CCGGGGGGAG CTCAACCCCG ATAGAAACCA ATTTGCAGCG CATCGCGCTC ACGGTCCTAC ACGAACACGA CACTCATCCC AATCGTGCGC AAATTTCTCT CCTCAATCTG CTCTTTCGAT CCGTTGGTGG TGGGGTACAC ACGCTTTTGG ATCCGGAGGA GGTTGATTTG GAAAGTCTCT CGGATGAAGC ATGGGAAGAC CTCGTTACCA AGGTATTGGT AGACATGCAA CAAACACCCG CTGATCTCGT CTTGTTCTGT GCCGATCCCA ACGGAACCGG CGGAAAAAAG GCTTCGGTCG GTGTTCGGGA ATACCGCAAA GTCTATCGGG CCTTTTGGAA CGTCCTCGGG GCGACTGCTC TGACCATGAC GACTCGGACA AAGTCCCTTG CCACCGACGC TGCTCGTGAC CACGACTCGG ACACGGACGA AGAATCCTTC GACGCCAGTG CCCGCTTTCA GGTGGAACTT GTTCGCGACA TGGTCGCCCG CGTGGTCGAA ATTGTCGGGG TTGGTCAACC CGACATTCGG GCCGCCGCCT CCGTCGCCAT TTACAGTCTC TCGATCGCCA TCTTGTCCCA CACCGTCACA CTCCGCACCA AACTCGAAGC CGCGCAAAGA CAGCTCGCGA CGGCCCAACG CAGCAAACAA AAACGTAAGG CACACGCCCT GCAGGCGCAG ATTACGATTT GGACCCGCAC AACGGAAGAT CTCGAAGATA TCGTAAAGGA AACGACCATG GGGGTTTTTC TCAAACGGTA TCGCGACTCC AATCCACACA TCCGAGCCGA ATCCTTACAT GTATTGTCGA GATTTACCCT CACTCGGCCG GATATCTTCC ACAAGGCTAC CTTTCTTAAG TATCTGGGTT GGATGCTCAG CGATAAGGAA GCGGTCGTAC GGGAACGCGC TTTGGATGGT TTGATGGAAC CTTTGTTGGT CGCACCCACC ACCAGCGGTA AACCGTTGTT TTCGAAAATT GACGTTAGCG ACATGCGCTC GGTCGTGGAC AAATTCGCCA CGCGCTTGGC CGATTGCGTT TTGGACGTCG ACACGAATGT CCAAGAAAAA GCCATGAATT TCTTGCTCAA TCTTTCCCGC CAAGGCCTGT TGGATAGTTT GGAAGATGAT CAGGTGTGGG AGCAGATTAA TTTACGGGCA CTGGCGGAAG ATGCCACCCC CATCGTACGT CGGGACGCAC TCTTCTTTGT GACTGAGCAG CTCGAGGCCT TTGATAGCGG ATCGACAAAG TTGGAATCAA ATGTCACGGA ACGCATCCAT GAACTCGTTG TATGGTACGC ACCGTTATTT TTACGTTCCA ATCGGCTCGG GTAGATCAAC ACTTGTTTCT TTTTTGCTAA CCAGACGATG CTTTTTATCA GGGTGGCGCA TAGTTTGGCT GATGGGAATA TTCCTTTGGA GCATATTCGC TTTGATCTGG TCGGGTTTAT AGTTGTCTCT CTACGGGCAT CTCCAGAGCT TAAGCCGATT ATCTGCAATT GGCCCGTGCT TCTGAAAGGC CTACAGCGCG ACAAGTCGAG TACACCCAAA AGCAAACACG ATCGGAGAAT GCTGGCTGTG CAGCAACGCG TCCTTTTAGA AATGTTGATA CGCTCGGTCG AATTGGAAGT GCGAGCGGTA GCCAAGGATG GACTAATGGT ACAGCATGTC GATCCAGATC TTTTGGCTGT TCAAGAGTCG GAAGATACGA ACCTACTACC ATCGCGGAAG AAAAGTAAAG GGGACTCGTC CCACGAGGAG TTGACGGTTG CGTTGCTTCG AGCTCTTCCT GATCTCTTGG ATTTGTTCAA GACTGACTCT TCTGTGTTGG AGTTGCTTAC CGGACTTCCT CCATATTTTT GTAAGTTTCC CAAGTAATTG CCTCTACTAT TTGGTCACAG TACGGAAAAC TTACTCCATT CACCTGATGC AGTGCCGAGT GTTTTTAACC TACCAAACCG GAAGCAAGAC TTTTGCACAT TGATTTCTAA ACTATCAAAA ACCTTTCTTG AGGCGACGGA CAGCAACGTC CTTTTTAACT GCGCCCTGGC ACTTTCATGC CTTGCTAAGG ACGACCATGC GCGAAGTGGT GATGCATTCC TGGAGTTGCA GGCCACCACC ACTGCGATTC AAGTTCGACT TAGCAAGTTG TTTGAAAGGA AAGCTGACAT TTTGACGTCG GACAGTCCCA AAGGGGACAA TCTGATAGAT ACGGAGCACG CAATCGGACT CTGTCTTCGT CGACTTCGCA TCTTATCGAA GCGCTGGGAT ATAGCAGATC TTCTTGTGGA CGGTAAGACG AAGGCAAATT CCGTTGCTGA GTTGGAAAAG CTCTGCATTG ACATTGTACG TGTTGTTGCC AATGATCTTC GTATCCGAGA AGTCAAGAAT ACCGATGAAG TTGACAATAA CAATACACCA GACATTCCGA AAGTGTGGCT AGATCAAGAC AAACGGGTAC ACTCACTTGT GGCAGAGTCA GCATCAGAGG CGTTGTCTTT CCTCCTCAGT GCAACTGCTT GGCGATTGAA GATCGAGGTG GACGATTTGG CAATATCTGC GGAAGCTCCC AAGAAATCCA ACGGGCCAGA GATTGTTGTT AGAATGCGAG ATAGTCTGAT AAAGCTGGTT GTCTTATGCT TTGAGCAATA CGTCGAACTC GACGAGGGAA ATTCTGTTTA TTCCGAAGAG CATTTCGCTT TTGCAGAAAA AGTACAGAGT CATGCTGGTA CCATTGCTGG AGACCTCCGC TCCCTGTTTC CCAAGCAATG GAGCGCTGCA GTATCACCGA AACTTCGATC GTTTGCATTT ACGGAAGACG GCCACGTTGT CGGTGGTTTT GTTCGTTTTT TGAAGTCTCA AGAACACCGG GTACGGATAC TGATAGTGTT ATCAGCATGC GTGTTTTTAA CTGCTGCCTT GCTCATCTTA CGAAAATGTT CTTTTTTTTT CGCACAGCTT CGAGCCAACG AGAAAATGAA CGTTGAAGAT CGCTTTGCAA TAGAACAGCT GTTGCTTCCT ATTGCTCGTG GTCTTTCAGC TAATTGGAAG GACGGTATCC GTCGAGAGGC TGGTGCTGTT CTGTCTCATA TTACGGGAAG TGGACGAATC GCCCGCTGTA CTGTCTCGTC TTTGTCACGG GTTCTGAAGA GAATTGAGCC AGGTAAGCTT GCTTAAGAGG TTTTTCTGCT GATTTTCATT ACAGCAAGCT TACTCACTTT CTTATGAAAT AGTTCGATTT CTGGAAGCCC ACATGGCCTG TCTTCGCCAA GACTTCGACG ACTGGGCGGC GTCGGAACCC GAGGAATTGG AGAGTGACCA TCCAACCGAA AAAGAAATGG TCGCATACGA TGAAAAAGAA AAGGAACATG CTGCTAAGGT AGGTCTTTTC ACGAAGTGGT TTTTTCGTTC TGCACTACTG CGCACACAAT TGCTCATAGA TTCTACTCTG GTTTAGTTCG ACATGATTGA ACAACAAGCT CAACGGCTCT CAGCATCTTT GGGCGTTGGC AAGCTTAGAG AAAAGTCGTT AGGTCCCGCT CTACTCGGAT TCGTTAGAGA AGGCGCTCGG GTTGCCTTTT CCACGGATGT GCCCGGATAC GAGGAGGAAC TTCCTTTGGG AGCTCGCCTC CCTTTTTTGC GTATTGTGAG CAAGTAAGTC GTGCTGATTC GTGAAATGAA TGTACGAGTC TTCAAAGGCC GCTGCTTACG GTTTACTTTA TCCAGGTATC TCAATTGGAT TCGTCGGGAT GAAAACCAGC TTCAAACGCT CAGGCAGGAT TTCAATGAAA TGGAGAAGAA GCTACGAAGC GAATCTGAGT ACAATGATAT ATACGAAGAT GATCTAGCGA CTATTGAAGA ATTTCGCCTC GCCGGCGATC TCGGTAAATA TCCCTTCAAC AAAAATGCCA AAACAGACGC GATGGATGAC GGCTCCTTTT CTGCGGAAAG TCTCGACTCG CGAACCCGCC CACGCATGTC AATAACAAGC AATATCAGCA GTATCAGGTC CAAAATGTCT GCGACTCAAG CGTCTCTTTC CCCTCTGTAT GAAGAAGGCG ACGGGGATCG AGATTCTGAC GATGTCGGGG ATGATGCTCA TGGGTCCACC AATGACTACG CTTCAACCCA CGCCTCAACC CACGCCTCTA ACCGTTTCGA GTCCGAGTCT GTCAGCACAC GCTCTTCGTT GACCTCTCAT CCATAGCTGT TCGTTGGTGG AACAACTCAT GTGTGACGAC GATAAGGAAG ACCTTTATGT CAAGAAGCAC TTGAAAGAGA GGACTTGAGC TCAAATATCA TTCAAAATCA GTTCCGTTTT TAC
|
Protein sequence | MGNVRRSNRT RKATYTVYDD AKAQRDSIDS TESRKKRRLT IDDDSVDEDA ESRTALSVAA EPEDVSDNDD NDNVESSSDE EEADAGPPQE SGTTKYSSRS RATSSQPRQI RAHRAVAKPS TKRRGGAASM AAITTTSRRA AQTVLQGLAG KVLDPVVETP ETSLLAALLQ QAHDNPTHRR SGGSSTPIET NLQRIALTVL HEHDTHPNRA QISLLNLLFR SVGGGVHTLL DPEEVDLESL SDEAWEDLVT KVLVDMQQTP ADLVLFCADP NGTGGKKASV GVREYRKVYR AFWNVLGATA LTMTTRTKSL ATDAARDHDS DTDEESFDAS ARFQVELVRD MVARVVEIVG VGQPDIRAAA SVAIYSLSIA ILSHTVTLRT KLEAAQRQLA TAQRSKQKRK AHALQAQITI WTRTTEDLED IVKETTMGVF LKRYRDSNPH IRAESLHVLS RFTLTRPDIF HKATFLKYLG WMLSDKEAVV RERALDGLME PLLVAPTTSG KPLFSKIDVS DMRSVVDKFA TRLADCVLDV DTNVQEKAMN FLLNLSRQGL LDSLEDDQVW EQINLRALAE DATPIVRRDA LFFVTEQLEA FDSGSTKLES NVTERIHELV VWVAHSLADG NIPLEHIRFD LVGFIVVSLR ASPELKPIIC NWPVLLKGLQ RDKSSTPKSK HDRRMLAVQQ RVLLEMLIRS VELEVRAVAK DGLMVQHVDP DLLAVQESED TNLLPSRKKS KGDSSHEELT VALLRALPDL LDLFKTDSSV LELLTGLPPY FLPSVFNLPN RKQDFCTLIS KLSKTFLEAT DSNVLFNCAL ALSCLAKDDH ARSGDAFLEL QATTTAIQVR LSKLFERKAD ILTSDSPKGD NLIDTEHAIG LCLRRLRILS KRWDIADLLV DGKTKANSVA ELEKLCIDIV RVVANDLRIR EVKNTDEVDN NNTPDIPKVW LDQDKRVHSL VAESASEALS FLLSATAWRL KIEVDDLAIS AEAPKKSNGP EIVVRMRDSL IKLVVLCFEQ YVELDEGNSV YSEEHFAFAE KVQSHAGTIA GDLRSLFPKQ WSAAVSPKLR SFAFTEDGHV VGGFVRFLKS QEHRVRILIV LSACVFLTAA LLILRKCSFF FAQLRANEKM NVEDRFAIEQ LLLPIARGLS ANWKDGIRRE AGAVLSHITG SGRIARCTVS SLSRVLKRIE PVRFLEAHMA CLRQDFDDWA ASEPEELESD HPTEKEMVAY DEKEKEHAAK FDMIEQQAQR LSASLGVGKL REKSLGPALL GFVREGARVA FSTDVPGYEE ELPLGARLPF LRIVSKYLNW IRRDENQLQT LRQDFNEMEK KLRSESEYND IYEDDLATIE EFRLAGDLGK YPFNKNAKTD AMDDGSFSAE SLDSRTRPRM SITSNISSIR SKMSATQASL SPLYEEGDGD RDSDDVGDDA HGSTNDYAST HASTHASNRF ESESVSTRSS LTSHP
|
| |