Gene PHATRDRAFT_54417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54417 
Symbol 
ID7200582 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp335998 
End bp341040 
Gene Length5043 bp 
Protein Length1455 aa 
Translation table 
GC content50% 
IMG OID 
Productcohesin 
Protein accessionXP_002179627 
Protein GI219117673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.125351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCGATCGAT AGCACCATGG GGAACGTCCG TCGCTCCAAT CGGACTCGCA AGGCCACGTA 
TACCGTATAC GACGATGCCA AGGCTCAACG CGATTCAATC GATTCCACCG AATCACGGTA
CGTCCGAAAC AACGAGAGAT GATTGATACG AGCGTTAACG ATTGCACGGT ACCCTTGTGC
CGCATTTTCA TTCCTGATCC TGGTACTGTC GTGCACCGTA CCCTTGTGCT CGTACTCGTC
ACTCACATAT CGTGGTGTCC TTGTCTTGAC AAATTGTTAG AAAAAAGAGG CGCTTGACTA
TTGACGACGA CTCCGTGGAC GAAGACGCCG AATCCCGGAC GGCACTTTCT GTCGCGGCAG
AGCCGGAAGA CGTGTCCGAT AACGACGATA ACGACAACGT AGAGTCCTCC TCGGACGAAG
AAGAAGCGGA CGCGGGCCCT CCACAGGAAT CCGGAACGAC GAAATACTCG TCACGTTCCC
GTGCTACTTC CTCGCAACCC CGCCAAATCC GAGCCCATCG GGCTGTCGCC AAACCTTCCA
CAAAGCGCCG TGGCGGTGCC GCGTCCATGG CAGCGATCAC CACGACGAGT CGTCGAGCAG
CACAGACGGT ACTGCAAGGC TTGGCGGGAA AAGTCCTGGA TCCTGTAGTC GAAACGCCCG
AGACTTCCTT ACTCGCGGCC TTGCTACAGC AAGCCCACGA CAATCCCACG CACCGTCGTT
CCGGGGGGAG CTCAACCCCG ATAGAAACCA ATTTGCAGCG CATCGCGCTC ACGGTCCTAC
ACGAACACGA CACTCATCCC AATCGTGCGC AAATTTCTCT CCTCAATCTG CTCTTTCGAT
CCGTTGGTGG TGGGGTACAC ACGCTTTTGG ATCCGGAGGA GGTTGATTTG GAAAGTCTCT
CGGATGAAGC ATGGGAAGAC CTCGTTACCA AGGTATTGGT AGACATGCAA CAAACACCCG
CTGATCTCGT CTTGTTCTGT GCCGATCCCA ACGGAACCGG CGGAAAAAAG GCTTCGGTCG
GTGTTCGGGA ATACCGCAAA GTCTATCGGG CCTTTTGGAA CGTCCTCGGG GCGACTGCTC
TGACCATGAC GACTCGGACA AAGTCCCTTG CCACCGACGC TGCTCGTGAC CACGACTCGG
ACACGGACGA AGAATCCTTC GACGCCAGTG CCCGCTTTCA GGTGGAACTT GTTCGCGACA
TGGTCGCCCG CGTGGTCGAA ATTGTCGGGG TTGGTCAACC CGACATTCGG GCCGCCGCCT
CCGTCGCCAT TTACAGTCTC TCGATCGCCA TCTTGTCCCA CACCGTCACA CTCCGCACCA
AACTCGAAGC CGCGCAAAGA CAGCTCGCGA CGGCCCAACG CAGCAAACAA AAACGTAAGG
CACACGCCCT GCAGGCGCAG ATTACGATTT GGACCCGCAC AACGGAAGAT CTCGAAGATA
TCGTAAAGGA AACGACCATG GGGGTTTTTC TCAAACGGTA TCGCGACTCC AATCCACACA
TCCGAGCCGA ATCCTTACAT GTATTGTCGA GATTTACCCT CACTCGGCCG GATATCTTCC
ACAAGGCTAC CTTTCTTAAG TATCTGGGTT GGATGCTCAG CGATAAGGAA GCGGTCGTAC
GGGAACGCGC TTTGGATGGT TTGATGGAAC CTTTGTTGGT CGCACCCACC ACCAGCGGTA
AACCGTTGTT TTCGAAAATT GACGTTAGCG ACATGCGCTC GGTCGTGGAC AAATTCGCCA
CGCGCTTGGC CGATTGCGTT TTGGACGTCG ACACGAATGT CCAAGAAAAA GCCATGAATT
TCTTGCTCAA TCTTTCCCGC CAAGGCCTGT TGGATAGTTT GGAAGATGAT CAGGTGTGGG
AGCAGATTAA TTTACGGGCA CTGGCGGAAG ATGCCACCCC CATCGTACGT CGGGACGCAC
TCTTCTTTGT GACTGAGCAG CTCGAGGCCT TTGATAGCGG ATCGACAAAG TTGGAATCAA
ATGTCACGGA ACGCATCCAT GAACTCGTTG TATGGTACGC ACCGTTATTT TTACGTTCCA
ATCGGCTCGG GTAGATCAAC ACTTGTTTCT TTTTTGCTAA CCAGACGATG CTTTTTATCA
GGGTGGCGCA TAGTTTGGCT GATGGGAATA TTCCTTTGGA GCATATTCGC TTTGATCTGG
TCGGGTTTAT AGTTGTCTCT CTACGGGCAT CTCCAGAGCT TAAGCCGATT ATCTGCAATT
GGCCCGTGCT TCTGAAAGGC CTACAGCGCG ACAAGTCGAG TACACCCAAA AGCAAACACG
ATCGGAGAAT GCTGGCTGTG CAGCAACGCG TCCTTTTAGA AATGTTGATA CGCTCGGTCG
AATTGGAAGT GCGAGCGGTA GCCAAGGATG GACTAATGGT ACAGCATGTC GATCCAGATC
TTTTGGCTGT TCAAGAGTCG GAAGATACGA ACCTACTACC ATCGCGGAAG AAAAGTAAAG
GGGACTCGTC CCACGAGGAG TTGACGGTTG CGTTGCTTCG AGCTCTTCCT GATCTCTTGG
ATTTGTTCAA GACTGACTCT TCTGTGTTGG AGTTGCTTAC CGGACTTCCT CCATATTTTT
GTAAGTTTCC CAAGTAATTG CCTCTACTAT TTGGTCACAG TACGGAAAAC TTACTCCATT
CACCTGATGC AGTGCCGAGT GTTTTTAACC TACCAAACCG GAAGCAAGAC TTTTGCACAT
TGATTTCTAA ACTATCAAAA ACCTTTCTTG AGGCGACGGA CAGCAACGTC CTTTTTAACT
GCGCCCTGGC ACTTTCATGC CTTGCTAAGG ACGACCATGC GCGAAGTGGT GATGCATTCC
TGGAGTTGCA GGCCACCACC ACTGCGATTC AAGTTCGACT TAGCAAGTTG TTTGAAAGGA
AAGCTGACAT TTTGACGTCG GACAGTCCCA AAGGGGACAA TCTGATAGAT ACGGAGCACG
CAATCGGACT CTGTCTTCGT CGACTTCGCA TCTTATCGAA GCGCTGGGAT ATAGCAGATC
TTCTTGTGGA CGGTAAGACG AAGGCAAATT CCGTTGCTGA GTTGGAAAAG CTCTGCATTG
ACATTGTACG TGTTGTTGCC AATGATCTTC GTATCCGAGA AGTCAAGAAT ACCGATGAAG
TTGACAATAA CAATACACCA GACATTCCGA AAGTGTGGCT AGATCAAGAC AAACGGGTAC
ACTCACTTGT GGCAGAGTCA GCATCAGAGG CGTTGTCTTT CCTCCTCAGT GCAACTGCTT
GGCGATTGAA GATCGAGGTG GACGATTTGG CAATATCTGC GGAAGCTCCC AAGAAATCCA
ACGGGCCAGA GATTGTTGTT AGAATGCGAG ATAGTCTGAT AAAGCTGGTT GTCTTATGCT
TTGAGCAATA CGTCGAACTC GACGAGGGAA ATTCTGTTTA TTCCGAAGAG CATTTCGCTT
TTGCAGAAAA AGTACAGAGT CATGCTGGTA CCATTGCTGG AGACCTCCGC TCCCTGTTTC
CCAAGCAATG GAGCGCTGCA GTATCACCGA AACTTCGATC GTTTGCATTT ACGGAAGACG
GCCACGTTGT CGGTGGTTTT GTTCGTTTTT TGAAGTCTCA AGAACACCGG GTACGGATAC
TGATAGTGTT ATCAGCATGC GTGTTTTTAA CTGCTGCCTT GCTCATCTTA CGAAAATGTT
CTTTTTTTTT CGCACAGCTT CGAGCCAACG AGAAAATGAA CGTTGAAGAT CGCTTTGCAA
TAGAACAGCT GTTGCTTCCT ATTGCTCGTG GTCTTTCAGC TAATTGGAAG GACGGTATCC
GTCGAGAGGC TGGTGCTGTT CTGTCTCATA TTACGGGAAG TGGACGAATC GCCCGCTGTA
CTGTCTCGTC TTTGTCACGG GTTCTGAAGA GAATTGAGCC AGGTAAGCTT GCTTAAGAGG
TTTTTCTGCT GATTTTCATT ACAGCAAGCT TACTCACTTT CTTATGAAAT AGTTCGATTT
CTGGAAGCCC ACATGGCCTG TCTTCGCCAA GACTTCGACG ACTGGGCGGC GTCGGAACCC
GAGGAATTGG AGAGTGACCA TCCAACCGAA AAAGAAATGG TCGCATACGA TGAAAAAGAA
AAGGAACATG CTGCTAAGGT AGGTCTTTTC ACGAAGTGGT TTTTTCGTTC TGCACTACTG
CGCACACAAT TGCTCATAGA TTCTACTCTG GTTTAGTTCG ACATGATTGA ACAACAAGCT
CAACGGCTCT CAGCATCTTT GGGCGTTGGC AAGCTTAGAG AAAAGTCGTT AGGTCCCGCT
CTACTCGGAT TCGTTAGAGA AGGCGCTCGG GTTGCCTTTT CCACGGATGT GCCCGGATAC
GAGGAGGAAC TTCCTTTGGG AGCTCGCCTC CCTTTTTTGC GTATTGTGAG CAAGTAAGTC
GTGCTGATTC GTGAAATGAA TGTACGAGTC TTCAAAGGCC GCTGCTTACG GTTTACTTTA
TCCAGGTATC TCAATTGGAT TCGTCGGGAT GAAAACCAGC TTCAAACGCT CAGGCAGGAT
TTCAATGAAA TGGAGAAGAA GCTACGAAGC GAATCTGAGT ACAATGATAT ATACGAAGAT
GATCTAGCGA CTATTGAAGA ATTTCGCCTC GCCGGCGATC TCGGTAAATA TCCCTTCAAC
AAAAATGCCA AAACAGACGC GATGGATGAC GGCTCCTTTT CTGCGGAAAG TCTCGACTCG
CGAACCCGCC CACGCATGTC AATAACAAGC AATATCAGCA GTATCAGGTC CAAAATGTCT
GCGACTCAAG CGTCTCTTTC CCCTCTGTAT GAAGAAGGCG ACGGGGATCG AGATTCTGAC
GATGTCGGGG ATGATGCTCA TGGGTCCACC AATGACTACG CTTCAACCCA CGCCTCAACC
CACGCCTCTA ACCGTTTCGA GTCCGAGTCT GTCAGCACAC GCTCTTCGTT GACCTCTCAT
CCATAGCTGT TCGTTGGTGG AACAACTCAT GTGTGACGAC GATAAGGAAG ACCTTTATGT
CAAGAAGCAC TTGAAAGAGA GGACTTGAGC TCAAATATCA TTCAAAATCA GTTCCGTTTT
TAC
 
Protein sequence
MGNVRRSNRT RKATYTVYDD AKAQRDSIDS TESRKKRRLT IDDDSVDEDA ESRTALSVAA 
EPEDVSDNDD NDNVESSSDE EEADAGPPQE SGTTKYSSRS RATSSQPRQI RAHRAVAKPS
TKRRGGAASM AAITTTSRRA AQTVLQGLAG KVLDPVVETP ETSLLAALLQ QAHDNPTHRR
SGGSSTPIET NLQRIALTVL HEHDTHPNRA QISLLNLLFR SVGGGVHTLL DPEEVDLESL
SDEAWEDLVT KVLVDMQQTP ADLVLFCADP NGTGGKKASV GVREYRKVYR AFWNVLGATA
LTMTTRTKSL ATDAARDHDS DTDEESFDAS ARFQVELVRD MVARVVEIVG VGQPDIRAAA
SVAIYSLSIA ILSHTVTLRT KLEAAQRQLA TAQRSKQKRK AHALQAQITI WTRTTEDLED
IVKETTMGVF LKRYRDSNPH IRAESLHVLS RFTLTRPDIF HKATFLKYLG WMLSDKEAVV
RERALDGLME PLLVAPTTSG KPLFSKIDVS DMRSVVDKFA TRLADCVLDV DTNVQEKAMN
FLLNLSRQGL LDSLEDDQVW EQINLRALAE DATPIVRRDA LFFVTEQLEA FDSGSTKLES
NVTERIHELV VWVAHSLADG NIPLEHIRFD LVGFIVVSLR ASPELKPIIC NWPVLLKGLQ
RDKSSTPKSK HDRRMLAVQQ RVLLEMLIRS VELEVRAVAK DGLMVQHVDP DLLAVQESED
TNLLPSRKKS KGDSSHEELT VALLRALPDL LDLFKTDSSV LELLTGLPPY FLPSVFNLPN
RKQDFCTLIS KLSKTFLEAT DSNVLFNCAL ALSCLAKDDH ARSGDAFLEL QATTTAIQVR
LSKLFERKAD ILTSDSPKGD NLIDTEHAIG LCLRRLRILS KRWDIADLLV DGKTKANSVA
ELEKLCIDIV RVVANDLRIR EVKNTDEVDN NNTPDIPKVW LDQDKRVHSL VAESASEALS
FLLSATAWRL KIEVDDLAIS AEAPKKSNGP EIVVRMRDSL IKLVVLCFEQ YVELDEGNSV
YSEEHFAFAE KVQSHAGTIA GDLRSLFPKQ WSAAVSPKLR SFAFTEDGHV VGGFVRFLKS
QEHRVRILIV LSACVFLTAA LLILRKCSFF FAQLRANEKM NVEDRFAIEQ LLLPIARGLS
ANWKDGIRRE AGAVLSHITG SGRIARCTVS SLSRVLKRIE PVRFLEAHMA CLRQDFDDWA
ASEPEELESD HPTEKEMVAY DEKEKEHAAK FDMIEQQAQR LSASLGVGKL REKSLGPALL
GFVREGARVA FSTDVPGYEE ELPLGARLPF LRIVSKYLNW IRRDENQLQT LRQDFNEMEK
KLRSESEYND IYEDDLATIE EFRLAGDLGK YPFNKNAKTD AMDDGSFSAE SLDSRTRPRM
SITSNISSIR SKMSATQASL SPLYEEGDGD RDSDDVGDDA HGSTNDYAST HASTHASNRF
ESESVSTRSS LTSHP