Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25506 |
Symbol | SMC1 |
ID | 7197261 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1337790 |
End bp | 1341963 |
Gene Length | 4174 bp |
Protein Length | 1237 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177808 |
Protein GI | 219112113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTCGTCGTC GTGCGAGTGT GTGTGTGTGT GTGTGTTCGT TGCGGCGTTG ACCGATACTC GCGTCGTCCA CCATGCCCGT CACGTCGCTG GAACTCGAGA ACTTCAAGTC CTACGCGGGC TTACAAACCA TCGGACCCTT TCGCGACTTT ACGTCCGTCA TTGGCCCCAA TGGTGCCGGC AAGTCCAATC TCATGGACGC CGTCAGTTTC GTCCTCGGCG TACAGTCCCG CGATTTGCGC AGTACCGTCC TGGCGGATCT CGTCTTTCGT CCTCCCACGA CGATTGGTAC TACTGTGAGT ACTACTAGTA CGACGCCCGC ACTCCGCGCG TCGGCCACGC TCGTGTACGC CGACGCCGTC ACCGGCGCCG AGACACGCTT CGGACGGACC ATTGGCGTCC GCGGAGTCGG GGAGTACCAC CTGGACGGGA AGGTTGTTTC CTGGACGGAC TACGAAGCCG CCCTCGCCGA CATTGGGGTC CTCGTCAAGG CGCGGAACTT TCTCGTCTTT CAGGGAGACG TGGAAGCCTT GGCGCGCAAG TCGCCGGCGG AACTCACCGC CCTCGTCGAG CAAATTGCCG GATCCGCCGG ACTCGCGGAC GACTACCGCC AACGACACGC CGACAAGGAA CAAGCGCAGC AGAATACCGT CTTTCTGTTG CAACAGCAAA AGACGCTCCG GGCCGAGCGG AAACTACTCA AGGAACAAAA GACTGAGGCC GACCGGTTCC ACCAACTACT CACGGAAAAA GCCGACGTCG AGACCGAGCT CTATTTGTGG ATACTCTACC ACTTGGACCG GGACCGACAC GAGCGGGACG CCGTCCTCGG GGAACTCCGC GACGAACGGG ACGCCCACCG CGCCACCGAA CAAACACACG CCGAGACACT CCAACAAGCC AAAAAGCAAG CCAGTGCGGC CCGGAGGGAG ACCGGACAAC GCCAACAACG CCGGGTGGAA CTCGCCGCAC TCGCGGATCG ACTAGAACCC GCCGTCATAC AAACCACGGA AGAAATCAAA TCCCTCGCTA ACAAACTCGC GCAGGACGAG AAACAAGTCG CCAAGAAACA GACGGAGGCC GATACGCACC GGGAACGAAT CGACGCGATT GCCAAGGAAA TTGCCGACTA CCGCACACAG CTGACCGCGT TGGAACGCGA CTACGACGAA ATCAAGGCCA ACGCCGCTCC CGTCCAACTC ACGCCCGAGC AAGAAACACG TTACGAAGCG CTCCGCTACC AGGCTGCTGC CGCCAGTGCC GCACCAAGAC ACGCGCTCCA TGCCGCCCAA CGACGCCTGG AGCAGGCACG TGCTCACGTT GCTACACTCC AACACACGCT CCAAGAAGCC CAAGCGGCCC AGGCGGAAAC CGCACGCGAC GTACAGGCAC TCGACACCCG ACGCGAGAAA CTTACCAAGG TACGGTGGAC GAGTGGGTGG GTGGTCTGGC ATGTTGGCTT GGATAGGAAC GGAGGGGCAT GCCGTTGCCC ATGCTGTGTT CTCACGCTCG CGTGTTTTGT TTTTTGTTTT CTCGCGCTTC CTCCCAGAGT CTCGCAAATA CTACTCAGGA TCTGCAGGCA ACGGAACATG AACTGGTGCA AGTCCAAGGA CAGGCGCAGC GGGTCCAAGT TCGTCGCCAG GAATTGGATG TTGACATAGA AAAGCTCGAT GCGTCCTTAC GGGAGGCCAA GTACGACAGT ACCCGTAGTA AGGACGAGGA ATGTTTGGTG CGGGCCATCG CCTCACTCCA ACAGCACTTT ACCGGGGTTC ACGGTCGACT CGTCGATTTG TGTCGCCCCG TCTCCCGCAA ATTCAATCTC GCTGTTACCG TCGCGGCCGG GAAAGACATG GACGCCATTG TTGTGGATAC CAAGCAGACG GCCTTTGAGT GCATCAAGTA CTTGCGCGAA CAGCGCGTCG GTACGGCCAC CTTTCTGCCC TTGGATAGTC TACAAACGCC GTCACCCGAT AGTACGGAAC GATTGCGGGC GCACGTGGCC AAGGACGGCC GATACAGCCT GGTAGCCGAC GTAATTGCTT GTGACGACGC GGTCCACCGA GCCGTGCAAT ACGCCGTGGG GAATACGGTC GTGGCCGAAG ATCTGGATGC GGCCCGGGAA CTGTGCTTTG GATCGTCCTC GTCGCGGCGA GGCGGTCGGT CCGAGGGAAA CTCACCCCAA TCTCGCGTCA AGGCGGTCAC CTTGGGAGGG GCCGTCATTA GCAAAGCGGG TACCATGACG GGTGGAGTCA CACGAGACGA GGACTCCAAA TCGGGACGCT GGGATGCGCA GAATCTCCAT AAAATTCAGG AGCAGAAAGC TCAGTTGGAA GCGGAACGGG AGGCCTTGGA TACCGGCGGT GCCTCCAACA GACGCAGCGG AGTTGGGGCT GGTGGGTCTT TGGGACACGC GAGCAAGATC GAAGAGCTCC GGAACAAGGT TGGTAATCTC CGCAACAAAG ACCAGTATTC AAAAAGCGAT CTGGAGTTTA CCAAAAAGCA ACTGGAGGAA AAGACGGTGT TGCTAAAGTC GACCGAAAAG CAGCTCGCCA AGTTGGAAAA ACAAGTGGCG GCGGGCGAAA AGGAATTTTC CAAGGCCAAT ACTGCGGTCC AGAAGGGAAT TGCCGCAGTC AAGGCGGCGG AGGATGAGCT TCTTGGAGAT TTCCGTGACG AGACTGGTCT GCGAGATTTG AACGCTTACG AAGAGGCGAT CGGAAAAAGC CGGGATGAAT TCAACGAGAG GAAACGCACG TTTATGGAGC ACATTGCGCA GCTGGAGCAA CAAACAAAAT ACGAGTCGGG GCGTGATCTC CAACAGCCCA TTGTACGTAT CGAAAAGCGG ATCAAGGAAC GTAAGGCTGC TCTTGCCAAA GCGAAGAAGA AGGAATCTGA ACTGCGCAAG AAGGTCGATG AGGCCAAAGC GAATCTGGCC GAAGCGGAAA TCAAGGTGGA GGAAGCGATC GACAACGAGA AGAAATTCGA GGAGCAAGTG CAGGATGCTC AAAGTGCCTT GACGGAGGCC CAGAATGAGC GGATTCGTAT TGACAAAGCC ATTGGATCGG AAGAGACAGC GCTCGAACGC CTTCGTGCAA AGTTGCATGA TACCTTGCAA AAAGCGCATG TAGAGGAAGT CTTGCTGCCG CGAGTTGGAG ATGACAACGC ATCTCAGGTC CGCACTCGCT CTCAGCGCCA CAGCGCTGGT GGAAGCGGCG TGGACGAAGC GGGCGAGTCA GAAAGTCAGT CCCAATCACA GATGTCGTCC TCAGCGACTG GCGCAAGTAT TCCACTGACA CAGGAGAGCC GTTCGCGGAC ACACTTCTCC CAGGCAGACA ATACCGTTGT CGTCCTGGAC CTTGAGAAGG CTTCGAGCGT AGATTTCTCC CGAATGCCCA GTCCTCTGAA GCAGCGCATG AGTGACCGCG ATGAAAAGCG AATGCGCAAG GAGTTCGAGG ACAAGCTGGC CAAAATAGCT GCCAACATTG AGAGTATCAC TCCAAATATG AAGGTATGCT CCTACCCGTA AACGGATTTT ACTGTGGATG AGTCACTGGC TAACACTCTT TTTTTCCGTT TTAGGCCAGC GAAGCGTTCT CTACCATAAC CGATCGCCTC AAAGGAAGTA GCTCGGACTA CGAAAAGTCC AAAGAGAAGT CTGCGAAAGC GGCTCAGGCC TTTCAACGAG TCAAGGCAAA GCGCGCTAAA CTATTTAACG AGGCTTTCAA CCATATTGAT GAGGCTTTGA AGACAATCTA CACCGACATG ACCAAGAGTA GCAAGCATCC TTTAGGTGGG AATGCCTATC TTAGCCTCGA CGATGCCGAA GAGCCGTATA AAGGTGGAAT CAAGTTCAAT GCGATGCCAC CGATGAAGCG ATTTCGTGAC ATGGAACAGC TAAGTGGCGG CGAGAAGACG GTAGCCGCTC TGTCACTATT GTTTGCCATT CATTCGTTCC ACCCGGCGCC GTTTTTTATC ATGGACGAAA TTGATGCAGC GTTGGACAAC GTCAACCTTC GCAAGGTTTG CAACTATATC AAGCAACGCA GCCAGACAGA TTTTCAGTGC ATCGTAATCA GTCTCAAAGA CATGTTCTAC GAGCACAGCC AAGGATTAGT AGGTATCTAC CGCGATGTCG GTACGAACTC GAGCCATACT CTTACTTTAG ACTTGACGAA ATTT
|
Protein sequence | MPVTSLELEN FKSYAGLQTI GPFRDFTSVI GPNGAGKSNL MDAVSFVLGV QSRDLRSTVL ADLVFRPPTT IGTTVSTTST TPALRASATL VYADAVTGAE TRFGRTIGVR GVGEYHLDGK VVSWTDYEAA LADIGVLVKA RNFLVFQGDV EALARKSPAE LTALVEQIAG SAGLADDYRQ RHADKEQAQQ NTVFLLQQQK TLRAERKLLK EQKTEADRFH QLLTEKADVE TELYLWILYH LDRDRHERDA VLGELRDERD AHRATEQTHA ETLQQAKKQA SAARRETGQR QQRRVELAAL ADRLEPAVIQ TTEEIKSLAN KLAQDEKQVA KKQTEADTHR ERIDAIAKEI ADYRTQLTAL ERDYDEIKAN AAPVQLTPEQ ETRYEALRYQ AAAASAAPRH ALHAAQRRLE QARAHVATLQ HTLQEAQAAQ AETARDVQAL DTRREKLTKS LANTTQDLQA TEHELVQVQG QAQRVQVRRQ ELDVDIEKLD ASLREAKYDS TRSKDEECLV RAIASLQQHF TGVHGRLVDL CRPVSRKFNL AVTVAAGKDM DAIVVDTKQT AFECIKYLRE QRVGTATFLP LDSLQTPSPD STERLRAHVA KDGRYSLVAD VIACDDAVHR AVQYAVGNTV VAEDLDAARE LCFGSSSSRR GGRSEGNSPQ SRVKAVTLGG AVISKAGTMT GGVTRDEDSK SGRWDAQNLH KIQEQKAQLE AEREALDTGG ASNRRSGVGA GGSLGHASKI EELRNKVGNL RNKDQYSKSD LEFTKKQLEE KTVLLKSTEK QLAKLEKQVA AGEKEFSKAN TAVQKGIAAV KAAEDELLGD FRDETGLRDL NAYEEAIGKS RDEFNERKRT FMEHIAQLEQ QTKYESGRDL QQPIVRIEKR IKERKAALAK AKKKESELRK KVDEAKANLA EAEIKVEEAI DNEKKFEEQV QDAQSALTEA QNERIRIDKA IGSEETALER LRAKLHDTLQ KAHVEEVLLP RVGDDNASQA SSVDFSRMPS PLKQRMSDRD EKRMRKEFED KLAKIAANIE SITPNMKASE AFSTITDRLK GSSSDYEKSK EKSAKAAQAF QRVKAKRAKL FNEAFNHIDE ALKTIYTDMT KSSKHPLGGN AYLSLDDAEE PYKGGIKFNA MPPMKRFRDM EQLSGGEKTV AALSLLFAIH SFHPAPFFIM DEIDAALDNV NLRKVCNYIK QRSQTDFQCI VISLKDMFYE HSQGLVGIYR DVGTNSSHTL TLDLTKF
|
| |