Gene Information |
Locus tag | PHATRDRAFT_30352 |
Symbol | SMC2 |
ID | 7195804 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 252225 |
End bp | 256294 |
Gene Length | 4070 bp |
Protein Length | 1213 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184096 |
Protein GI | 219127758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGATACGAAG TGTCATCGGT TTATTTGCCT TTCTTTTCGC AAGAGAGAGC GTACGCCAGA AAAGGGGATC TCTGAAGCGC TGTTGATAGT ATTATCGTCG TGTGAATCGA TAAAATTCTG AGGCAGTCAG CAGCACCGGA AACGCTCCAT TTGGTTAACC ATGTTCATTC AAGAAATTGT CATTGATGGT TTCAAGTCCT ACGCTCGTCG AACAGTCGTG GAGGGGTAAG ATAAAGCGAG TGACCTTTAT GAAGGTCGGA TTTTGTTTCG TGCTAAATCT CACGTCTTTG CACGATGTAC TATAGTTTCG ATCCTCACTT CAACGCCATT ACCGGCTTGA ACGGGTCTGG TAAATCCAAC ATTCTCGACG CCATTTGCTT CGTGCTCGGT ATAACGAACC TGTCCCAGGT ACGCGCCGGA AATCTTTCCG AGCTCGTCTA CAAACAAGGA CAGGCTGGAG TAAACAAAGC CACCGTTACG ATCATTTTCA ACAACGAGGA CGAATCTTCC AGTCCAGTAG GTTACGAGCA ATGCCCTCAG GTTACCGTCA CACGCCAGGT TTTGATCGGC GGCAAGAGCA AATATCTCAT TAATGGACGC AATGCCCCCG CGAATCAGGT ACAGAATCTT TTCCATTCGG TACAGCTCAA CGTGAACAAT CCCCACTTCC TCATTATGCA AGGACGCATC ACCAAGGTCT TGAATATGAA GCCGCACGAA ATTCTCGGTA TGGTCGAAGA AGCTGCCGGT ACCCGCATGT ATGAAACGAA ACGCGTCGGT GCCTTGAAAA CGATTGAGAA AAAGCAGCTC AAGCTGGATG AACTCAACGC AGTTCTAGCC GAAGAAATTA CACCCACTTT GGAACGATTG AGGGGCGAAA AACAATCCTA CCTTAAATGG AGCAAGAACA ACGCTGACAT GGAGCGTATT GAACGTTTCG TCATTGCCAA TGAATTCATG CAGGCACAGA AGGCTTTGGA TAATAACACA GAAGGCTCCG CCGAAATGGA AGAGCAGGTT GCCATTCTAG ACGACAAGAC TTCGCAAATT CGAGAACTCA TTGTTGCCAA GGAACGCGAG ATTGAAGAGC GCTCGTCCTC CCTCAAAGGA GAGTTTGAGA ATTCACATAA CGAAGCAAAA GTTTTGGAAG AGCAGCGCTC CAAAGATCTC GTCAAAATAA CTTCCTCCTG GAAAAATGCA AAGACCAATG TTACCAAGGC AGAGAGCGAC CTGGACGCGG CGCGAAGTCT CGTCACTGAA ACGAAACAAG CGGTAGTTGC CAAGGAAAGC GACATCGCTA CTGAATCGCA GAGCATTGAA CACAAGATTC TGGCTGCCAA AGAGGCTGAG GAACGACTTG CACGGCTAAC TCTGGACTAC CAGAACATGT CGGCCGGTAT CAGTTCCACA GAAGGAGACG AAGGCCGTAC ATTACCGGAA CAGATAAGCA AGGCGCACAG CGATTCAAAG TCAGCCGAGG CAAAGGTGCA GCAAGCCAGC ATGAAGATGA AGCACTTGTC AAAAGAGTTG AAGGTATGTG TACTTGTCGA TATTTTTGGT TTAGTTTTTT ACCCTTTCGA CTCAAGAGCC AAATTGCTCT GACTGTTTCC AGCTGGTCGA GAAAGACCTC CAAAAGGAAG GGAAAACTGC TGAAAAGATG GCCCAGAAGC GTGCAGTAGC AGCTCACAAA GTAGAGGATT GCCGTGGCAA GCTTAAAGAT ATGGGCTTTT CGCCGGAAGA GTTCAATGCT CTGGATCAAG AAAAGACAGA CCTGGAAATT ACCGTCTCGG AGTTGTCGGA 
GCGCGTTGAC ACGCTTTCCG CACAGCTTGA AGGGAGGCTC CGTTTCAAAT ATTCCGATCC CGTGCGTGGG TTTGATCGTA GCAAGGTCAA GGGGCTTGTG GCAAAGCTCA TCGAAGTGAA GGATCACAAG AATGCTACTG CTTTGGAGGT TGTTGCCGGT GGAAAGCTGT ATCAAGTCGT GGTCGACGAA GCAATTACTG GTAAAGCGCT TTTGGACCGC GGCAAGTTAG AGCGACGTGT GACCATCATC CCACTGGACA AGATCAAGCC GCGTAATGTT AGTCACACTG CTTCGGAACT AGCCAATGAT ATTTCCCAGT CGCTCGATTC GAGGGCTTCT CCGGCAATCG AATTAGTTGG TTTCGATGAA GAGGTTCGTA GTGCCGTTGA GTACGTCTTT GGCTCAACTA TTGTGGTCGA CGGCATGAAA GCTGCAAACG CTATCTGCGA TGCAACAAAA ACACGGACCG TTACCTTGGA AGGCGACGTT TACGATCCGT CGGGGACTAT ATCTGGTGGC TCCAACAACC AATTGGGGAC AACTCTAGTC AAGCTCACTG AACTAACTCA AGTGACAAGT AAGCTCGACG AAAAGCGCTC GCTCCTTGCT TCTATATCGA TGAAAGTGAA GTCTATGGCT ACGCATGCTT CTTCCTACGA CAAGCTCAGC GCAACTTTGG AGCTAGCAGA GGCGGAACTG AGCAATATCG ATAAGCATCT GTCACAGACT AGCTTTGGTA TGCTGGTTGA GCAGCGTGAT TCTATGGCTG CCGAACTGGA AGCGGCCCAG AACGAGTCGA TTGAAATGGA AGAGGAGAAA GAAAAAAAGT GGACACTCTT TGTCAATCTC CAGGCACAAG AAGCTGAATT GACCGAGCGT CGAGAACAAC GCTTAGCTGA GATTGATCAA GCGGTCAAAG ATGCAAAAGC TGACACTGTT GAGAAAGGGC GCATCGCTCG ACAGGCGGAC TCAAAATCTC AAACATTTTC TTTGGAACTC GATAGTCTCC AAGCTGAGGT CGCAGCAGCA GAGGAAGCCG TTTCAGTAGC GGAGCAACTA CTTGATGAGG CCACGGGTGA CGAATCGAAG GTACAAATGA AAGTTGGAGA AGTTCGCGCG CTGTACGAAG AAGCGAAGAA AGAGTTGGAT GAACTTGACG GCCGCCTAAA TTTATACTCT GCCAAGCTTG TGGAGCTCAA ACGCGCCAAG AGCTATCTCG TCAAAGAAGC CGAAGTGGCA ACCTTGGAAG CCAAGAAATT GTCCGTGACT ATCACTCGGA TTCACAAGGA ACGAAGTGGG GCGGAAAAGC TTGTTGCCAC ATTGATGAAA AAGTATGCTT GGATCGACAG CGAAAAGAGC GCTTTCGGGG TGCCCGGGGG AGACTACGAC TTCGAAGAAA CAAACCCGCG CCATGTTGGG CAACAGCTAC AGTCTCTCAA AGCCGAACAG GAATCCTTGG TAAGCACTTC ATGCTGTTTA AAACCTTTTC AAAGGTACAC TCACATTTGC ATCTCCACAT AGTCCAAGAA AATCAATAAG AAAGTTATGG GAATGATTGA GAAAGCAGAA GGGGAATACA CTGAGCTTTT GCGAAAGCGG AAGGTGGTCG AGAACGACAA GAAGAAGATA CAGGCCGTCA TTGAGGAATT GGACGTTAAA AAAAAATCGG AACTCGAGCG TACTTGGGTC AAGGTCAATC GGGATTTTGG ATCCATATTT TCGACACTGT TGCCCGGCGC TTTTGCGAAA CTTGAACCTC CGGATGGCAT GAAAGCCTGG GAGGGTCTCG AAGTGAAGGT GGCTTTCGGT 
GACGTCTGGA AAGACAGTCT GAGCGAACTC AGTGGTGGAC AGCGGTCTCT ATTGGCTTTG TCCCTAATTC TGTCATTGCT ACTTTTCAAG CCTGCCCCAA TGTACATTCT TGATGAAGTC GATGCCGCTC TGGATTTGAG TCATACCCAG AACATCGGAA ATATGTTGAA AACCCACTTT TCGCAGAGTC AGTTCGTTGT CGTGTCGCTG AAAGAAGGCA TGTTCAACAA TGCCAATGTC ATTTTCAGAA CGAAGTTTGT GGACGGGATT TCTACGGTTA CTAGAACAAT TGGAATTGGG TCCAGCCGCA ATCGTGCCTT AGCCGAATCT GACAACGCCG ACTCCACCAA TACTTCTGAA AAAGGACGAA CAGAGCAGTC AAGGAGAATT GGCAAAGAAA ATACTGTAGT TTAAGGGTTT TGTCGACAGC TCACAGTCCA GCATAATTGT AGTGCTGTTT
|
Protein sequence | MFIQEIVIDG FKSYARRTVV EGFDPHFNAI TGLNGSGKSN ILDAICFVLG ITNLSQVRAG NLSELVYKQG QAGVNKATVT IIFNNEDESS SPVGYEQCPQ VTVTRQVLIG GKSKYLINGR NAPANQVQNL FHSVQLNVNN PHFLIMQGRI TKVLNMKPHE ILGMVEEAAG TRMYETKRVG ALKTIEKKQL KLDELNAVLA EEITPTLERL RGEKQSYLKW SKNNADMERI ERFVIANEFM QAQKALDNNT EGSAEMEEQV AILDDKTSQI RELIVAKERE IEERSSSLKG EFENSHNEAK VLEEQRSKDL VKITSSWKNA KTNVTKAESD LDAARSLVTE TKQAVVAKES DIATESQSIE HKILAAKEAE ERLARLTLDY QNMSAGISST EGDEGRTLPE QISKAHSDSK SAEAKVQQAS MKMKHLSKEL KLVEKDLQKE GKTAEKMAQK RAVAAHKVED CRGKLKDMGF SPEEFNALDQ EKTDLEITVS ELSERVDTLS AQLEGRLRFK YSDPVRGFDR SKVKGLVAKL IEVKDHKNAT ALEVVAGGKL YQVVVDEAIT GKALLDRGKL ERRVTIIPLD KIKPRNVSHT ASELANDISQ SLDSRASPAI ELVGFDEEVR SAVEYVFGST IVVDGMKAAN AICDATKTRT VTLEGDVYDP SGTISGGSNN QLGTTLVKLT ELTQVTSKLD EKRSLLASIS MKVKSMATHA SSYDKLSATL ELAEAELSNI DKHLSQTSFG MLVEQRDSMA AELEAAQNES IEMEEEKEKK WTLFVNLQAQ EAELTERREQ RLAEIDQAVK DAKADTVEKG RIARQADSKS QTFSLELDSL QAEVAAAEEA VSVAEQLLDE ATGDESKVQM KVGEVRALYE EAKKELDELD GRLNLYSAKL VELKRAKSYL VKEAEVATLE AKKLSVTITR IHKERSGAEK LVATLMKKYA WIDSEKSAFG VPGGDYDFEE TNPRHVGQQL QSLKAEQESL SKKINKKVMG MIEKAEGEYT ELLRKRKVVE NDKKKIQAVI EELDVKKKSE LERTWVKVNR DFGSIFSTLL PGAFAKLEPP DGMKAWEGLE VKVAFGDVWK DSLSELSGGQ RSLLALSLIL SLLLFKPAPM YILDEVDAAL DLSHTQNIGN MLKTHFSQSQ FVVVSLKEGM FNNANVIFRT KFVDGISTVT RTIGIGSSRN RALAESDNAD STNTSEKGRT EQSRRIGKEN TVV
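The derived fields in this record (Gene Length, GC content, Protein Length) can be recomputed from the coordinates and sequence fields. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, which is consistent with the record (256294 - 252225 + 1 = 4070). The function names are hypothetical and are not part of any database API; the sequence fragments used are only the first two blocks of the Gene sequence and Protein sequence fields.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def clean_sequence(field: str) -> str:
    """Strip the spaces and line breaks that group residues into blocks of ten."""
    return "".join(field.split()).upper()


def gc_percent(dna_field: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence field."""
    seq = clean_sequence(dna_field)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of the record.
    print(gene_length(252225, 256294))
    # First two blocks of the Gene sequence field only, not the whole gene.
    print(gc_percent("GGATACGAAG TGTCATCGGT"))
    # First two blocks of the Protein sequence field.
    print(len(clean_sequence("MFIQEIVIDG FKSYARRTVV")))
```

Pasting the full Gene sequence or Protein sequence field into gc_percent or clean_sequence gives values that can be compared with the GC content and Protein Length fields above.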