Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_54192 |
Symbol | SMC5 |
ID | 7204040 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1008701 |
End bp | 1012215 |
Gene Length | 3515 bp |
Protein Length | 1099 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186447 |
Protein GI | 219113727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCTGCATCC AAAGTGCACC ATGACGAGTG ACTCGGACAG TGAAGAGGCC TACATCCGAC GTGTGGGTAC GCACAAGGCG GGTTCTATTA CAAGGATTAA GCTACACAAC TTTCTGACTT ATTCCGATGT CGAATTTCGT CCCGGGCCGC GGTACGTATC CGTATTGTCC ACGCCGCGCT CGAATGCATC CAGCCATTGC ACGGTTTTCT GAAAACGCGG AAAAGTCACG CGTTCTCAAC TCTGTTGCGT ATTGTTTTTG TGCAAAAGTC TCAATATGGT CATTGGTCCT AACGGGACGG GAAAGTCGTC GATTCTGAAC GCCATTTGTT TCGGACTGGG AGGGGAGCCC AAATTACTCG GTCGGGCAGA CGACGCCCGT GCATTCATCG CACACGGCAA GGACCACGCC GAAATTGAAA TCGAGTTGGC GCCGCTACCC GGCAAGGGTA CTCACGTGTT TCGACGTACC ATTGATCGTC ATAAGGGGTC AGAGAAAGGC AAAGGTCGGG GGGCGTCGCA GTACTTTGTC AATGACGAAA AGGTACACCC CAACGTGATT CGGGAAATTG TGTCGGAGGA CTATAATATT GCCATTGATA ACCTCTGTAC CTTTCTGCCA CAAGACAAGG TGGGGAGCTT CTCGGGCTTT GATTCGAAAC AGCTTTTGCA AGAGACGGAG AAAACGCTCT CCACGTCGCA GCACTTGTAC AGGCTACATA TGGATCTGAT CCAAGCCGAG GCTGAGCTAC AGTCGGGAGT CGTCAACGTC GATACAATAA AATCCAAATT AAAAAAACTA GAGCACGAGA ACAAACAGCT CGAGCAAGAA AAAATGCGTG TGGAAGAACG CGAAGAAGCT CTAGTACAAG CCGAAGTTTT GGAGAAGAAA AGAATTTGGC TGCAAGTCGA TGTGTTGCGC GAAGAAGCCG TCTCGTTAAA AGAAGCGAAG ACGGAAGTCA AGGACCGCCT CAAGGCCGCT CATGCAGAGC TCGCTCCTTT GCAAGAGGAG CAGCAACGTC TTGCAAAGGC TTGGAAGGAA GCCGATCTGC AGTTGAAGGT TCTCGAAATG AATAAGCAGA AATGTAACAA AGAAATGGAA AAACAGCTGA AGAAGTACGA GAATCATGAC GATGGTATCG AGGAGGCGCT CGCTATGCTA CGCGAGCTGG ACACCAAGCA TGAAGAAGTA CAAGCTCGTT ACCGCTCTCA GGAAGAACGC GTCGCGACTT TGGAAGAGCA GCTCTCTAGT TTTGCGACCA CTGAAGAAGA AATGACAGAC CAGTATAACG AAGCCAGGGA GGCGGCTCGT GTTGCGTCGC GCGCATACGA ATCAGCTAAA CGCGAGCTTG CCAGACATCT CGAAAAAGCT CACCTACTGA AAGAAAAGGG TAAAGAGGCG CAGATGAAGC TTGCGAAGAT GAACGACGAA GGTGCCCGGC GAAAGGAACG AATATTTCGA CAAGAAAGGA ATCTCGGGCA GATTTTCGAA TGGCTGGAAA GTAACCGGGA TAAGTTTCGT CGCCCTGTCT GGGGTCCGGT GGCCTGCGAA GTGGCAACCA AAGATCAGAA TACTGCGGCT GCTTTGGAGC AACACGTTCC AAATTGGGTA TTGAAATCCT TTGTCGTCGA AAACAAAGAA GACTATGATT TTTTGTTTAG CGAGATTCGT GAACGTCGTA AAATTCCGAT CAATATTGTG AATACGGATG GCCAGCGACT TTCGGATCCG CAACGGCCGT ACTCGGAAGA GAAAATGTCG ATCCTTCAAA AGGAGTACGC GATAGCAGGG TATCTTGATC ATTATTTTAC AGCTCCTGAC CAGATTATGC TAGTACTTCG CAAGCAAGCA GCGGTGCACA AAGTTCTAAT GGGAGGAGAA GAAACGAATC AAAAGCTTAC AAAACTGACG GACTTCATTT CGGAGCCGGA TATCTCCCTA GGTCAGACCG ACAAGCAGCC TTCGGTCCTA TTCTGTTCTG ATAATGGCAA AGCGCTTAAG TTCAGCAACG TTGTCTCAAG ATACAGTAAG GAAATCTCAT CGCGACAGGA TGATATCAGT CAAGCTCGCC TTCTGGCGCC GGGTGTCAAT CCTCGCGTAA AAAAAGAGGC CGAGGACAGA ATTGCAGAGG CCAATGCCGA AATGAATGAA CTGAGGCCAG CGATTGAAGA CTCCCAGAAA GAAAAAAATA AAACTGAACT TGCAGCCCAA GAAGTGAAGG CCAAATTACA GTCATCAAAG CAGTCTTTGG AAAGTCTGAA GAAGTTTCAG CAGAAACTGG AGAACGCTCG CAATAAACTT GATGATGCTC GGCGCGATTT GGAAAGCGAT GACGAAAAGG AAAAGAAGGC GCTGGTACAA TCTCTAATGA ATCGAGTTGC TCATGGGGTG TCGGCCCTGG AAGTCCATGC TCAACAACAT GAGCAAATGC TGCTAGCAAC AATGGAGAAT GCCGGTCTTC AAATTTCTCG CAACGATTTT TCAGTAGCCG AGCGTAGAGC GAAGTACGTA TGTGTCATTG CTAGCCGTCG ATTGTTGTGT TCGCGTCCCT AACTAATCTT TGATTCGCAT CTCAGAGAAC TTGTCCTCGA GAAGAACACT AGCTTTAAGG GACTTGAGAC TCGAGCGGTG AAGATTCAGA CAGACTTCAT GAATGTGAAA AAAGAATACG CGAAGCTGAA GACGGAAGCT GAGCGAGTTG CGCCTTTGGA AGACGAAAAC GGCAACAAAA CAGAGCTATT TGATCAGCTC CAGGAACTCG AGGTTACGAC TTTGCATGAC TGCGAAGCCG CCCTCGACGA AGCAGTCAGC AAAGCAGATG AATACGCGGA CAACCCGGAC GCCCTACGGC AATATGAGCG AACAAAGGCA GAGATCGAAG AAGTTCAGAC GAAGCTGGAC GATTTGACGA GCTCGAAAGA TGCCAAGCTG CAGGAAATCC GAAATAAGAG CAATCCGTGG CAGGCTGCGT TGGAGAACTA TGTAAGCAAG GTGGACAAAC TCTTCAGCGA GTACATGCAG GAAATGGAAT GCACCGGCGA AATTCGTCTA AAAAGGGGTA AAATCGATGA AGATGATGAA AATCAGATAG GCAACTTCAA GGACTGGGGT ATCGAAATTC TGGTGAGTTT TCGGGAAGGT ACGAAGGCGC AAATTTTATC AGCACAGGTC CAGTCTGGAG GTGAGCGTTC GGTCAGTACA ATCATGTATT TAATGGCTCT ACAAGACATG ATGGTAGCCC CATTCCGATG CGTTGACGAA ATCAACCAAG GCCTTGACGA TCGGAACGAG CGGCTTGTCT TTCGTCGCAT TGTTGAGAAC TCAACCCGTC CGCCGAAGGG CGAGCCGTTC GAGCACGTTG GACAATATTT TCTCATCACG CCGAAACTTT TGCCGAATCT TGTGGATATG GAAGAGGAAG GCGTCACGAT TCTTTTTGTT TTTAATGGTG AGGGAATGCA CCAGAGTATA TTTTTTTATG AATTGTCTCA TCTCACACTT TGTCCTCTTG TTTAG
|
Protein sequence | MTSDSDSEEA YIRRVGTHKA GSITRIKLHN FLTYSDVEFR PGPRLNMVIG PNGTGKSSIL NAICFGLGGE PKLLGRADDA RAFIAHGKDH AEIEIELAPL PGKGTHVFRR TIDRHKGSEK GKGRGASQYF VNDEKVHPNV IREIVSEDYN IAIDNLCTFL PQDKVGSFSG FDSKQLLQET EKTLSTSQHL YRLHMDLIQA EAELQSGVVN VDTIKSKLKK LEHENKQLEQ EKMRVEEREE ALVQAEVLEK KRIWLQVDVL REEAVSLKEA KTEVKDRLKA AHAELAPLQE EQQRLAKAWK EADLQLKVLE MNKQKCNKEM EKQLKKYENH DDGIEEALAM LRELDTKHEE VQARYRSQEE RVATLEEQLS SFATTEEEMT DQYNEAREAA RVASRAYESA KRELARHLEK AHLLKEKGKE AQMKLAKMND EGARRKERIF RQERNLGQIF EWLESNRDKF RRPVWGPVAC EVATKDQNTA AALEQHVPNW VLKSFVVENK EDYDFLFSEI RERRKIPINI VNTDGQRLSD PQRPYSEEKM SILQKEYAIA GYLDHYFTAP DQIMLVLRKQ AAVHKVLMGG EETNQKLTKL TDFISEPDIS LGQTDKQPSV LFCSDNGKAL KFSNVVSRYS KEISSRQDDI SQARLLAPGV NPRVKKEAED RIAEANAEMN ELRPAIEDSQ KEKNKTELAA QEVKAKLQSS KQSLESLKKF QQKLENARNK LDDARRDLES DDEKEKKALV QSLMNRVAHG VSALEVHAQQ HEQMLLATME NAGLQISRND FSVAERRAKY VCKNTSFKGL ETRAVKIQTD FMNVKKEYAK LKTEAERVAP LEDENGNKTE LFDQLQELEV TTLHDCEAAL DEAVSKADEY ADNPDALRQY ERTKAEIEEV QTKLDDLTSS KDAKLQEIRN KSNPWQAALE NYVSKVDKLF SEYMQEMECT GEIRLKRGKI DEDDENQIGN FKDWGIEILV SFREGTKAQI LSAQVQSGGE RSVSTIMYLM ALQDMMVAPF RCVDEINQGL DDRNERLVFR RIVENSTRPP KGEPFEHVGQ YFLITPKLLP NLVDMEEEGV TILFVFNGEG MHQSIFFYEL SHLTLCPLV
|
| |