Gene Information |
Locus tag | PHATRDRAFT_52607 |
Symbol | SMC3 |
ID | 7198409 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 88863 |
End bp | 92808 |
Gene Length | 3946 bp |
Protein Length | 1232 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184568 |
Protein GI | 219128749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATTA AACAAATCAC CATTAGCAAT TTCCGGTCGT TTCGACAACA GCCGGAAATC GAAGCCTTTT CAACGCACAC CAATTGCGTG GTGGGACGTA ACGGGTCCGG CAAGTCCAAT CTCCTGGATG CCGTGCAGTT CGTCTTGTTG GCTCCGCGAT TCGCTAATCT CCGTCAGGTA CGTTGCGTTG TTTTTGTGGG AGAACTCTTT ACGATGCTCA TTCCAACGAT TTCCTACCTA CCTTGGCCTT TATCCCTCCT TACCTGCCTT TGTTGCTTCA CACTCTCACA CACACTCTCA CGCACACACT CGTTCACAAC GACAGGAAGA ACGGCAGGCC TTGCTGCACG AAGGATCCGG CAGTGCGGCC GTCAACGCGT TCGTCGAGAT TGTCTTTGAC AATGCCGATC ACCGGTTCGC CTTGGAGCAT TCGGACGAAG TCGTCCTGCG ACGGACCGTG GGACTCAAGA AGGACGAGTT CTTTCTCCAG CGAAAGCGGG TGACCAAACA AGAAGTACAG AGTTTGTTGG AAGGCGCCGG CTTCTCCCGC TCCAACCCCT ATTACATCGT GCAGCAGGGG AAAGTACAAG ATCTGTGTAC CATGAGCGAC GTGGAGCGCC TCCGACTCTT GAAGGAAGTC GCCGGGACCG TCGTCTACGA TCAGAAAAAA ACGGAAAGCC TCGCCAAAAT GCAGGAAAAC GGACACAGCA TTACCAAAAT ACAGGAAATA CTCGACGACA TTGAAGCCAA GCTCGACGAA TTGCAGGCGG AAAAGGAAGA ACTCACCGCC TACCAAGCCT TGGACCGGCA ACGCCGGGCC CTTCAGTACG TCCTCTACGA CAAACAACTC CGGAAAGCCC GGCACGATCT CGATCAGTTG GAACACGCAC GGACATCCCA CGTACAAAAC CTCAGCGCAC TACACGAAGA ATTGAAAGAC ACGCACGAGG CCATTCGCAA CAAGGACGCC GTGCTCAAAA CAAAAACTAA CGCGCTCCGG CGCAACCAGA CCACCCTCGA AGCCCACGAG CAGGACAAGA CCAGCCGGGT TACGGCCGTC ACGAGACTCC AACTCGAATG TCAGGAGCTT CTGGAAGCTG TGCGGAGTGG CGCGGAACAG CTACAGTCCA ACGAAAAGGA GCTCGAGGCC GTCCAGGCCG AAATTGAGGC CTCCCAAACG AACCTGCGGG ATACTGTCCA GCCCGCCTAC GATCAAGCCG TGGCTGTCTT GCAAGACTTG ACCACTCGTC GGGACGAAGC ATCCCGTCAG GTCGAATCAC TCTACGCCAA ACAAGGTCGG GGACAGCACT TTCAAACAGT ACAAGATCGA GACGCCTTTT TGCAAAGCTC CGTAGAGGAA CTCTTGGCGA CACAGCGCGA TAAAACCAGT GCCGTACAGG CGCAACAAGA CACACTGGCC AACTTGCGCC GTTCCGTTAC GCAAGAAACG ACGGAGATTG ACAAGCTTAC CTCGCAACTC ACCAGCCAAG CTGCCGGGTT GCAGTCTTTA TCCAAAACCA TTGAAGAAAC CAAGCGCCAA CGCTTGGAAC TACACGATGC GCGCAAGGAA GCTTGGCGGG AGGCCGAAGC CCTTCACGAT CAAGTTCGGG AAGCCAGGGG AACTTTTCAA CGGGCCAAAC AAGACACCGC CAAGGTCATG CCTCGCGCAA CCGCAATGGG ACTCAAGGCG CTAACGAGTG TGGTCGAACA GGAAGGACTC ACCTCGGATC AGTATTTCGG CATGCTCATG GACAATTTCG TTCTGCGAGA CGACAAGTAT CAAACGGCCG TCGAAGTAGC 
GGCGCAGAAT GCGCTCTTTC ACGTCGTTGT GGACACCGAC GTTACGGCGT CCCGGCTCAT GAAGCGCTTG GAGGCCGACA AGCTCGGTCG CGTAACCTTT CTACCGCTCA ACCAGCTCCG TATCGATCAG CCCAATTTGC CCCAATCTAA CGATATTCGT CCCATGCTCG ATTTATGTCT CCAGTACGAC CGAAAGGTCG AACGTGCCTT GCAACACGTC TTTGGCAAAA AACTCCTGGC TCGAACACCA GAAATAGCAT CGGAATGGAG TGCTCAGCAC GGCGTCGACG CAATCACCTT GGACGGCGAC TTGTGCAGTC GCAAGGGCGC CTTGACGGGC GGTTACGTGG ACACGTCCAA GTCTCGGCTC CGTGCGCACG CCAAACAAAC CGAAGCCCAA GCGGCCTTGC AAAACGTGGA GACGCTCCAT CAAGGCAAAA GTCGTGAAGC CGAGCAAGTG GAGCAGCAAG TAACGAATCT TATGCAGGAG CTGCAGCGAC AGGAAAGCAA ACAGGCGGAG CTCAGTCGGA TGGTCCAAGG TAAGGAAATG GAACTTGATC GAATACAGGC TCGATTAGAA AATCACAAAA AGCAGGTGGA AATGGTCGAA AAGACGCTAA TTCCACCTTT GGAACGGGAC ATTGTTGCCT TGAACGGGGA CATGGATCGT CTCAAGGCCG AAATGGGGAC CCCCTTGACG CAGACGCTCT CCGATGAGGA CCGCAAGCTG TTGGCCACGC TCAAAGAAAC GCAAACACGG CTTGTTGCCG AGATTGAATC CCAGAGTGAT AAGGTTGCTC AAGTTGGTTT GGACCGCCAA AAGCTCCAGA GTCTACTCGA CGATAATCTG CTGAAACGTC AACGCGAGCT AAGGGAAGGT GGCGTCGACG GGCGTCGGCG TCAGAGCCAT GGTCGTCTTT CGTCAGCCGC GGTGCAAGCG GAGCAGCAAG AAGAATTGGC CGAATGCCAG CGCCGATTGG ACGATGCCCT GCGCGTCAAA GACGAAATTG AGGGGCGTTT GGAAGAGTCA CGTCGGGTGG ACGAGGAGTT GCGGGGCGAA ATTATTGTGG TGAAAAATGA GCTGGAACAG CTGAAGAGCG AGTATCTGAA TGTTTCTAAG CGCCTCGAAG AAGCGCAGAA CGAAACTGAG CGGCTCATGA ACAAGGTACG AACGTGTTGT GGTGTATTGT GCGGTTGTGT CTATCGTCTC TGCGACAAGG CCGTATACTG ACTTAGATTC GGCATTTCTT TATTTGACTA CTGGTAGCGC TCCATGTGTA TTTCTACTCG CGAAGAAAAG ATGCGCTCCA TCCGGGAATT GGGATCCCTT CCCCCTCCGG CCGAGCTCGA CAAGCACTCG GGCAAGTCCA CTGAGGCACT CAAAAATTCC ATCGAGGGCG TCAATAAGAA GTTGAAAAAG TATTCGCACA TCAACAACAA GGCGTTCGAC CAATACATCA ATTTTAGTGA GCAGCGCGAA TCGCTGCTGG TGCGCAAGGC TGAACTGGAC CAAGGTGCCG AAAAGGTGGA GGAGCTCGTA TCGAGTCTCG ACCAACAGAA GGACGAGGCC ATCAATCGTA CCTTTCGCGG GGTCAGCGCC CATTTCAAGG ACGTGTTTGA AGAGCTCGTT CCCAATGGGG CGGGTGAGCT CATTTTACGC ACGGCCATGG ACGAAGCGAT GGAGGACGAT GCGAACGATA CGGATCAGGA CGACGATTCC GTCAACGCGG ATTCTCCGAA AAAGGCCAAG GGTTTCGACG CGAACAATCC GGATGTCAAT TTGTACCGCG GTATTGGCAT CCAAGTACGT 
TTTTCGGCCG TGGGCGAAAA CTATCTCATG TCCCAGCTGT CGGGTGGTCA AAAGGCCTTG GTCTCGCTGG CCCTGATTTT TGCTATTCAA CGGTGCGATC CGGCTCCCTT TTATATATTG GACGAGCTGG ACCAGGCGTT GGATGCCTCG TACCGTGCGG CGGTGGCCAA TCTGATTCAA AAGCAGGCCA CATCCACGGA GAATCCAACA CAGTTCATTG TGAGTACCTT CCGTCCGGAA CTGGTAGCGA TTGCGAATCG TTGTTACGGA ATTTCATTGC AGAACAAGGT GAGTCGCATC CACCCGTTGA GTAAGAAGGA TGCGTTACAC TTTATC
|
Protein sequence | MHIKQITISN FRSFRQQPEI EAFSTHTNCV VGRNGSGKSN LLDAVQFVLL APRFANLRQE ERQALLHEGS GSAAVNAFVE IVFDNADHRF ALEHSDEVVL RRTVGLKKDE FFLQRKRVTK QEVQSLLEGA GFSRSNPYYI VQQGKVQDLC TMSDVERLRL LKEVAGTVVY DQKKTESLAK MQENGHSITK IQEILDDIEA KLDELQAEKE ELTAYQALDR QRRALQYVLY DKQLRKARHD LDQLEHARTS HVQNLSALHE ELKDTHEAIR NKDAVLKTKT NALRRNQTTL EAHEQDKTSR VTAVTRLQLE CQELLEAVRS GAEQLQSNEK ELEAVQAEIE ASQTNLRDTV QPAYDQAVAV LQDLTTRRDE ASRQVESLYA KQGRGQHFQT VQDRDAFLQS SVEELLATQR DKTSAVQAQQ DTLANLRRSV TQETTEIDKL TSQLTSQAAG LQSLSKTIEE TKRQRLELHD ARKEAWREAE ALHDQVREAR GTFQRAKQDT AKVMPRATAM GLKALTSVVE QEGLTSDQYF GMLMDNFVLR DDKYQTAVEV AAQNALFHVV VDTDVTASRL MKRLEADKLG RVTFLPLNQL RIDQPNLPQS NDIRPMLDLC LQYDRKVERA LQHVFGKKLL ARTPEIASEW SAQHGVDAIT LDGDLCSRKG ALTGGYVDTS KSRLRAHAKQ TEAQAALQNV ETLHQGKSRE AEQVEQQVTN LMQELQRQES KQAELSRMVQ GKEMELDRIQ ARLENHKKQV EMVEKTLIPP LERDIVALNG DMDRLKAEMG TPLTQTLSDE DRKLLATLKE TQTRLVAEIE SQSDKVAQVG LDRQKLQSLL DDNLLKRQRE LREGGVDGRR RQSHGRLSSA AVQAEQQEEL AECQRRLDDA LRVKDEIEGR LEESRRVDEE LRGEIIVVKN ELEQLKSEYL NVSKRLEEAQ NETERLMNKR SMCISTREEK MRSIRELGSL PPPAELDKHS GKSTEALKNS IEGVNKKLKK YSHINNKAFD QYINFSEQRE SLLVRKAELD QGAEKVEELV SSLDQQKDEA INRTFRGVSA HFKDVFEELV PNGAGELILR TAMDEAMEDD ANDTDQDDDS VNADSPKKAK GFDANNPDVN LYRGIGIQVR FSAVGENYLM SQLSGGQKAL VSLALIFAIQ RCDPAPFYIL DELDQALDAS YRAAVANLIQ KQATSTENPT QFIVSTFRPE LVAIANRCYG ISLQNKVSRI HPLSKKDALH FI
|
| |
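The coordinate and sequence fields above can be cross-checked with a few lines of code. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the reported Gene Length) and that the spaces in the sequence fields are only display grouping in blocks of ten. The function names `span_length`, `clean_sequence`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group residues for display."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Coordinates copied from the record above.
    print(span_length(88863, 92808))  # 3946, matching the Gene Length field

    # First two display blocks of the gene sequence, as an illustration only.
    head = "ATGCACATTA AACAAATCAC"
    print(clean_sequence(head))
    print(gc_fraction(head))
```

Applying `gc_fraction` to the full Gene sequence field gives a value to compare against the listed GC content. How the database rounds that figure is not stated in the record, so a small difference would not necessarily indicate an error.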