Gene PHATRDRAFT_52607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52607 
SymbolSMC3 
ID7198409 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp88863 
End bp92808 
Gene Length3946 bp 
Protein Length1232 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184568 
Protein GI219128749 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATTA AACAAATCAC CATTAGCAAT TTCCGGTCGT TTCGACAACA GCCGGAAATC 
GAAGCCTTTT CAACGCACAC CAATTGCGTG GTGGGACGTA ACGGGTCCGG CAAGTCCAAT
CTCCTGGATG CCGTGCAGTT CGTCTTGTTG GCTCCGCGAT TCGCTAATCT CCGTCAGGTA
CGTTGCGTTG TTTTTGTGGG AGAACTCTTT ACGATGCTCA TTCCAACGAT TTCCTACCTA
CCTTGGCCTT TATCCCTCCT TACCTGCCTT TGTTGCTTCA CACTCTCACA CACACTCTCA
CGCACACACT CGTTCACAAC GACAGGAAGA ACGGCAGGCC TTGCTGCACG AAGGATCCGG
CAGTGCGGCC GTCAACGCGT TCGTCGAGAT TGTCTTTGAC AATGCCGATC ACCGGTTCGC
CTTGGAGCAT TCGGACGAAG TCGTCCTGCG ACGGACCGTG GGACTCAAGA AGGACGAGTT
CTTTCTCCAG CGAAAGCGGG TGACCAAACA AGAAGTACAG AGTTTGTTGG AAGGCGCCGG
CTTCTCCCGC TCCAACCCCT ATTACATCGT GCAGCAGGGG AAAGTACAAG ATCTGTGTAC
CATGAGCGAC GTGGAGCGCC TCCGACTCTT GAAGGAAGTC GCCGGGACCG TCGTCTACGA
TCAGAAAAAA ACGGAAAGCC TCGCCAAAAT GCAGGAAAAC GGACACAGCA TTACCAAAAT
ACAGGAAATA CTCGACGACA TTGAAGCCAA GCTCGACGAA TTGCAGGCGG AAAAGGAAGA
ACTCACCGCC TACCAAGCCT TGGACCGGCA ACGCCGGGCC CTTCAGTACG TCCTCTACGA
CAAACAACTC CGGAAAGCCC GGCACGATCT CGATCAGTTG GAACACGCAC GGACATCCCA
CGTACAAAAC CTCAGCGCAC TACACGAAGA ATTGAAAGAC ACGCACGAGG CCATTCGCAA
CAAGGACGCC GTGCTCAAAA CAAAAACTAA CGCGCTCCGG CGCAACCAGA CCACCCTCGA
AGCCCACGAG CAGGACAAGA CCAGCCGGGT TACGGCCGTC ACGAGACTCC AACTCGAATG
TCAGGAGCTT CTGGAAGCTG TGCGGAGTGG CGCGGAACAG CTACAGTCCA ACGAAAAGGA
GCTCGAGGCC GTCCAGGCCG AAATTGAGGC CTCCCAAACG AACCTGCGGG ATACTGTCCA
GCCCGCCTAC GATCAAGCCG TGGCTGTCTT GCAAGACTTG ACCACTCGTC GGGACGAAGC
ATCCCGTCAG GTCGAATCAC TCTACGCCAA ACAAGGTCGG GGACAGCACT TTCAAACAGT
ACAAGATCGA GACGCCTTTT TGCAAAGCTC CGTAGAGGAA CTCTTGGCGA CACAGCGCGA
TAAAACCAGT GCCGTACAGG CGCAACAAGA CACACTGGCC AACTTGCGCC GTTCCGTTAC
GCAAGAAACG ACGGAGATTG ACAAGCTTAC CTCGCAACTC ACCAGCCAAG CTGCCGGGTT
GCAGTCTTTA TCCAAAACCA TTGAAGAAAC CAAGCGCCAA CGCTTGGAAC TACACGATGC
GCGCAAGGAA GCTTGGCGGG AGGCCGAAGC CCTTCACGAT CAAGTTCGGG AAGCCAGGGG
AACTTTTCAA CGGGCCAAAC AAGACACCGC CAAGGTCATG CCTCGCGCAA CCGCAATGGG
ACTCAAGGCG CTAACGAGTG TGGTCGAACA GGAAGGACTC ACCTCGGATC AGTATTTCGG
CATGCTCATG GACAATTTCG TTCTGCGAGA CGACAAGTAT CAAACGGCCG TCGAAGTAGC
GGCGCAGAAT GCGCTCTTTC ACGTCGTTGT GGACACCGAC GTTACGGCGT CCCGGCTCAT
GAAGCGCTTG GAGGCCGACA AGCTCGGTCG CGTAACCTTT CTACCGCTCA ACCAGCTCCG
TATCGATCAG CCCAATTTGC CCCAATCTAA CGATATTCGT CCCATGCTCG ATTTATGTCT
CCAGTACGAC CGAAAGGTCG AACGTGCCTT GCAACACGTC TTTGGCAAAA AACTCCTGGC
TCGAACACCA GAAATAGCAT CGGAATGGAG TGCTCAGCAC GGCGTCGACG CAATCACCTT
GGACGGCGAC TTGTGCAGTC GCAAGGGCGC CTTGACGGGC GGTTACGTGG ACACGTCCAA
GTCTCGGCTC CGTGCGCACG CCAAACAAAC CGAAGCCCAA GCGGCCTTGC AAAACGTGGA
GACGCTCCAT CAAGGCAAAA GTCGTGAAGC CGAGCAAGTG GAGCAGCAAG TAACGAATCT
TATGCAGGAG CTGCAGCGAC AGGAAAGCAA ACAGGCGGAG CTCAGTCGGA TGGTCCAAGG
TAAGGAAATG GAACTTGATC GAATACAGGC TCGATTAGAA AATCACAAAA AGCAGGTGGA
AATGGTCGAA AAGACGCTAA TTCCACCTTT GGAACGGGAC ATTGTTGCCT TGAACGGGGA
CATGGATCGT CTCAAGGCCG AAATGGGGAC CCCCTTGACG CAGACGCTCT CCGATGAGGA
CCGCAAGCTG TTGGCCACGC TCAAAGAAAC GCAAACACGG CTTGTTGCCG AGATTGAATC
CCAGAGTGAT AAGGTTGCTC AAGTTGGTTT GGACCGCCAA AAGCTCCAGA GTCTACTCGA
CGATAATCTG CTGAAACGTC AACGCGAGCT AAGGGAAGGT GGCGTCGACG GGCGTCGGCG
TCAGAGCCAT GGTCGTCTTT CGTCAGCCGC GGTGCAAGCG GAGCAGCAAG AAGAATTGGC
CGAATGCCAG CGCCGATTGG ACGATGCCCT GCGCGTCAAA GACGAAATTG AGGGGCGTTT
GGAAGAGTCA CGTCGGGTGG ACGAGGAGTT GCGGGGCGAA ATTATTGTGG TGAAAAATGA
GCTGGAACAG CTGAAGAGCG AGTATCTGAA TGTTTCTAAG CGCCTCGAAG AAGCGCAGAA
CGAAACTGAG CGGCTCATGA ACAAGGTACG AACGTGTTGT GGTGTATTGT GCGGTTGTGT
CTATCGTCTC TGCGACAAGG CCGTATACTG ACTTAGATTC GGCATTTCTT TATTTGACTA
CTGGTAGCGC TCCATGTGTA TTTCTACTCG CGAAGAAAAG ATGCGCTCCA TCCGGGAATT
GGGATCCCTT CCCCCTCCGG CCGAGCTCGA CAAGCACTCG GGCAAGTCCA CTGAGGCACT
CAAAAATTCC ATCGAGGGCG TCAATAAGAA GTTGAAAAAG TATTCGCACA TCAACAACAA
GGCGTTCGAC CAATACATCA ATTTTAGTGA GCAGCGCGAA TCGCTGCTGG TGCGCAAGGC
TGAACTGGAC CAAGGTGCCG AAAAGGTGGA GGAGCTCGTA TCGAGTCTCG ACCAACAGAA
GGACGAGGCC ATCAATCGTA CCTTTCGCGG GGTCAGCGCC CATTTCAAGG ACGTGTTTGA
AGAGCTCGTT CCCAATGGGG CGGGTGAGCT CATTTTACGC ACGGCCATGG ACGAAGCGAT
GGAGGACGAT GCGAACGATA CGGATCAGGA CGACGATTCC GTCAACGCGG ATTCTCCGAA
AAAGGCCAAG GGTTTCGACG CGAACAATCC GGATGTCAAT TTGTACCGCG GTATTGGCAT
CCAAGTACGT TTTTCGGCCG TGGGCGAAAA CTATCTCATG TCCCAGCTGT CGGGTGGTCA
AAAGGCCTTG GTCTCGCTGG CCCTGATTTT TGCTATTCAA CGGTGCGATC CGGCTCCCTT
TTATATATTG GACGAGCTGG ACCAGGCGTT GGATGCCTCG TACCGTGCGG CGGTGGCCAA
TCTGATTCAA AAGCAGGCCA CATCCACGGA GAATCCAACA CAGTTCATTG TGAGTACCTT
CCGTCCGGAA CTGGTAGCGA TTGCGAATCG TTGTTACGGA ATTTCATTGC AGAACAAGGT
GAGTCGCATC CACCCGTTGA GTAAGAAGGA TGCGTTACAC TTTATC
 
Protein sequence
MHIKQITISN FRSFRQQPEI EAFSTHTNCV VGRNGSGKSN LLDAVQFVLL APRFANLRQE 
ERQALLHEGS GSAAVNAFVE IVFDNADHRF ALEHSDEVVL RRTVGLKKDE FFLQRKRVTK
QEVQSLLEGA GFSRSNPYYI VQQGKVQDLC TMSDVERLRL LKEVAGTVVY DQKKTESLAK
MQENGHSITK IQEILDDIEA KLDELQAEKE ELTAYQALDR QRRALQYVLY DKQLRKARHD
LDQLEHARTS HVQNLSALHE ELKDTHEAIR NKDAVLKTKT NALRRNQTTL EAHEQDKTSR
VTAVTRLQLE CQELLEAVRS GAEQLQSNEK ELEAVQAEIE ASQTNLRDTV QPAYDQAVAV
LQDLTTRRDE ASRQVESLYA KQGRGQHFQT VQDRDAFLQS SVEELLATQR DKTSAVQAQQ
DTLANLRRSV TQETTEIDKL TSQLTSQAAG LQSLSKTIEE TKRQRLELHD ARKEAWREAE
ALHDQVREAR GTFQRAKQDT AKVMPRATAM GLKALTSVVE QEGLTSDQYF GMLMDNFVLR
DDKYQTAVEV AAQNALFHVV VDTDVTASRL MKRLEADKLG RVTFLPLNQL RIDQPNLPQS
NDIRPMLDLC LQYDRKVERA LQHVFGKKLL ARTPEIASEW SAQHGVDAIT LDGDLCSRKG
ALTGGYVDTS KSRLRAHAKQ TEAQAALQNV ETLHQGKSRE AEQVEQQVTN LMQELQRQES
KQAELSRMVQ GKEMELDRIQ ARLENHKKQV EMVEKTLIPP LERDIVALNG DMDRLKAEMG
TPLTQTLSDE DRKLLATLKE TQTRLVAEIE SQSDKVAQVG LDRQKLQSLL DDNLLKRQRE
LREGGVDGRR RQSHGRLSSA AVQAEQQEEL AECQRRLDDA LRVKDEIEGR LEESRRVDEE
LRGEIIVVKN ELEQLKSEYL NVSKRLEEAQ NETERLMNKR SMCISTREEK MRSIRELGSL
PPPAELDKHS GKSTEALKNS IEGVNKKLKK YSHINNKAFD QYINFSEQRE SLLVRKAELD
QGAEKVEELV SSLDQQKDEA INRTFRGVSA HFKDVFEELV PNGAGELILR TAMDEAMEDD
ANDTDQDDDS VNADSPKKAK GFDANNPDVN LYRGIGIQVR FSAVGENYLM SQLSGGQKAL
VSLALIFAIQ RCDPAPFYIL DELDQALDAS YRAAVANLIQ KQATSTENPT QFIVSTFRPE
LVAIANRCYG ISLQNKVSRI HPLSKKDALH FI