Gene PHATRDRAFT_30352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_30352 
SymbolSMC2 
ID7195804 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp252225 
End bp256294 
Gene Length4070 bp 
Protein Length1213 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184096 
Protein GI219127758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGATACGAAG TGTCATCGGT TTATTTGCCT TTCTTTTCGC AAGAGAGAGC GTACGCCAGA 
AAAGGGGATC TCTGAAGCGC TGTTGATAGT ATTATCGTCG TGTGAATCGA TAAAATTCTG
AGGCAGTCAG CAGCACCGGA AACGCTCCAT TTGGTTAACC ATGTTCATTC AAGAAATTGT
CATTGATGGT TTCAAGTCCT ACGCTCGTCG AACAGTCGTG GAGGGGTAAG ATAAAGCGAG
TGACCTTTAT GAAGGTCGGA TTTTGTTTCG TGCTAAATCT CACGTCTTTG CACGATGTAC
TATAGTTTCG ATCCTCACTT CAACGCCATT ACCGGCTTGA ACGGGTCTGG TAAATCCAAC
ATTCTCGACG CCATTTGCTT CGTGCTCGGT ATAACGAACC TGTCCCAGGT ACGCGCCGGA
AATCTTTCCG AGCTCGTCTA CAAACAAGGA CAGGCTGGAG TAAACAAAGC CACCGTTACG
ATCATTTTCA ACAACGAGGA CGAATCTTCC AGTCCAGTAG GTTACGAGCA ATGCCCTCAG
GTTACCGTCA CACGCCAGGT TTTGATCGGC GGCAAGAGCA AATATCTCAT TAATGGACGC
AATGCCCCCG CGAATCAGGT ACAGAATCTT TTCCATTCGG TACAGCTCAA CGTGAACAAT
CCCCACTTCC TCATTATGCA AGGACGCATC ACCAAGGTCT TGAATATGAA GCCGCACGAA
ATTCTCGGTA TGGTCGAAGA AGCTGCCGGT ACCCGCATGT ATGAAACGAA ACGCGTCGGT
GCCTTGAAAA CGATTGAGAA AAAGCAGCTC AAGCTGGATG AACTCAACGC AGTTCTAGCC
GAAGAAATTA CACCCACTTT GGAACGATTG AGGGGCGAAA AACAATCCTA CCTTAAATGG
AGCAAGAACA ACGCTGACAT GGAGCGTATT GAACGTTTCG TCATTGCCAA TGAATTCATG
CAGGCACAGA AGGCTTTGGA TAATAACACA GAAGGCTCCG CCGAAATGGA AGAGCAGGTT
GCCATTCTAG ACGACAAGAC TTCGCAAATT CGAGAACTCA TTGTTGCCAA GGAACGCGAG
ATTGAAGAGC GCTCGTCCTC CCTCAAAGGA GAGTTTGAGA ATTCACATAA CGAAGCAAAA
GTTTTGGAAG AGCAGCGCTC CAAAGATCTC GTCAAAATAA CTTCCTCCTG GAAAAATGCA
AAGACCAATG TTACCAAGGC AGAGAGCGAC CTGGACGCGG CGCGAAGTCT CGTCACTGAA
ACGAAACAAG CGGTAGTTGC CAAGGAAAGC GACATCGCTA CTGAATCGCA GAGCATTGAA
CACAAGATTC TGGCTGCCAA AGAGGCTGAG GAACGACTTG CACGGCTAAC TCTGGACTAC
CAGAACATGT CGGCCGGTAT CAGTTCCACA GAAGGAGACG AAGGCCGTAC ATTACCGGAA
CAGATAAGCA AGGCGCACAG CGATTCAAAG TCAGCCGAGG CAAAGGTGCA GCAAGCCAGC
ATGAAGATGA AGCACTTGTC AAAAGAGTTG AAGGTATGTG TACTTGTCGA TATTTTTGGT
TTAGTTTTTT ACCCTTTCGA CTCAAGAGCC AAATTGCTCT GACTGTTTCC AGCTGGTCGA
GAAAGACCTC CAAAAGGAAG GGAAAACTGC TGAAAAGATG GCCCAGAAGC GTGCAGTAGC
AGCTCACAAA GTAGAGGATT GCCGTGGCAA GCTTAAAGAT ATGGGCTTTT CGCCGGAAGA
GTTCAATGCT CTGGATCAAG AAAAGACAGA CCTGGAAATT ACCGTCTCGG AGTTGTCGGA
GCGCGTTGAC ACGCTTTCCG CACAGCTTGA AGGGAGGCTC CGTTTCAAAT ATTCCGATCC
CGTGCGTGGG TTTGATCGTA GCAAGGTCAA GGGGCTTGTG GCAAAGCTCA TCGAAGTGAA
GGATCACAAG AATGCTACTG CTTTGGAGGT TGTTGCCGGT GGAAAGCTGT ATCAAGTCGT
GGTCGACGAA GCAATTACTG GTAAAGCGCT TTTGGACCGC GGCAAGTTAG AGCGACGTGT
GACCATCATC CCACTGGACA AGATCAAGCC GCGTAATGTT AGTCACACTG CTTCGGAACT
AGCCAATGAT ATTTCCCAGT CGCTCGATTC GAGGGCTTCT CCGGCAATCG AATTAGTTGG
TTTCGATGAA GAGGTTCGTA GTGCCGTTGA GTACGTCTTT GGCTCAACTA TTGTGGTCGA
CGGCATGAAA GCTGCAAACG CTATCTGCGA TGCAACAAAA ACACGGACCG TTACCTTGGA
AGGCGACGTT TACGATCCGT CGGGGACTAT ATCTGGTGGC TCCAACAACC AATTGGGGAC
AACTCTAGTC AAGCTCACTG AACTAACTCA AGTGACAAGT AAGCTCGACG AAAAGCGCTC
GCTCCTTGCT TCTATATCGA TGAAAGTGAA GTCTATGGCT ACGCATGCTT CTTCCTACGA
CAAGCTCAGC GCAACTTTGG AGCTAGCAGA GGCGGAACTG AGCAATATCG ATAAGCATCT
GTCACAGACT AGCTTTGGTA TGCTGGTTGA GCAGCGTGAT TCTATGGCTG CCGAACTGGA
AGCGGCCCAG AACGAGTCGA TTGAAATGGA AGAGGAGAAA GAAAAAAAGT GGACACTCTT
TGTCAATCTC CAGGCACAAG AAGCTGAATT GACCGAGCGT CGAGAACAAC GCTTAGCTGA
GATTGATCAA GCGGTCAAAG ATGCAAAAGC TGACACTGTT GAGAAAGGGC GCATCGCTCG
ACAGGCGGAC TCAAAATCTC AAACATTTTC TTTGGAACTC GATAGTCTCC AAGCTGAGGT
CGCAGCAGCA GAGGAAGCCG TTTCAGTAGC GGAGCAACTA CTTGATGAGG CCACGGGTGA
CGAATCGAAG GTACAAATGA AAGTTGGAGA AGTTCGCGCG CTGTACGAAG AAGCGAAGAA
AGAGTTGGAT GAACTTGACG GCCGCCTAAA TTTATACTCT GCCAAGCTTG TGGAGCTCAA
ACGCGCCAAG AGCTATCTCG TCAAAGAAGC CGAAGTGGCA ACCTTGGAAG CCAAGAAATT
GTCCGTGACT ATCACTCGGA TTCACAAGGA ACGAAGTGGG GCGGAAAAGC TTGTTGCCAC
ATTGATGAAA AAGTATGCTT GGATCGACAG CGAAAAGAGC GCTTTCGGGG TGCCCGGGGG
AGACTACGAC TTCGAAGAAA CAAACCCGCG CCATGTTGGG CAACAGCTAC AGTCTCTCAA
AGCCGAACAG GAATCCTTGG TAAGCACTTC ATGCTGTTTA AAACCTTTTC AAAGGTACAC
TCACATTTGC ATCTCCACAT AGTCCAAGAA AATCAATAAG AAAGTTATGG GAATGATTGA
GAAAGCAGAA GGGGAATACA CTGAGCTTTT GCGAAAGCGG AAGGTGGTCG AGAACGACAA
GAAGAAGATA CAGGCCGTCA TTGAGGAATT GGACGTTAAA AAAAAATCGG AACTCGAGCG
TACTTGGGTC AAGGTCAATC GGGATTTTGG ATCCATATTT TCGACACTGT TGCCCGGCGC
TTTTGCGAAA CTTGAACCTC CGGATGGCAT GAAAGCCTGG GAGGGTCTCG AAGTGAAGGT
GGCTTTCGGT GACGTCTGGA AAGACAGTCT GAGCGAACTC AGTGGTGGAC AGCGGTCTCT
ATTGGCTTTG TCCCTAATTC TGTCATTGCT ACTTTTCAAG CCTGCCCCAA TGTACATTCT
TGATGAAGTC GATGCCGCTC TGGATTTGAG TCATACCCAG AACATCGGAA ATATGTTGAA
AACCCACTTT TCGCAGAGTC AGTTCGTTGT CGTGTCGCTG AAAGAAGGCA TGTTCAACAA
TGCCAATGTC ATTTTCAGAA CGAAGTTTGT GGACGGGATT TCTACGGTTA CTAGAACAAT
TGGAATTGGG TCCAGCCGCA ATCGTGCCTT AGCCGAATCT GACAACGCCG ACTCCACCAA
TACTTCTGAA AAAGGACGAA CAGAGCAGTC AAGGAGAATT GGCAAAGAAA ATACTGTAGT
TTAAGGGTTT TGTCGACAGC TCACAGTCCA GCATAATTGT AGTGCTGTTT
 
Protein sequence
MFIQEIVIDG FKSYARRTVV EGFDPHFNAI TGLNGSGKSN ILDAICFVLG ITNLSQVRAG 
NLSELVYKQG QAGVNKATVT IIFNNEDESS SPVGYEQCPQ VTVTRQVLIG GKSKYLINGR
NAPANQVQNL FHSVQLNVNN PHFLIMQGRI TKVLNMKPHE ILGMVEEAAG TRMYETKRVG
ALKTIEKKQL KLDELNAVLA EEITPTLERL RGEKQSYLKW SKNNADMERI ERFVIANEFM
QAQKALDNNT EGSAEMEEQV AILDDKTSQI RELIVAKERE IEERSSSLKG EFENSHNEAK
VLEEQRSKDL VKITSSWKNA KTNVTKAESD LDAARSLVTE TKQAVVAKES DIATESQSIE
HKILAAKEAE ERLARLTLDY QNMSAGISST EGDEGRTLPE QISKAHSDSK SAEAKVQQAS
MKMKHLSKEL KLVEKDLQKE GKTAEKMAQK RAVAAHKVED CRGKLKDMGF SPEEFNALDQ
EKTDLEITVS ELSERVDTLS AQLEGRLRFK YSDPVRGFDR SKVKGLVAKL IEVKDHKNAT
ALEVVAGGKL YQVVVDEAIT GKALLDRGKL ERRVTIIPLD KIKPRNVSHT ASELANDISQ
SLDSRASPAI ELVGFDEEVR SAVEYVFGST IVVDGMKAAN AICDATKTRT VTLEGDVYDP
SGTISGGSNN QLGTTLVKLT ELTQVTSKLD EKRSLLASIS MKVKSMATHA SSYDKLSATL
ELAEAELSNI DKHLSQTSFG MLVEQRDSMA AELEAAQNES IEMEEEKEKK WTLFVNLQAQ
EAELTERREQ RLAEIDQAVK DAKADTVEKG RIARQADSKS QTFSLELDSL QAEVAAAEEA
VSVAEQLLDE ATGDESKVQM KVGEVRALYE EAKKELDELD GRLNLYSAKL VELKRAKSYL
VKEAEVATLE AKKLSVTITR IHKERSGAEK LVATLMKKYA WIDSEKSAFG VPGGDYDFEE
TNPRHVGQQL QSLKAEQESL SKKINKKVMG MIEKAEGEYT ELLRKRKVVE NDKKKIQAVI
EELDVKKKSE LERTWVKVNR DFGSIFSTLL PGAFAKLEPP DGMKAWEGLE VKVAFGDVWK
DSLSELSGGQ RSLLALSLIL SLLLFKPAPM YILDEVDAAL DLSHTQNIGN MLKTHFSQSQ
FVVVSLKEGM FNNANVIFRT KFVDGISTVT RTIGIGSSRN RALAESDNAD STNTSEKGRT
EQSRRIGKEN TVV