Gene PHATR_54192 details

Gene Information

Locus tag: PHATR_54192
Symbol: SMC5
ID: 7204040
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1008701
End bp: 1012215
Gene Length: 3515 bp
Protein Length: 1099 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186447
Protein GI: 219113727
COG category:
COG ID:
TIGRFAM ID:
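
The listed gene length matches End bp - Start bp + 1, which is what 1-based, inclusive replicon coordinates would give. A minimal sketch of that check, assuming that coordinate convention (the record itself does not state it); the function name `gene_length` is illustrative and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
print(gene_length(1008701, 1012215))  # 3515
```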


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACCTGCATCC AAAGTGCACC ATGACGAGTG ACTCGGACAG TGAAGAGGCC TACATCCGAC 
GTGTGGGTAC GCACAAGGCG GGTTCTATTA CAAGGATTAA GCTACACAAC TTTCTGACTT
ATTCCGATGT CGAATTTCGT CCCGGGCCGC GGTACGTATC CGTATTGTCC ACGCCGCGCT
CGAATGCATC CAGCCATTGC ACGGTTTTCT GAAAACGCGG AAAAGTCACG CGTTCTCAAC
TCTGTTGCGT ATTGTTTTTG TGCAAAAGTC TCAATATGGT CATTGGTCCT AACGGGACGG
GAAAGTCGTC GATTCTGAAC GCCATTTGTT TCGGACTGGG AGGGGAGCCC AAATTACTCG
GTCGGGCAGA CGACGCCCGT GCATTCATCG CACACGGCAA GGACCACGCC GAAATTGAAA
TCGAGTTGGC GCCGCTACCC GGCAAGGGTA CTCACGTGTT TCGACGTACC ATTGATCGTC
ATAAGGGGTC AGAGAAAGGC AAAGGTCGGG GGGCGTCGCA GTACTTTGTC AATGACGAAA
AGGTACACCC CAACGTGATT CGGGAAATTG TGTCGGAGGA CTATAATATT GCCATTGATA
ACCTCTGTAC CTTTCTGCCA CAAGACAAGG TGGGGAGCTT CTCGGGCTTT GATTCGAAAC
AGCTTTTGCA AGAGACGGAG AAAACGCTCT CCACGTCGCA GCACTTGTAC AGGCTACATA
TGGATCTGAT CCAAGCCGAG GCTGAGCTAC AGTCGGGAGT CGTCAACGTC GATACAATAA
AATCCAAATT AAAAAAACTA GAGCACGAGA ACAAACAGCT CGAGCAAGAA AAAATGCGTG
TGGAAGAACG CGAAGAAGCT CTAGTACAAG CCGAAGTTTT GGAGAAGAAA AGAATTTGGC
TGCAAGTCGA TGTGTTGCGC GAAGAAGCCG TCTCGTTAAA AGAAGCGAAG ACGGAAGTCA
AGGACCGCCT CAAGGCCGCT CATGCAGAGC TCGCTCCTTT GCAAGAGGAG CAGCAACGTC
TTGCAAAGGC TTGGAAGGAA GCCGATCTGC AGTTGAAGGT TCTCGAAATG AATAAGCAGA
AATGTAACAA AGAAATGGAA AAACAGCTGA AGAAGTACGA GAATCATGAC GATGGTATCG
AGGAGGCGCT CGCTATGCTA CGCGAGCTGG ACACCAAGCA TGAAGAAGTA CAAGCTCGTT
ACCGCTCTCA GGAAGAACGC GTCGCGACTT TGGAAGAGCA GCTCTCTAGT TTTGCGACCA
CTGAAGAAGA AATGACAGAC CAGTATAACG AAGCCAGGGA GGCGGCTCGT GTTGCGTCGC
GCGCATACGA ATCAGCTAAA CGCGAGCTTG CCAGACATCT CGAAAAAGCT CACCTACTGA
AAGAAAAGGG TAAAGAGGCG CAGATGAAGC TTGCGAAGAT GAACGACGAA GGTGCCCGGC
GAAAGGAACG AATATTTCGA CAAGAAAGGA ATCTCGGGCA GATTTTCGAA TGGCTGGAAA
GTAACCGGGA TAAGTTTCGT CGCCCTGTCT GGGGTCCGGT GGCCTGCGAA GTGGCAACCA
AAGATCAGAA TACTGCGGCT GCTTTGGAGC AACACGTTCC AAATTGGGTA TTGAAATCCT
TTGTCGTCGA AAACAAAGAA GACTATGATT TTTTGTTTAG CGAGATTCGT GAACGTCGTA
AAATTCCGAT CAATATTGTG AATACGGATG GCCAGCGACT TTCGGATCCG CAACGGCCGT
ACTCGGAAGA GAAAATGTCG ATCCTTCAAA AGGAGTACGC GATAGCAGGG TATCTTGATC
ATTATTTTAC AGCTCCTGAC CAGATTATGC TAGTACTTCG CAAGCAAGCA GCGGTGCACA
AAGTTCTAAT GGGAGGAGAA GAAACGAATC AAAAGCTTAC AAAACTGACG GACTTCATTT
CGGAGCCGGA TATCTCCCTA GGTCAGACCG ACAAGCAGCC TTCGGTCCTA TTCTGTTCTG
ATAATGGCAA AGCGCTTAAG TTCAGCAACG TTGTCTCAAG ATACAGTAAG GAAATCTCAT
CGCGACAGGA TGATATCAGT CAAGCTCGCC TTCTGGCGCC GGGTGTCAAT CCTCGCGTAA
AAAAAGAGGC CGAGGACAGA ATTGCAGAGG CCAATGCCGA AATGAATGAA CTGAGGCCAG
CGATTGAAGA CTCCCAGAAA GAAAAAAATA AAACTGAACT TGCAGCCCAA GAAGTGAAGG
CCAAATTACA GTCATCAAAG CAGTCTTTGG AAAGTCTGAA GAAGTTTCAG CAGAAACTGG
AGAACGCTCG CAATAAACTT GATGATGCTC GGCGCGATTT GGAAAGCGAT GACGAAAAGG
AAAAGAAGGC GCTGGTACAA TCTCTAATGA ATCGAGTTGC TCATGGGGTG TCGGCCCTGG
AAGTCCATGC TCAACAACAT GAGCAAATGC TGCTAGCAAC AATGGAGAAT GCCGGTCTTC
AAATTTCTCG CAACGATTTT TCAGTAGCCG AGCGTAGAGC GAAGTACGTA TGTGTCATTG
CTAGCCGTCG ATTGTTGTGT TCGCGTCCCT AACTAATCTT TGATTCGCAT CTCAGAGAAC
TTGTCCTCGA GAAGAACACT AGCTTTAAGG GACTTGAGAC TCGAGCGGTG AAGATTCAGA
CAGACTTCAT GAATGTGAAA AAAGAATACG CGAAGCTGAA GACGGAAGCT GAGCGAGTTG
CGCCTTTGGA AGACGAAAAC GGCAACAAAA CAGAGCTATT TGATCAGCTC CAGGAACTCG
AGGTTACGAC TTTGCATGAC TGCGAAGCCG CCCTCGACGA AGCAGTCAGC AAAGCAGATG
AATACGCGGA CAACCCGGAC GCCCTACGGC AATATGAGCG AACAAAGGCA GAGATCGAAG
AAGTTCAGAC GAAGCTGGAC GATTTGACGA GCTCGAAAGA TGCCAAGCTG CAGGAAATCC
GAAATAAGAG CAATCCGTGG CAGGCTGCGT TGGAGAACTA TGTAAGCAAG GTGGACAAAC
TCTTCAGCGA GTACATGCAG GAAATGGAAT GCACCGGCGA AATTCGTCTA AAAAGGGGTA
AAATCGATGA AGATGATGAA AATCAGATAG GCAACTTCAA GGACTGGGGT ATCGAAATTC
TGGTGAGTTT TCGGGAAGGT ACGAAGGCGC AAATTTTATC AGCACAGGTC CAGTCTGGAG
GTGAGCGTTC GGTCAGTACA ATCATGTATT TAATGGCTCT ACAAGACATG ATGGTAGCCC
CATTCCGATG CGTTGACGAA ATCAACCAAG GCCTTGACGA TCGGAACGAG CGGCTTGTCT
TTCGTCGCAT TGTTGAGAAC TCAACCCGTC CGCCGAAGGG CGAGCCGTTC GAGCACGTTG
GACAATATTT TCTCATCACG CCGAAACTTT TGCCGAATCT TGTGGATATG GAAGAGGAAG
GCGTCACGAT TCTTTTTGTT TTTAATGGTG AGGGAATGCA CCAGAGTATA TTTTTTTATG
AATTGTCTCA TCTCACACTT TGTCCTCTTG TTTAG
 
Protein sequence
MTSDSDSEEA YIRRVGTHKA GSITRIKLHN FLTYSDVEFR PGPRLNMVIG PNGTGKSSIL 
NAICFGLGGE PKLLGRADDA RAFIAHGKDH AEIEIELAPL PGKGTHVFRR TIDRHKGSEK
GKGRGASQYF VNDEKVHPNV IREIVSEDYN IAIDNLCTFL PQDKVGSFSG FDSKQLLQET
EKTLSTSQHL YRLHMDLIQA EAELQSGVVN VDTIKSKLKK LEHENKQLEQ EKMRVEEREE
ALVQAEVLEK KRIWLQVDVL REEAVSLKEA KTEVKDRLKA AHAELAPLQE EQQRLAKAWK
EADLQLKVLE MNKQKCNKEM EKQLKKYENH DDGIEEALAM LRELDTKHEE VQARYRSQEE
RVATLEEQLS SFATTEEEMT DQYNEAREAA RVASRAYESA KRELARHLEK AHLLKEKGKE
AQMKLAKMND EGARRKERIF RQERNLGQIF EWLESNRDKF RRPVWGPVAC EVATKDQNTA
AALEQHVPNW VLKSFVVENK EDYDFLFSEI RERRKIPINI VNTDGQRLSD PQRPYSEEKM
SILQKEYAIA GYLDHYFTAP DQIMLVLRKQ AAVHKVLMGG EETNQKLTKL TDFISEPDIS
LGQTDKQPSV LFCSDNGKAL KFSNVVSRYS KEISSRQDDI SQARLLAPGV NPRVKKEAED
RIAEANAEMN ELRPAIEDSQ KEKNKTELAA QEVKAKLQSS KQSLESLKKF QQKLENARNK
LDDARRDLES DDEKEKKALV QSLMNRVAHG VSALEVHAQQ HEQMLLATME NAGLQISRND
FSVAERRAKY VCKNTSFKGL ETRAVKIQTD FMNVKKEYAK LKTEAERVAP LEDENGNKTE
LFDQLQELEV TTLHDCEAAL DEAVSKADEY ADNPDALRQY ERTKAEIEEV QTKLDDLTSS
KDAKLQEIRN KSNPWQAALE NYVSKVDKLF SEYMQEMECT GEIRLKRGKI DEDDENQIGN
FKDWGIEILV SFREGTKAQI LSAQVQSGGE RSVSTIMYLM ALQDMMVAPF RCVDEINQGL
DDRNERLVFR RIVENSTRPP KGEPFEHVGQ YFLITPKLLP NLVDMEEEGV TILFVFNGEG
MHQSIFFYEL SHLTLCPLV
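
Both sequences above are printed in blocks of 10 residues, 60 per line. A minimal sketch for stripping that layout before checking a sequence length or recomputing GC content; the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its own GC figure is not stated in the record:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ACCTGCATCC AAAGTGCACC ATGACGAGTG ACTCGGACAG TGAAGAGGCC TACATCCGAC"
seq = clean_sequence(first_line)
print(len(seq))         # number of bases on that line
print(gc_percent(seq))  # GC percentage of that line only
```

Pasting the full gene sequence into `clean_sequence` and comparing `len(...)` with the listed gene length is a quick way to confirm that nothing was lost in copying.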