Gene PHATRDRAFT_36531 details

Gene Information

Locus tag: PHATRDRAFT_36531
Symbol:
ID: 7201693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 681001
End bp: 682299
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table:
GC content: 50%
IMG OID:
Product: type II DNA topoisomerase VI subunit
Protein accession: XP_002180881
Protein GI: 219120278
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGACC TTATCGAGAG CGGCGTCGTC GAAGATGGAA CCAATCAAGG CCGGTACGCC 
GTCATTGATG CGACTTTGGA AATTCCCGGA CGAGCACTAT CCGTCACGCA AACACCAGTC
TACGGCCGAC ATGATCATCT GACATCCGAC GAAGTAATCG CTCGCATCGA GCGCTTGATT
GAGACAGTGG TTTTGGCACT AGAGAGAGGT AGAATGCCCA TCCTCGAGAC GTTGTGGATC
CCTTTGGAAA ATGGGAGCGG CGGCGAAGAC GCGGTTTTGG GTGCGCAGCA AGGCGACATT
TTACGCAAGA CCTTTCACTT GCATCAGTGT CGATCGTTTA CTAGTATTTT ACTTGTTCTA
GACTTCTGTC ACTCTCTGCT ACGTGCTCGG CGTACAACAA CAACCCGTGA AGTTTACTAC
TACCACGTGA CGCACTACCG CTCCCAAAAA GAATGCGATT CCGCCATTCA GGATACCGCA
ATATTACTTC AGGTTCCACG CAGCAGTCTT GGTTTAAAAG CCTCACCAAA AGGATGGTTT
TGTGGAGATG TCCAGCTGGT GTCGAACGGT CAGGTTGTCT TGGACGGACG GCATTTGCAA
TCTATTCACG GTGCCCCCAT TAGTGGCGAA TGGCTCGCCC CTACTCGCGA CTTTACAATT
CATTCGTGCG CCGCCACATG TATCCTGGTC ATTGAAAAGG AAGGCGTGTA CAATCGTTTG
GTGGAGGATC GTTTTTTCGA TCGATTTCCT TGCATCTTGG TCACGGGCAA GGGTTTTCCA
GATTTGTCAA CCAGGGCCCT CGTCCACGTG CTGCACCACA CATTGGGGCT TCTACCCGTC
CGTGGACTCT GTGATTGCAA TCCATACGGT GTCATGGTCT TGCATACGTA TCAACATACC
GCGCGGAAAG GTGTGGATGG TGGACACCGT TTTGGGGTTC CAATATCGTG GATTGGTTTG
CGACCATCGC AAGTTCAACA GCTTCAGCGG CAGCCCAACA CCAAACATGG TCAGTCCAAA
CTGCCGGATC AAGTTTTTCA AAGCCTGACA GCTCTCGATA AGCGACGCTT AGAACATCAC
TTGTTGAGTG AGCAACATGG CTGGACAACA TTCGGACCAG ATGAGCGACG GGTGGAAGAG
TTGGAGGAAA TGCTGAAGAA CGGCTACAAG ATGGAATTGG AAGCTTTGAA CTGGTTGGGA
ATGGACTTTA TCACAAAGTG GCTTGGTGAT ATCTTTCATT ATCAAGACAG AGCGGGACAC
GGGCATGAAG GGAACAGTTG TTGGATGGAT ATTATTTGA
 
Protein sequence
MEDLIESGVV EDGTNQGRYA VIDATLEIPG RALSVTQTPV YGRHDHLTSD EVIARIERLI 
ETVVLALERG RMPILETLWI PLENGSGGED AVLGAQQGDI LRKTFHLHQC RSFTSILLVL
DFCHSLLRAR RTTTTREVYY YHVTHYRSQK ECDSAIQDTA ILLQVPRSSL GLKASPKGWF
CGDVQLVSNG QVVLDGRHLQ SIHGAPISGE WLAPTRDFTI HSCAATCILV IEKEGVYNRL
VEDRFFDRFP CILVTGKGFP DLSTRALVHV LHHTLGLLPV RGLCDCNPYG VMVLHTYQHT
ARKGVDGGHR FGVPISWIGL RPSQVQQLQR QPNTKHGQSK LPDQVFQSLT ALDKRRLEHH
LLSEQHGWTT FGPDERRVEE LEEMLKNGYK MELEALNWLG MDFITKWLGD IFHYQDRAGH
GHEGNSCWMD II