Gene OSTLU_49589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49589 
Symbol 
ID5001736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp497763 
End bp499781 
Gene Length2019 bp 
Protein Length672 aa 
Translation table 
GC content55% 
IMG OID640417157 
Productpredicted protein 
Protein accessionXP_001417791 
Protein GI145346636 
COG category[L] Replication, recombination and repair 
COG ID[COG1389] DNA topoisomerase VI, subunit B 
TIGRFAM ID[TIGR01052] DNA topoisomerase VI, B subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00930668 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGAAAA AGCCGACGAA TCTGAACGCG AAAGGGGACA AAAATCTGAC GCAAAAGTCT 
CCGGCGGAAT TTTTCCAAGA CAACAAAGGC ATCGCTGGGT TCGAAAACGC GGGGAAATCG
CTGTACACGA CGCTTCGAGA GTTCATCGAG AACGCGTTGG ATGCGGCGGA AGCGATCGGG
ACGCTGCCGG AGATCGATAT CGCGGTGGAG GAGGTGTCGG TGTCTCAATA CGAGAGCCTC
ATCGGGCTCA AGGCGCACGT GCGGTTGGAC GACGAGCTGT ACGCGGATTT TGAGAGCGAT
AAGGCGAAGG CAAAGCGGTT GGCGCAAGAG GCGAAAAAGG CGGCGAAAGC GGCGTCGGCG
GCGTCGAAGC GAGGGGGAGG TGGGAAGCGG GATCAGGCGA AATGTTTTTA TAAGATTACC
GTCAAGGACA ACGGGAAGGG CATGCAACAC GAGGACATTC CGAACATGTT GGGACGGGTT
TTGAGTGGTA CCAAGTACGG GGTGCGACAA ACGCGAGGGA AATTCGGACT CGGGTCGAAA
ATGGCGTTGA TTTGGTCCAA GCAAACCACA GGATTGCCCA TTCGCATTCG AAGCGCGTTG
CCGAACCAGT CGTTCATCTC CGAGTACGCG TTAGACATCG ACATCGAGAA GAACGAGCCA
AACATTCATA AGGAGGCGAA ATTACCCAAT GACGATAAGT GGCACGGTGC GGAACTGTCG
GTGACCATCG AAGGTAACTG GACGACGTAC AGGCAATACG TGTTGTCGTA TTTGAGACAG
CTCGCCATCG TCACGCCGTA CGCGCAGTTT CGGTTTAGAT TTGTTACCTC TGCGGCGACG
ACGTCGACGG CGAAAGATAT CGATATCGTA TTTAGACGCC GTACCGATAT CATGCCGGCG
CTTCCCACGG TGACGAAGCA TCACCCTTCG GCGGCGAAGG AGAACATGCT CTTGATAAAG
GATTTACTTT CGCAGACGCG TGAGAAGAAT TTGCTCAACT TTTTGCATAA AGAGTTCACG
TGCATCAACA AGGACCATTC TTCTCGCCTG ATTCAAGAAC TCGGCTACGG TTTCGATACG
GACATGCACC CGTCCGAGGT GACGGATAAG CAAGCGACGA GAATACAGCA GCTTTTGGCG
GACGCGCGCT TCGACGATCC AGACGGCTCG TGCCTTTCGC CGGCGGGCGA ATACAACCTG
CGTTTAGGCA TCATGAAGGA GCTCGGCCCA GAGTGGATCG CATCATTTTG CGCTCCTGCT
TTAGCTTGCG GTGGACACCC GCTCGTCGTC GAAGCTTGCG TTTCGCTCGG CGGTCGCGAA
GTGAAACCGG GATTCAACGT GTTTCGCTTC GCAAATAGAA TTCCGCTCCT CTTTGAAGGC
GGTAACGACG TCGTTACGAG ATGCGTGCAG CGCTTGAACT GGAACACGTA TAAGATTGAC
AAGAACAACG ATAAGATTGG CGTCTTCGTC TCCATCGTGT CCACAAAAAT CCCTTTCAAA
GGCACGTCCA AGGAGTACAT CGGGGATGAA AACAACGAAA TCGCCGACGC CGTCGACAAG
GCAATCAAGC AGTGCGCCTT GCAGCTTCGA GGGAAGATTG TCCGCGCGCA AGTGGCGAAA
GATCGGAAAG CTCGCAAAAA GCAACTCGCC AAGTACATTC CTGATGTGGC GCGCGCAGTC
TTCGCGATGC TTGCGGCCGC CGCGGACGCG GACGAGCCGC CATCGAAACG CGCTCGCGTC
GGCGCCGCGT TATGGAAGGT GGACGAAAAG TGGGAGACGG ACGTTTTGGA GCGGGCGAGA
GAAGGTGAAG TGAGTGAAGC CGTGCTCAGG GCAAAGCTTG AAGAGCACGT GCAACGAAGT
GATCACGAAC AAGCGCTGGA ATTTGCCATG CAAAACAACA AGGATGGCTT AAGCGAACTC
ATGTACTTGG TGCCCAAGAC GCCCGAGCAC GAGTACCTTC CCGAGATTCG AAGCGGATCG
TGTGCGTTTA GATTTCTCAA GGCGGCTGAA TTGCGATGA
 
Protein sequence
MPKKPTNLNA KGDKNLTQKS PAEFFQDNKG IAGFENAGKS LYTTLREFIE NALDAAEAIG 
TLPEIDIAVE EVSVSQYESL IGLKAHVRLD DELYADFESD KAKAKRLAQE AKKAAKAASA
ASKRGGGGKR DQAKCFYKIT VKDNGKGMQH EDIPNMLGRV LSGTKYGVRQ TRGKFGLGSK
MALIWSKQTT GLPIRIRSAL PNQSFISEYA LDIDIEKNEP NIHKEAKLPN DDKWHGAELS
VTIEGNWTTY RQYVLSYLRQ LAIVTPYAQF RFRFVTSAAT TSTAKDIDIV FRRRTDIMPA
LPTVTKHHPS AAKENMLLIK DLLSQTREKN LLNFLHKEFT CINKDHSSRL IQELGYGFDT
DMHPSEVTDK QATRIQQLLA DARFDDPDGS CLSPAGEYNL RLGIMKELGP EWIASFCAPA
LACGGHPLVV EACVSLGGRE VKPGFNVFRF ANRIPLLFEG GNDVVTRCVQ RLNWNTYKID
KNNDKIGVFV SIVSTKIPFK GTSKEYIGDE NNEIADAVDK AIKQCALQLR GKIVRAQVAK
DRKARKKQLA KYIPDVARAV FAMLAAAADA DEPPSKRARV GAALWKVDEK WETDVLERAR
EGEVSEAVLR AKLEEHVQRS DHEQALEFAM QNNKDGLSEL MYLVPKTPEH EYLPEIRSGS
CAFRFLKAAE LR