Gene OSTLU_18991 details

Gene Information

Locus tag: OSTLU_18991
Symbol: HMGB3501
ID: 5006737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 186780
End bp: 188648
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table:
GC content: 62%
IMG OID: 640422158
Product: predicted protein
Protein accession: XP_001422680
Protein GI: 145356938
COG category: [B] Chromatin structure and dynamics
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG5165] Nucleosome-binding factor SPN, POB3 subunit
        [COG5648] Chromatin-associated proteins containing the HMG domain
TIGRFAM ID:
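
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 186780
end_bp = 188648
gene_length_bp = 1869
protein_length_aa = 622

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon, which is not translated.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # coordinate span matches Gene Length
print(gene_length_bp % 3 == 0)             # unspliced CDS is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks agree with the record: 188648 - 186780 + 1 = 1869 bp, and 1869 / 3 - 1 = 622 aa.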


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0475639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.137229
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG AGTTTACCGT CGCGGCGCGC GCGCTCGGAC GCGGGAGCGG GACGCAGGGA 
GAGTTGACGC TGAGCAAGGC GATCGGCGCG CGGTTTCGAG CGGCGACGGG CGCGGGCGAG
CAGAAGAAGA CGGAGATCGA GGCGGGGAAG GTGCGCGAGG TGCGATGGAG CGACGCGCCG
ACGGGTGGGG TGCTGCGAGT GCGATCGACG GACGGACGGA CGCTCGTGCT GGGAGGGATG
GGGACGGAGG ACGCGAAGAA CGCGGCGGAG TACGCGGCGC GCGAGCTGGG GTGCGCGAGC
GGCGAGACGA AGATGAACGT GAACGGACGG AACTGGGGCG ACGTCGCGAT CGAGGGGAGC
GGGACGGTGT TTGAGGTCGG TGGGAAGACG GCGTTTGAGA TCGATGGGCA GTACATCTCC
GAGGCGACGG TCGTCGGGAA GAGCGACGTC GTCTTGCAAT TTCACCACGA CGACACGGCG
GCGGAGAAGG ATTCTTTGGT GGAGATGTCG TTTTACGTTC CGCCGGGCAG CGAGACTTGG
AAAGGAGACG ATATGGAAGA TCCCGATGAC ACCGCGGCGA AGCGTTTGCA CGCGGCGATC
ATGTCGATCG CCGCCGCCGA CGCCGAGGCG GGCGAACCCG TGGCGGAATT CGACGGCGTC
TCCATGGTCG TCCCGAGAGG CAAGGTTTCG ATCGAGTTGC ACAACACGCA CATGCGCATG
CAATCCTCGA CGCTGGACTT CAAGGTTCAG TACAGCTCCA TCGTTCGCGT GTACTTGCTT
CCGAAGCCGC ACTCGAACCA GTCGCACGCG GTCATCGCGC TCGACCCACC GATTCGCAAG
GGACAAACGT TTTACCCGCA CATTCTCGCC ATGTTCAACG ACGACGATCA TCTCACGGTG
GAACCAAACT TGGCTGCAGA CATGAAAGAC AAGTTCCCCA CCCTCGAGTC GACTTACGAC
GGCTCCTCGG GAGAAGTCTT CGTGCGCGTA TTGAAAAACA TGGCTGGCGT CAAGTTGACG
AGACAAAGCT TATTCACCGC CTCGGCGGGC GGACACGCCA TCCGCGTGTC GCACAAGGCT
GACGTCGGCC TACTGTACCC GCTTGAAAAA GCGTTCTTCT ACTTGCCCAA ACCTCCGCTG
TTGTTGCACT ACAGCGAAGT CGATGAGGTT GAGTTTGAGC GTCACAGCGC GCAGGGCGCG
ACGAGCGGCA GGACGTTTGA CGTCTCTGTG AATATGAAGA ATGGCTCATC GTACGATTTC
CACGGCATTC AACGGTCGGA GTTCCAGAAC TTGGTTAACT TCTTGACCGC CAAACAAGTG
AGAATCTCCA ACGTCGACGC CAACGCGCGC GCGGACCAGC TCATCGCCGA AGCTTCGGAC
GACGACGACG AAGGCTACGC GCGACGCGGC GACGACGACA GCGAAGAGGA CGAAGACTTC
GCCGCGGGAA GCGAGTCCGA CGGCGGCGAG CCCACCGACA GCGATTCCGA CAGCGAAAGC
GACGAGGGCG CGAAGAAGTC GAAGAAGTCG CCCAAAGCCA AACGCGCGAA GAAGGATCCG
AACGCTCCGA AACGAGGCTT GTCCGCGTAC ATGTTCTTCT CCGCCGCCAA GCGCGCCGAA
ATCACCGCCG CTAACCCTTC GTTCGGCGTC ACCGACGTCG CCAAGGCGCT CGGCGAAAAG
TGGAAGACGA TCACCGACGA AGAAAAGAGC GTGTACCAGC AACAAGCCGA CGAGGACAAG
ATTCGATACG AGCGCGAGAT GGAAGCCTAC CGCGCCGGCG GTTCGCAGCC CAAGGTGGAG
ATCAAGGACG ACGACGACTC CGACGCCGAC GACGCCGCGC GCGACGTCGA CGCCATGGAC
GAAGATTAG
 
Protein sequence
MSAEFTVAAR ALGRGSGTQG ELTLSKAIGA RFRAATGAGE QKKTEIEAGK VREVRWSDAP 
TGGVLRVRST DGRTLVLGGM GTEDAKNAAE YAARELGCAS GETKMNVNGR NWGDVAIEGS
GTVFEVGGKT AFEIDGQYIS EATVVGKSDV VLQFHHDDTA AEKDSLVEMS FYVPPGSETW
KGDDMEDPDD TAAKRLHAAI MSIAAADAEA GEPVAEFDGV SMVVPRGKVS IELHNTHMRM
QSSTLDFKVQ YSSIVRVYLL PKPHSNQSHA VIALDPPIRK GQTFYPHILA MFNDDDHLTV
EPNLAADMKD KFPTLESTYD GSSGEVFVRV LKNMAGVKLT RQSLFTASAG GHAIRVSHKA
DVGLLYPLEK AFFYLPKPPL LLHYSEVDEV EFERHSAQGA TSGRTFDVSV NMKNGSSYDF
HGIQRSEFQN LVNFLTAKQV RISNVDANAR ADQLIAEASD DDDEGYARRG DDDSEEDEDF
AAGSESDGGE PTDSDSDSES DEGAKKSKKS PKAKRAKKDP NAPKRGLSAY MFFSAAKRAE
ITAANPSFGV TDVAKALGEK WKTITDEEKS VYQQQADEDK IRYEREMEAY RAGGSQPKVE
IKDDDDSDAD DAARDVDAMD ED
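
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such blocked text might be cleaned, translated, and checked for GC content. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of the record is empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCGCGG AGTTTACCGT CGCGGCGCGC GCGCTCGGAC GCGGGAGCGG GACGCAGGGA"
last_line = "GAAGATTAG"

print(translate(clean(first_line)))  # MSAEFTVAARALGRGSGTQG
print(translate(clean(last_line)))   # ED
```

The two translated fragments match the first twenty and last two residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` is one way to compare against the GC content reported in the record; the value for the short excerpt alone is not representative of the whole gene.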