Gene OSTLU_32993 details


Gene Information

Locus tag: OSTLU_32993
Symbol: HMGB3507
ID: 5003401
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 216219
End bp: 217291
Gene Length: 1073 bp
Protein Length: 306 aa
Translation table:
GC content: 59%
IMG OID: 640418822
Product: predicted protein
Protein accession: XP_001419099
Protein GI: 145349350
COG category: [B] Chromatin structure and dynamics
COG ID: [COG5648] Chromatin-associated proteins containing the HMG domain
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.161398
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.121587
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTGGACT CGAAGGTGCG ACGACGACGA CGACGCGAAC GGGTGATTTC GATTTCGGGC 
GCGCGCGCCA ACGGCTTTTT AAACCGCGCG GCGAATCGGA CGAATTGGAC GCGGGAGGGG
GCGATCGCGG GCGCGAGACT GACGAAAACG CGGCGAACGC GCGATAGAAT GCGTTGCGGT
TGACGGATGA GAGCTCGCCG GAGCCGCCGA AACGGGCCAT GACGGCGTAT TTGATCTTTT
GCAGCAAGCA TCGCGAGCGA GTGATGCGGG AGGTGCACGG AGACGACGGC GCGCGGAAGT
TTTCGCGAGA CGAAATGCAG TTAGTGACGA CGCGGTTGGC GGAGATGTGG AATAACATTT
CGGAGAAGGA GAAGAAGGAG GTGCAGGCGA AGGCGGCCGC GGCGAAGGCG GAGTATGAGA
TGCAGAAGGC TGCGTTTTCG CCGGCGTTGC TGAAGAAGTT GCATCGGTTG AAGAGCAAAC
CGAAGGGAAC GGTCGTCGTG GAGGCGCAGG GTGAAAAGCC CGTGCGCGCC AAAACGGCGT
ATTTGATCTT CTGTGGTAAG CATCGCGCGG CGGTGATGCG GAAGATTCAT CCGGAACCAG
AGGCCAAGTT TACGCGCGCT GAGATGCAGC AAGTCACCAC GGAGTTGGCT GCGTTGTGGA
ATAACATCTC CCCGCAGGAG CTCGCCGAAT GCAAGGCGGC GGCGGCGAAA GAGCTCGAGC
GATACAAGCA GTTGAAAGCA GAATACCGCC CGCCGGTGTA CGGGCCATCC AAGCGGAACA
AAGGCAAGAG CGTGCCGGGC AAGCCCAAGC GCGCTCCCAC CGCGTACCTC ATCTTTGCCG
AAGAGTTGCG CGCCAGAATC AGACAGGAGC GACCGCATTT GAAGCACGAC GAAATCTCTC
AAAAACTGTC TACGGCCTGG AAGGAGATCG ACGAAGCCTC CAAGAGAATC TTCCAGCAAA
AGGCGGACGC AATCAAGGCG GATCTCATGC AAAACATGCC GAGTTCTGTG ATGCTCACGG
GCCTGGAACA CTCCTTACCG GAGCCACACT ACAATACGCA CATGTACCCC TAA
 
Protein sequence
MVDSKNALRL TDESSPEPPK RAMTAYLIFC SKHRERVMRE VHGDDGARKF SRDEMQLVTT 
RLAEMWNNIS EKEKKEVQAK AAAAKAEYEM QKAAFSPALL KKLHRLKSKP KGTVVVEAQG
EKPVRAKTAY LIFCGKHRAA VMRKIHPEPE AKFTRAEMQQ VTTELAALWN NISPQELAEC
KAAAAKELER YKQLKAEYRP PVYGPSKRNK GKSVPGKPKR APTAYLIFAE ELRARIRQER
PHLKHDEISQ KLSTAWKEID EASKRIFQQK ADAIKADLMQ NMPSSVMLTG LEHSLPEPHY
NTHMYP
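
The record's length and GC content fields can be cross-checked against the coordinates and sequences listed above. The following is a minimal Python sketch of such a check, not the database's own method. It assumes that the start and end coordinates are 1-based and inclusive (which is consistent with the stated gene length of 1073 bp) and that GC content means the fraction of G and C bases. The helper names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical and introduced only for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# The first three blocks of the gene sequence, used only as a short demo input.
demo = clean_sequence("ATGGTGGACT CGAAGGTGCG ACGACGACGA")
print(len(demo), round(gc_fraction(demo), 2))

# Start bp and End bp from the record; the result agrees with the 1073 bp gene length.
print(span_length(216219, 217291))
```

Passing the full gene sequence through `clean_sequence` first removes the spaces and line breaks of the 10-base block layout, so the same functions apply to the complete sequence.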