Gene OSTLU_28032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28032 
SymbolHMGB3502 
ID5005878 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp135618 
End bp136683 
Gene Length1066 bp 
Protein Length273 aa 
Translation table 
GC content55% 
IMG OID640421299 
Productpredicted protein 
Protein accessionXP_001421849 
Protein GI145355189 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5531] SWIB-domain-containing proteins implicated in chromatin remodeling 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0307251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.690356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGACA CGCAGCGCCA GATCGCGAGA CTGCTCCCGG ACATCATCCG CGGCGCAGAC 
TTGGAAAAGG CGACCGTGCG GACGCTGCAA AAGTCGCTGG AGGATTCGTT GGGACGGGAT
TTGGGCGAAC ACAAAAACTT TATACGCGCC GAGGTGCGAG CTCGAGCGCG CGCGCGATCG
AGAAGAGACG CCCGCGCGAT GATTGATTCG CGGTTTGGTG CGGTTTGGGA GGGATTCGAC
GCGCGACATC GACGACGTTG CGTGAATTTT GGTTTAAGAT CGTTGAGAGG AATGATTGAA
TTCGCCGCGC GACGACGGCG GCGCCGGCGC TCTCGGACGA GCGCGCGGTG TGGAACGCTC
GATGACTGAC GCACGCGATG CGCTTTAACC GTGGCAGGTG GAACACTTCT TGAAGGGTGC
GGTAACGAAG AAAAGGGCGG CCTTGGATGA GGAGGGTTCG AAGGGCAAAA AGGCCAAGGC
GCAGAAAAAA ACGGGTCGGG GCAAGACGAA GGAATTAGTG GACCCAACTC GACCGAAAGG
ACCGAAAGGG GCGTACATGT GTTTTGTGAG TGCGCGAAGA TCACAAATTA AGGATGCAAA
CCCCGATATG ACGTTCCCAG ATATCGCTCG CGAGCTCGGT GTGGAGTGGA AGACGATGTC
GGAGGCGAGT CGCCATCGGT ACGAACAAAT GGCAGAGTTG GATAAAGATC GATACACGCG
GGAGATGTTG TCCTACGTTC CCTTGAGTGA TGAAAAGATG CAGGAATTGA GAGAACAACA
AAGCAGGCGA AAGGCCGCGG GGGGTCTCCA AGTCATGTAC CACTGTTCGC CCGAGCTCAC
AGCGTTTCTC GGCGGCGCGA AGACGATAAA CCGCAAGGAA CTCACGACGA GAATCTGGAA
GTATTTCCGT GAGCATAATT TGATGGATCC AATCAACAAG CGATTCATTG TCCCAGATAC
GAAATTGTCC AAGCTTTTGA AGCTTCAAGA CGGCGAAAGA TTCCTTGCGT TCACCGTCAG
TCGCTACCTC AATCCGCACT TGGTGAAGAA AGTTGAGAGT CAGTGA
 
Protein sequence
MEDTQRQIAR LLPDIIRGAD LEKATVRTLQ KSLEDSLGRD LGEHKNFIRA EVEHFLKGAV 
TKKRAALDEE GSKGKKAKAQ KKTGRGKTKE LVDPTRPKGP KGAYMCFVSA RRSQIKDANP
DMTFPDIARE LGVEWKTMSE ASRHRYEQMA ELDKDRYTRE MLSYVPLSDE KMQELREQQS
RRKAAGGLQV MYHCSPELTA FLGGAKTINR KELTTRIWKY FREHNLMDPI NKRFIVPDTK
LSKLLKLQDG ERFLAFTVSR YLNPHLVKKV ESQ