Gene OSTLU_33057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33057 
SymbolHAG3501 
ID5003090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp321091 
End bp322599 
Gene Length1509 bp 
Protein Length447 aa 
Translation table 
GC content58% 
IMG OID640418511 
Productpredicted protein 
Protein accessionXP_001419344 
Protein GI145349859 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCGCGCCCG CGCGCGTCGA ACGCGCCCGC GCGCTCGAGC GATCGGAAGA AATCTTTTCG 
AGCGCACGGC GCGCAGGAGA TCGCGCGGAC GCGCTCGCGG GTGGCGTGCG AGCGCGTTGA
ATTAGCTCGT CGGGACATCG CGCGGAACGG CGATCGGCGA GCGCGATGCA GCGCGGCACG
CGCGCGCGTG AGGACGCGAG CGATCAGGAT GATGTCTCGG GAAGCGCGCG CGGCGTGAAG
CACGCGCGCG CGGATGAGAG CGGCGCGAAC GGGTCGAGGG ACGGCGCGGG GGCGGCGGAC
GGAACGCCGA ATAAACAAAC GACGACGACG GGCGCGACTG CGGGCGCGAC GGGGGCGACG
TCGCCGGTGT CGCCGTCGAG AGGGGCGTAC GCCACGCGAG AATCGCACCT GCGTAAGCAA
GAGCGTGATG GGGAGCTGAA GTGGGAGGTG ATTAAAAACG ACGGGAGCGA GGCGAATTCG
CGCTTGCTCG TGGCGCTGAA GAACATTTTT AGCAAACAGC TGCCGAATAT GCCCAAGGAG
TACATCGTGC GGTTGGTTTT TGACTCCAGA CATTACTCGA TGCTGTGCAT GAAGAATGGG
AACGTCATCG GGGGGATCAC GTACAGGCCG TTTCCGAAGC AACGCATGGG TGAGATTGCT
TTTTGCGCGG TGAGCGCAAA TGAACAGGTG AAGGGATACG GTACGCGGTT GATGAATCAC
ATCAAGGAGT ACGCTAAGGA AAAGGAAAAT ATGACGCACC TGATCACGTT CGCGGATAAC
AATGCGGTGG GGTACTTTCA AAAGCAAGGA TTCACAAAGG AAATCATGAT GGAGCGCGAA
AAGTGGTACG GGTACATCAA AGAGTACGAC GGTGGGACGA TTATGGAGTG TCAATTAGAT
GGGCACGTCT CCTACGTGGA CTTTGTGAAT CAGATCCGAG AACAGCGCAA GGCGGTGGAG
GCCAAGGTAC GAGAGATGAG CACGGCGCAC AAAGTCTACC CGGGATTGAA GGACCACTTC
AAGCCGAGCG CCGAGGGCAA GTATATCCCC ATCGACGTCA AACACATCAA AGGCTTGAAA
GAGGCGAAGT GGGAGGACCC AGGACTACCA AAGTACCGTT TAGTCCACCC AGGATGCGGC
GATGGCATAC CCACGAAGGC AAACTTGCAT AAATTCATGA GAGCCATCGT AAACGTGATT
CAGGCGCACT CCGATGCGTG GCCGTTTGCC GCACCCGTGA ACCCGCTCGA AGTCACAGAC
TACTACGACG TCGTCAAGGA TCCCGTCGAT ATGGAACTCA TCCAAGAACG CGTCTCGGCG
GGGAATTACT ACGTCTCTTT GGAGATGTTT TGCGCCGACT TCCGTTTGAT GTTCAACAAC
TGTCGCATAT ACAACTCACG CGACACTCCG TATTTCAAAG CGGCCAATCG TCTCGAGGCG
TTCTTCGAAT CGAAAATCGC CGCCGGGGTG AATTGGAAAA TTCGCGAGGC ACCTTCTCGA
GGTCGATGA
 
Protein sequence
MQRGTRARED ASDQDDVSGS ARGVKHARAD ESGANGSRDG AGAADGTPNK QTTTTGATAG 
ATGATSPVSP SRGAYATRES HLRKQERDGE LKWEVIKNDG SEANSRLLVA LKNIFSKQLP
NMPKEYIVRL VFDSRHYSML CMKNGNVIGG ITYRPFPKQR MGEIAFCAVS ANEQVKGYGT
RLMNHIKEYA KEKENMTHLI TFADNNAVGY FQKQGFTKEI MMEREKWYGY IKEYDGGTIM
ECQLDGHVSY VDFVNQIREQ RKAVEAKVRE MSTAHKVYPG LKDHFKPSAE GKYIPIDVKH
IKGLKEAKWE DPGLPKYRLV HPGCGDGIPT KANLHKFMRA IVNVIQAHSD AWPFAAPVNP
LEVTDYYDVV KDPVDMELIQ ERVSAGNYYV SLEMFCADFR LMFNNCRIYN SRDTPYFKAA
NRLEAFFESK IAAGVNWKIR EAPSRGR