Gene OSTLU_36784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36784 
Symbol 
ID5006999 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp256270 
End bp258546 
Gene Length2277 bp 
Protein Length576 aa 
Translation table 
GC content57% 
IMG OID640422420 
Productpredicted protein 
Protein accessionXP_001422941 
Protein GI145357469 
COG category[K] Transcription 
COG ID[COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.00059248 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0129799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCACC GGTGGTGCGA GACGTGCGGG AAACGGGTCG CGGCGGAGAC GAACGAGGCG 
AACGGGTTCA CGTGCTGCAC GACGTGCGGG AAGATTTTAG ACGAGCGCGC GGCGTTCAGC
GCGGACGCGA CGTTCGTGAA GAACGCGCAG GGAGCGTCCG TGCCGGATGG ACACTACGTG
CCGGAGAGCG GGGTGGCGCA CGGGGTGATA CGGGCGACGC GAGGCGGAAG ACTGTACGGC
GTGCAGTTGG ACTCGCACGA GCGCACGCTG TATCGAGGGA AGCTGGAGAT TAAACAGTTG
GCGGATCGGT TGGGGATACG ACCGAGGGAG GACGTCGTGG ACGCGGCGCA TCGGCTGTAT
AAGCTCGCGG TGCAGAGAAA TTTTACGCGA GGGCGAAGGA TTTCGCAAGT GGCGGGGGCG
TGCATGTACA TCATCTGTCG ACAGGAATCG AGACCGTACA TGTTGATTGA TTTCGCGGAT
ATTTTACAGA CGAACGTGTA CGTTTTGGGG GGCGTGTTCT TACAGTTGTG TCGGTTGTTG
CGACTGGAGC AGCATCCGCT CATGCAAAAG CCGATCGATC CGAGTCTGTT CATTCATCGA
TTCGCGGACA AGTTGAACTT GGGACGACGG ATGCACACCG TGGCGAACAC GGCGCTGCGG
CTCGTGGCGT CCATGAAGCG GGATTGGATG CAGACTGGTC GTCGTCCGAA TGGAATTTGT
GGTGCCGCGT TGTGGGTCGC CGCTCAAATT CATGGATTCA GTCCGAGCAA GCGCGATGTC
GTGGCTGTGG TGCACGTCGG CGAATCGACG CTGAAGAAGC GTCTGAGCGA ATTCGAAAAC
ACGCCGAGCG CGGCGCTGTC GATCGAGGAG TTTGACACGC AAGCTCGCAC GTTTGAGGCT
GAAGAAGAAG CGAATAAAAA CACAAAATCG CTAGCGTCGA GCCCAATGTC GGTGCTGAGC
TGTGTGCACA AAGACAACGA AAACATTCCG CACTTTGCGC ACGGAATGTG TCGCGCGTGT
TACGTGGATT ACGTTAGAAT TTCGGGGGGT TCGGTGGGAG GCGCCGATCC GCCCGCGTTC
ATGCGCGCAG AAGCGAAGCG GAAAATCGAT GCAAAACAAA AGCTTTTGTT GCCCGCGCTG
TCGTCGGGCG AATTGGGAGA CGAAGACGCG TTGACGCAAG AATTTAACTC GGCGCTCGAG
CAAGACTTGA GCGCGCTGCT CGCGTCGCCT ACGCCGTTGA ATTCAGTTCA GCCCTTGGCC
TTACCTTGCT CGTCGAAACG CGCGACTTCG GCGAGGAATA TGACGAAAAA GGGGCAAAAG
CAGCAACAGC AACATCACGT TCAAACTAGT CGCCGCGAGC CGGTCGACGC TGATTTCTTG
AGACGCGCCG AGGACGCCCT GCGCCTGCTC GTCGGATCTC GATGGGCCGA ACTCGTTTGC
TTACCGTTTA CAAGCGATCT CTCCAAGGCG CGACTGTCGA AGATGCATAT GTGTGAGCTC
CATCCAAACT ACGAAACCTT TGTGAACAAC GAAGGTGAAC GCATCGCGCA AGTGGACGCG
CTGGTCTTAC ACTTTTTAAT AGCCGCGAAG TGTTTCGACG ATTCAGCTCT GGCGCAGTTG
GCAAAGCATT CGCCGCACGA CGTCGAGGCT TTCCAAACGA CGCCGTTCGA TCCATCGCAA
ACGTCATCGA AGGCCGCGCT TGCGGTCGTT GAAAGCGACG GTCTCGTCGC CAAGGAGGAT
AACGAAGTCA TCGATACGCT CTCGGACGTT GACGACGATG AAATCGATTC GTACATTCAC
AACGAAAACG AAGTCAACCT TCGTCGTTTG GTTTGGTCTG AGATGAACAA GGAGTACTTG
GAATTCCAAG CCTTGAAAGA GCAAGCCGCC AGCCGCACGA GCGCGCCGAC GAAAAAGAAG
CATAGAAAAG CCCCCGACAC GCTGCCCGCG GAGACTCCCG CGGAAGCTGC GCGTCAAGTC
TTAGCTAAAA AGAAGGGCAG CTCGAAAATC AACTACGAAG CGTTGGAAAA TCTCTTCAAA
GTTTCTGATG GTTCGCAGCC GCCTCCGAAC TCAAAAGCGA CGTCTGACGT CGAAAATGAC
GCTTCTCCGA CAAAGTCTCC TCGCACGAGA CGCGCGCGTC CCGCAGGCTT ACCCTCGAGC
GCTCCGATGT CGACGAAATC AACCGCCAAG CGTCGCGGCT CGAGCGTCTC GACGCACGCG
CCGTCGTCCG CGCGTCCGAG CGGTCTCGCG AAGAAGCCAT CGGCGAAGAA AAAGTGA
 
Protein sequence
MVHRWCETCG KRVAAETNEA NGFTCCTTCG KILDERAAFS ADATFVKNAQ GASVPDGHYV 
PESGVAHGVI RATRGGRLYG VQLDSHERTL YRGKLEIKQL ADRLGIRPRE DVVDAAHRLY
KLAVQRNFTR GRRISQVAGA CMYIICRQES RPYMLIDFAD ILQTNVYVLG GVFLQLCRLL
RLEQHPLMQK PIDPSLFIHR FADKLNLGRR MHTVANTALR LVASMKRDWM QTGRRPNGIC
GAALWVAAQI HGFSPSKRDV VAVVHVGEST LKKRLSEFEN TPSAALSIEE FDTQARTFEA
EEEANKNTKS LASSPMSVLS CVHKDNENIP HFAHGMCRAC YVDYVRISGG SVGGADPPAF
MRAEAKRKID AKQKLLLPAL SSGELGDEDA DGLVAKEDNE VIDTLSDVDD DEIDSYIHNE
NEVNLRRLVW SEMNKEYLEF QALKEQAASR TSAPTKKKHR KAPDTLPAET PAEAARQVLA
KKKGSSKINY EALENLFKVS DGSQPPPNSK ATSDVENDAS PTKSPRTRRA RPAGLPSSAP
MSTKSTAKRR GSSVSTHAPS SARPSGLAKK PSAKKK