Gene OSTLU_39977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39977 
Symbol 
ID4999732 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp358476 
End bp359450 
Gene Length975 bp 
Protein Length285 aa 
Translation table 
GC content61% 
IMG OID640415153 
Productpredicted protein 
Protein accessionXP_001415467 
Protein GI145340718 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG5242] RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 
TIGRFAM ID[TIGR00627] transcription factor tfb4 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCTC GCGAAGACGA CGTGGCGGAC GATAAGTCAC TTTTAGTTGT ACTCGTGGAG 
ACGAACCCGC GGTACTGGGC GGCGAGCGAG GGGAAGGGAG ACGGCACCGC GGCGGCGAAC
GGGTTGTCGA GCGTGCTCGA GGCCACGACG GTGTTTCTGA ATAGTTTCTT TGCTTTGAAT
CAGCAAAATA GAGCGGCGGT GATCGCGGTG CACGACGATG GATGCCATTA TCTGTATACG
TCGCCGTTGG GTGGGGCCAT CGACGACGAG GCCGAGGACG CGGAGGACGC GCGTTGGACG
AACAAGGTGG GCGGCTTGGA TCCGTTGCAA ACGGAAGCGG GACCGACGAT GTTGACGCGG
TTAGGAGAGT TAAACGCTGG AGACGCTTCG GCAAGTGCGA CGGGTAAGAA GCGTACGAAG
ACGGCGACCC CATCGTCGCG CGCGGCGATG TCGTCGCCTT TCGCCGGCGC GTTGAGTCTG
GCGCTGTGTT ACTGCAACCG CGCGCAAACG CTGGAAACCG CCGCCGGTTT ACGAGTCAGA
CCGCGCATTC TGTGCTTACA AGCATCTCAA GACAATCCCA CGGATTACAT CTCGATGATG
AACGCGATAT TTTCGGCGCA ACGACAATCA ATACCCATAG ACGCGTTCGC GCTCGGCGAG
CACGATTTGC CGTTTCTCCA ACAAGCCGCG CACATCACTC GCGGCGCCTA CGTAAAGCCC
ACGCATGGCG CCGGTTTGCT TCAATACCTC CTCTCCACCG CCGCCCTGGA CATGCGCAGT
CGCTCTCACC TCAAGCTCCC CGCCGCGCGC GGCGTCGACT TTCGAGCCTC TTGTTTCTGC
CACAAGCGTC CGGTGAGCGT CGGCTTCGTG TGCTCGGTGT GCCTCAGCAT CTTTTGCGAA
CGTCGATCGT CGTGCGATAC CTGCGGCGCC GACTTCGCCG CCGACGCCCA AGTCACGAGC
GTTCCATCCG CGTAG
 
Protein sequence
MAAREDDVAD DKSLLVVLVE TNPRYWAASE GKGDGTAAAN GLSSVLEATT VFLNSFFALN 
QQNRAAVIAV HDDGCHYLYT SPLELNAGDA SASATGKKRT KTATPSSRAA MSSPFAGALS
LALCYCNRAQ TLETAAGLRV RPRILCLQAS QDNPTDYISM MNAIFSAQRQ SIPIDAFALG
EHDLPFLQQA AHITRGAYVK PTHGAGLLQY LLSTAALDMR SRSHLKLPAA RGVDFRASCF
CHKRPVSVGF VCSVCLSIFC ERRSSCDTCG ADFAADAQVT SVPSA