Gene OSTLU_39835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39835 
Symbol 
ID4999735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp1049178 
End bp1052618 
Gene Length3441 bp 
Protein Length1146 aa 
Translation table 
GC content53% 
IMG OID640415156 
Productpredicted protein 
Protein accessionXP_001416013 
Protein GI145341844 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.695228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTGCC TCACGCGCGA CCCCTCGAAG ATCACCGCTC CGATCGCGAC GTTGGAGGAT 
AAATACGAGC TCCTGCCGGC GTTTCTCAAG GTGAGAGGGC TGGTCAGGCA GCACATCGAC
TCGTTCAACT ACCTGGTGAA CGAGGAGATC AAAAAAATCA TTCACGCCAA GGCGAACGAG
CGAGTGACGT GCGATAGTGA TCCAAACTTT TACCTGAAGT ACACGAATAT ACACGTTGGA
CGACCGAGCG TGGAGGAAGA TTACGTGGTG GAGGAAATAA CGCCGCAGCA GTGTCGATTG
CGAGACATGA CGTACGCGGC GCCGATTACG GTGGATGTGG AGTACACGCG TGGGAAGGAG
ATCGTGACGC GTCAGGCGAA GAATGGTGTG GGTGGCGTCG TTATTGGACG GATTCCTTTG
ATGCTGCGAA GCTCGCGGTG TGTGTTGACC GGAAAAAATG AGGACGAATT GGCGAGGTTG
GGGGAGTGTC CGTTGGATCC GGGTGGATAC TTCATCGTCA AGGGCGTGGA GAAAGTGATT
TTGATTCAAG AGCAGTTGAG CAAGAATAGA ATTATCATAG AAGTCGACGC CAAAGGAGAG
ATTGGGGCAT CGGTGACTTC GTCGACGCAC GAACGGAAGA GTAAGACAAA CATCGTAGTC
AAGCACGGAA AGTTCTACCT TCGACACAAC ACGTTTGCGG ACGACATTCC AATCATGATC
GTTCTCAAGG CAATGGGGTT GGAGAGCGAT CAAGAAGCGG TGCAAATGAT CGGATCGGAA
CCAGCGTACG CGATGCTTTT GGGGCCGACT TTACAGGAGT GTCAAGCAGC TGGAGTTTTC
ACTACGCAGC AGGCGCTAGA GTATTGCGCG AACAAGGTGC GCGTGGTGAA GACGCACATG
ACGAATAGAC CGGGTGTGCG AAACTTCGGT AGATCGCGCG TAGATGAAGC GCGCGACATT
CTTGCCGGCG TCGTACTCGC ACACGTGCCC GTGACGTCGT ATGACTTTAG GCAAAAGTGC
GCTTACATTT CCATCATGGT TCGCAGAATT CTGAACGCGA TGCTCGACCC GACGCAGATC
GATGACAAGG ACTACTATGG TAACAAGCGT CTTGAACTCG CCGGTCAGCT CATTGCCCTA
TTGTTTGAAG ATTGCTTCAA ACGTTTGAAC GCTGATTTGA AGCGACAAGC CGACGCCGTG
CTTTCCAAAG CAAATCGTGC GACGCAGTTT GACATCTTAA AGTGCATTCG TCAAGACACC
CTGAGTAACG GCCTGGAACA CGCAATTTCG AGCGGTAACT GGACGGTAAA GCGTTTTAGA
ATGGAACGAA AAGGCGTGAC TCAAGTGCTG AGTCGACTGA GTTTCATCAG CGCGCTCGGT
ATGATGACGC GCATAACGTC TCAGTTTGAA AAGACCAGAA AGGTTTCCGG TCCGCGGGCG
TTACAACCGA GTCAATGGGG GATGCTCTGC CCGAGTGACA CTCCCGAAGG AGAGTCATGC
GGGCTCGTAA AAAACTTGGC GCTGATGACG CACGTCACGA CGGATGACGA GGAGGAGCCG
CTTCGCCGTT TAGCACACGC GCTCGGTGTA GAACCGCTTA CGTGGTTAAA CGCCGGTGAG
ATGCATTCGC CGTCTGGTGC GCACGTTTTG ATGAACGGTT CCCTGCTCGG CGTGCACGCG
CAACCCGAAG CGTTCTCGCA CGCGTTTAGA AAGCTGCGAC GGGCGGGGCG AATTGGTGAG
TTCGTTTCTG TGTACACTGC GGATGGATGC GTGTACATTT CTTCAGACGG TGGACGAGTG
TGTCGACCAC TCATCATCAT CGAGCGTGGT GAGCCTTTAC TGACTCAAGA ACATCTCGAC
GAGCTGAAAG ATGGTCATCG TACGTTCAAT GATTTCCTAC GCGAAGGATT AGTCGAGTAC
CTGGACGTGA ATGAAGAGAA CAATTCGTAC ATTGCGTTGT ACGAAGACGA GGTGAACGAC
GAGACGACGC ACTTGGAGAT TGAACCTTTC ACTTTGCTTG GCGTTTGTGC TGGCATCATC
CCGTATCCTC ATCACAATCA GTCACCGCGA AATACATATC AATGCGCCAT GGGTAAGCAA
GCGATGGGTA ATATTGCGTT CAATCAATTA AACCGAATGG ACACGTTGAT GTACTTGCTC
GTATACCCGC AAAAGCCAAT CGTCAAGACA AAGACCATAG AGTTAATAGG CTATGATCGT
CTCGGTGCGG GTCAAAATGC GACAATTGCC GTGATGTCGT ATAGTGGGTA CGACATCGAA
GACGCCATCG TGATGAATCG CGCGTCTCTT GACCGAGGTT TTGGCCGGTG CACGGTGTTG
CGCAAATACT CTGCGCAGGT GAAAAAGTAT AGCAATAGAA CCATGGACCG AATCGTCGGG
CCCAAAGATG ATCATCGCGC GAACCAAAAT AGTCGACATC ACTTATTAGA TGATGATGGC
ATCGCCGCCG TCGGTGGTCG CATCAAGCCT GGTGACATTT ATGTCAACAA GCAAACGCCG
GTGAACACTC GGGATCCGAT GGCGAATCCG CACGCGATGC CAGACACCAT GTATCGCTCG
AATCCGCTTT CATACAAAGG TCCCGCAGGT GAATCGGCGA TTGTTGATAA AGTCTTACTC
ACAATGACGG ATGAAGGGCA ATTCAATATC AAGACGCTCG TGCGTCAGAC GCGTCGACCA
GAAGTTGGTG ATAAGTTTTC TTCTCGTCAT GGCCAGAAAG GCGTTTGCGG CATCATCCTC
GATCAGGAAG ATTTCCCGTT CAGCGAACGT GGGATTACTC CTGACCTCAT CATGAACCCG
CACGGGTTCC CGTCACGTAT GACGGTCGGA AAGATGATCG AACTTCTCGG CGGTAAGGCT
GGACTCGAGA GCGGCCGCTT CCACGACGGA ACCGCGTTTG GTGGAGACAC GGTTGAAGCC
ATTCAACAAA CCTTGGTTGA ATCCGGGTAC TCTTACAAAG GCAAGGACAT GCTTCACTGT
GGAATCACCG GCGAAGCGCT TGAAGTCAAC GTTTTCATGG GTCCAGTGTA CTATCAAAAG
CTCAAACACA TGGTTCAGGA CAAGATGCAC GCTCGCGCGC GCGGTCCTCG CGTCGTCCTC
ACACGCCAAC CCACCGAAGG TCGAGCCCGC GACGGTGGTT TGCGCCTCGG CGAGATGGAA
CGCGATTGCC TCATCGGTTA TGGCGCATCA CAACTTATTT TGGAGCGTTT GATGATCTCT
TCCGATCAAT TCGAAGCGCA AGTCTGCACC AAGTGCGGCT TGCTCGGATT CCAGCACCAC
ATCACCCGCA GGAACGCGTG CACGCTGTGC AAAACTGAGG CCGAAGTCGC GACGTTAAAG
CTCCCGTACG CGTGCAAGCT CTTATTCCAA GAGTTGCAAT CCATGAACAT AGCACCTCGG
TTATCTTTGA CCGAGGCTTA G
 
Protein sequence
MDCLTRDPSK ITAPIATLED KYELLPAFLK VRGLVRQHID SFNYLVNEEI KKIIHAKANE 
RVTCDSDPNF YLKYTNIHVG RPSVEEDYVV EEITPQQCRL RDMTYAAPIT VDVEYTRGKE
IVTRQAKNGV GGVVIGRIPL MLRSSRCVLT GKNEDELARL GECPLDPGGY FIVKGVEKVI
LIQEQLSKNR IIIEVDAKGE IGASVTSSTH ERKSKTNIVV KHGKFYLRHN TFADDIPIMI
VLKAMGLESD QEAVQMIGSE PAYAMLLGPT LQECQAAGVF TTQQALEYCA NKVRVVKTHM
TNRPGVRNFG RSRVDEARDI LAGVVLAHVP VTSYDFRQKC AYISIMVRRI LNAMLDPTQI
DDKDYYGNKR LELAGQLIAL LFEDCFKRLN ADLKRQADAV LSKANRATQF DILKCIRQDT
LSNGLEHAIS SGNWTVKRFR MERKGVTQVL SRLSFISALG MMTRITSQFE KTRKVSGPRA
LQPSQWGMLC PSDTPEGESC GLVKNLALMT HVTTDDEEEP LRRLAHALGV EPLTWLNAGE
MHSPSGAHVL MNGSLLGVHA QPEAFSHAFR KLRRAGRIGE FVSVYTADGC VYISSDGGRV
CRPLIIIERG EPLLTQEHLD ELKDGHRTFN DFLREGLVEY LDVNEENNSY IALYEDEVND
ETTHLEIEPF TLLGVCAGII PYPHHNQSPR NTYQCAMGKQ AMGNIAFNQL NRMDTLMYLL
VYPQKPIVKT KTIELIGYDR LGAGQNATIA VMSYSGYDIE DAIVMNRASL DRGFGRCTVL
RKYSAQVKKY SNRTMDRIVG PKDDHRANQN SRHHLLDDDG IAAVGGRIKP GDIYVNKQTP
VNTRDPMANP HAMPDTMYRS NPLSYKGPAG ESAIVDKVLL TMTDEGQFNI KTLVRQTRRP
EVGDKFSSRH GQKGVCGIIL DQEDFPFSER GITPDLIMNP HGFPSRMTVG KMIELLGGKA
GLESGRFHDG TAFGGDTVEA IQQTLVESGY SYKGKDMLHC GITGEALEVN VFMGPVYYQK
LKHMVQDKMH ARARGPRVVL TRQPTEGRAR DGGLRLGEME RDCLIGYGAS QLILERLMIS
SDQFEAQVCT KCGLLGFQHH ITRRNACTLC KTEAEVATLK LPYACKLLFQ ELQSMNIAPR
LSLTEA