Gene OSTLU_43496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43496 
Symbol 
ID5006565 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp158624 
End bp163846 
Gene Length5223 bp 
Protein Length1663 aa 
Translation table 
GC content54% 
IMG OID640421986 
Productpredicted protein 
Protein accessionXP_001422669 
Protein GI145356916 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0622972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCG ATGCGACGGC GACGACGGCG CGACACCGGG TCAAACAGCG CGGACTGGTG 
CAGAAAGAGA TCGCGAGCGC GAGCTTCAGC TTTTACGACG CCGAGGATGT GCGCAAGATA
TCGGTGAAGA GGATCACGAA CCCGGTGCTG TTCGACGGGT TGAACAACGC GGTGGCGGAT
GGTCTGTACG ACCCCGCGCT CGGACCGACG GACTCGAAAA CGACGTGCGT GACGTGTAAA
TTTCCGGGAG GGATGTGCGC GGGACACTTT GGACACCTCG AGCTCGTCGT GCCGGTGTAC
AACCCGTTGA CGTTCGGCAC GGTGGTGCGG TTGCTGAAGA CGACGTGTTT TCATTGTCAC
AAGTTTCGCT TGCACGCCAG TCGCGTGCGA AGGTTTCGAG AGCGGTTAGA GATGCTCATG
GATGGTGACA TGGAAGCGGC CGAGGGAGTA CTTCCGGAGA TTAGCAAGAA GGCGAAGGAA
GAGATGAGCT CAGTGTTTAA AGAGGTCGAG GGCGACGGCG ATGCGGAGGA GATGGATTTA
GACAACATCC ACGACGTCTT GCCGAGATTG AAAACGCGTG GTCGAGGTGA ACCGGTCGTA
TGGACTTCGA TCACGTCCAC GGCAGCGCGA AATTTAATCA AGGAGTTTCT CGCGATACAA
CCGAAGAAGT GTGAGAACTG TGGGGCTATG AATCCGAAGG TGTCGCCCGA GGGACACAAT
AAGATCTTCA GAGGTGCACT TCCGAAGGCG CATCACGAGA ACAACTTAGC CAAGGGCATT
GACATCAACG ACGATATGGC GTACCTTGCT CGCGAAGCTA GCGCCGAGAG CGCAGATTCA
CACGCTGGAG CGACGAAGTT GGCCGGCGCA GCTGTGGAAC CGAAGCTCGT GCCGGCGAAA
AAGAAGGCGC GCAAGAGACT CGGCGAAGGG GATGAGTCGG ATAGATCGAC GGATGACGTG
GGTGCGGCTG AAAAGAGTAG TGAAGCCCGA GACGTCGACG CGCAGTCTGA TAGCAGCGAC
GATAGCGATA GTAGCGACTC AGAGACGATG TCCGTGGATG AGCGCGTGCA GGAGGCGGCA
AAGTCGCTTT ACATCACCCC AATTGAGGCT CGAGCTTTGC TTAAACGTTT ATGGATGTAC
GAGTACGATT TTTGTTCCAT GATCTGGGCG ACGACCCCAC CGAATAAGTG CACGAAGCGA
GGCGAAGAGC GTCGAAGTGA TCCGGCGAGG TTTTTCATTC AAACACTCCT CGTGGCTCCG
TCAAAGTTCC GGCCGCCAAG CAAGATGGGC GATATGATCT TTGATCACCC GCAAAATACG
GCGCTCACGA CGATTATCCA GGCCAATTTA AGTCTGGCCG AGCTCTTTAG GACGCCTCCA
ACGGTCCCAG AGCCGCCTGA GGTGCGAGCA GGCCGAGCCG TACGCGCTTG GTTGGCTTTG
CAATCCGGCG TGAATCGGCT CATCGACGCC ACGAAGGCTG ACACCCAAGA AGCCAAGCAG
GCTATTGGGA TTCGTCAGCA GCTTGAGAAG AAAGAGGGTT TATTTCGCAT GAACATGATG
GGTAAGCGCG TGAATTTCGC CGCACGGTCG GTGATTTCCC CCGATCCATA CCTGGGCACG
AGCGAAATTG GCGTTCCTCC GGTGTTCGCG AAAAAGCTCA CGTTTCCCGA GCTCGTCACC
CCACATAACG TTGACTTGAT GCGCACACTC GTTGAAAACG GACCTGAAAT CCATCCAGGG
GCGAACGCGA TCGAAGACGA ACGAGGGCGT GTGATTCACT TGGACAAGTT CACCGCCGAA
AAGCGAGCTG CCATAGCGAA GACTCTCTTA GCAACGACGG CCGCTGGATC TGCGGACGGG
CCGGCGAGAC CGCTCGCAAA GACGGTGTAT AGACATTTAC GCGACGGCGA CGTGATGCTC
GTTAATCGTC AGCCAACGTT GCACAAGCCT GGTATTTTGG CGCACACTGC GCGGGTTCTA
CCGGGGCAGC GAGTCGTCCG TATGCACTAC TCCAACTGTT CCACCTTCAA CGCCGATTTC
GATGGGGATG AAATCAACCT TCACTTTCCG CAGGATCACT TAGGACGAGC CGAAGCGTAT
GAAATCATGC ACGGCGATCG TCAGTTCACC GTACCAACGG ACGGGAAGCC TCTCAGAGGG
TTGATCCAAG ACCACATCTG TTCCGGGTTG TTGCTCTCCA TGCGAGACAG CTTCTTCGAT
CGATCCGAGT TCACGCAGCT TCTTTACAGC GGTCTCGTGG ACTACTGCGG TGATGAGCAC
GGGAAAATCG ACGTCCCGGC GCCTGCTCTC CTCAAGCCAA AAGCACTTTG GACCGGCAAA
CAAGTCATCG CCGCGGTTTT ATCGCACATC ACTCGAGGGC GACCACCGCT AACGTTCAGC
GCACCGTGCA AGATCCCAGC CACGTTTTTC GGCGGTGAAG ACTCTGGCGA AGATCGACTG
ATTATTAGAC GAAACTACTT TTGCTCTGGG GTCGTAGACA AGAATATGTT CGGCAAGTAC
GGCCTTGTGC ACGCCGTGGC CGAGCTCCAC GGACGGTCCA CAGCCGGCGC TTTATTGTCA
ATTTTTTCCC GACTTTTTAC CAACTTTTTG CAAAAGCATG GCTTTACGTG CGGTATCGAT
GATTTGATTC TCACCGCTGA TGCAGAAAAA GATCGCGTAG TGGAGCTAAA CAAGGCAGAC
GAGATGTGTA AGACGGCTAC AGCAGATGTC GCAGAAGCGA GCGGGAAATC GGATGAGGAA
GTGATGACGG CTATCGCGGC GAAGTTGCTC GAAAATCCCG AATGGGGCGC TCAATTGGAC
ATGAAAGCGT CAGGCGCTTT GAACAAGGTG ACTTCGGCGA CCGTGAAGAA GTGTCTCCCG
TTCGGTACAA AGAAGCCATT CTCAAAGAAT TGTCTCTCCA TCATGACCAT CTCTGGCGCA
AAGGGTTCGC TTGTGAACTT TTCTCAAATC GCCGCCGCCT TGGGCCAGCA AGAGCTCGAG
GGTCGCCGTG TACCTCGCAT GCCGAGTGGT AAGACGCTCC CTTCATTCGA GCCTTTCGAC
ATCAGCTCGC GGGCGAACGG TTACATCGCC TGTCGGTTTT TTAGCGGATT GGATCCCGCC
GAGTACTTTT TCCACTGCAT GGCTGGTCGC GAAGGTTTGG TCGATACCGC CGTGAAGACT
GCGCGCTCTG GTTACTTGCA GCGTTGTTTG GTGAAAAACC TAGAATCGAT GAGAGTGCAT
TACGATTTTA CCGTGCGTGA CAGCGATGGA AGCATTGTGC AGTTTCAATA CGGGGAGGAT
TGCGTCGACG TCACGCGTTC TGGGTACTTG GAGAAGTTTG AGTTCCTTGC CGAAAACCCG
GAACTCATTT TGCTCAACAA CGAAGCCGCG ATTTCGATGT TGCCCAAGCT CAACAAGAAA
AAAGTAAGTG TTCTTGAGAC CATAGGTCGT AAAGAAGAGC TACCGCGATT CTGCATGGAA
AACTTTGGCG ACCAGTTGGG AGTGTTGCCA GAGAAATTCG GCGATGAACT TAAAACGTTC
ATCGACTCGC GACCGAAAGG TTACTGGGCC GAAGAGAAAG GAAAGAAGAA TGAAAGTTCA
TTGGCGGCAA AGTCCGGTTT GACCGCTGAA GAGTTTGCCA TGCTGATGAA TATGCGTTAT
CTCACGTACG TTGCGCCCCC CGGCGAAGCC GTCGGCGTCA TTGCCGCGCA GTCTGTTGGA
GAACCATCCA CACAAATGAC GTTGAATACC TTCCACTTTG CCGGCCGCGG CGAAGCAAAC
GTCACGCTTG GTATTCCTCG TCTTCGAGAG TTGCTCATGG CTGCATCAAA GAAGCTTTTG
ACGCCCGTCA TGATACTTCC GTTGAAACCT GGGTGCAGAA CTAAGGAAAA TGCTGAGACG
CTCTGTCGCC GACTCCGACG AGTGATCCTC GCCGAACTCA TCACAAAGCT TTGTATCAAG
GTGAAAGACT ACGGTATAGG TCCAGATGAG GGACTCTCAC GATTGTATAC CGTTGTCATT
CAAATGCGCG AAGGTCAAAA TGACGACGAT CCAAATGAGG TGACGTTTGC AGAATTTCTG
CACGCCGTCA AGCGCAAGTT CGCCAAAATG CTCGTGGCGA GAATTGGCTC TGACATGAGG
AAGAGCAAAA GCAACAATGG TGTCATCGTG AAAAACGCGA AAAATGGCCC GATGCAAGGA
ACCACGGACC CTCGCAAGGA CGATGACGAA GAAGAGGATG ACGAAAAAGA GAAGAACTCC
GCCGCGCGAG CGAAAAATGT CAAATTTGAG AATGGCGACG CGTCGGATGA GGACGAGGAC
GATGAGGAAG ACGAGGAAGG TGCAAAGACG GAGAATCGCA AAATAGACGA AACTGAGGTC
GACAGCGATT CTGATGGCTC ATCTTCTTCC GGTGACGATG ATTCCGATGC TTCTTCGGAC
GACGCTAGCG CCAAGACGCC AAAGTCAAAG AAGAAGCGAG TTCCTTCGAG TGAGATGACG
GACTGTGGCG TACAGCTGAC GGAGGATGAA GTTGTCGAGA CAATCGTGTG TGACGAAAAA
TCTCGAACCA TAGAGTTCAC GGTTCCAGCT GGTATAAACT CTCCTCACGT ACTCGTGCTC
GAAATCGCAC AAGAAGTTGC CGTGAAAACG ATCATTCGCG AAACGCCGGG AATCAAGCAA
ACTTTCGTCG TCGGCAAGAC GGACGATGAA GATCCCACTG GCTCGCAACC TTTATCCATT
CAAACCGATG GTATCAATTT TGGCGCCGCT TGGGCGAACT CTGATCTCAT CGAGGTCAAC
TCCATGAAAT CAAATTCTGT ATGGGACATC ATGCAAACCT TTGGGGTCGA GGCGGGCCGA
TCGACGCTCG TGAGCGAAGT GCAAGCTGTA TTCGGCGTGT ACGGCATCGG CGTTGATACG
CGTCATCTTT CTCTCATCGG TGACTTCATG ACGCAACAAG GTGAATACCG ACCCTGCAAC
AGATCTGGCA TCGAAAAGAG CACTTCGCCA TTCCTGAAGA TGTCTTACGA GACGGCGACG
GCATTCTTAA CGGACGCAAC AATTCGCGGA GAAACGGACG ACTTGAGCTC GCCATCGTCG
AGAATCGTCG TTGGTCGCAC TGTTGATCTG GGCACGGGCT CGTTTTCTCT CAAACACGAC
ATCGTGCGGG CTGCACAGTA CCAAGAGGCT AACAAAGCCA CGGGCAAGCA CATTCGTCTC
TGA
 
Protein sequence
MPRDATATTA RHRVKQRGLV QKEIASASFS FYDAEDVRKI SVKRITNPVL FDGLNNAVAD 
GLYDPALGPT DSKTTCVTCK FPGGMCAGHF GHLELVVPVY NPLTFGTVVR LLKTTCFHCH
KFRLHASRVR RFRERLEMLM DGDMEAAEGV LPEISKKAKE EMSSVFKEVE GDGDAEEMDL
DNIHDVLPRL KTRGRGEPVV WTSITSTAAR NLIKEFLAIQ PKKCENCGAM NPKVSPEGHN
KIFRGALPKA HHENNLAKGI DINDDMASDS ETMSVDERVQ EAAKSLYITP IEARALLKRL
WMYEYDFCSM IWATTPPNKC TKRGEERRSD PARFFIQTLL VAPSKFRPPS KMGDMIFDHP
QNTALTTIIQ ANLSLAELFR TPPTVPEPPE VRAGRAVRAW LALQSGVNRL IDATKADTQE
AKQAIGIRQQ LEKKEGLFRM NMMGKRVNFA ARSVISPDPY LGTSEIGVPP VFAKKLTFPE
LVTPHNVDLM RTLVENGPEI HPGANAIEDE RGRVIHLDKF TAEKRAAIAK TLLATTAAGS
ADGPARPLAK TVYRHLRDGD VMLVNRQPTL HKPGILAHTA RVLPGQRVVR MHYSNCSTFN
ADFDGDEINL HFPQDHLGRA EAYEIMHGDR QFTVPTDGKP LRGLIQDHIC SGLLLSMRDS
FFDRSEFTQL LYSGLVDYCG DEHGKIDVPA PALLKPKALW TGKQVIAAVL SHITRGRPPL
TFSAPCKIPA TFFGGEDSGE DRLIIRRNYF CSGVVDKNMF GKYGLVHAVA ELHGRSTAGA
LLSIFSRLFT NFLQKHGFTC GIDDLILTAD AEKDRVVELN KADEMCKTAT ADVAEASGKS
DEEVMTAIAA KLLENPEWGA QLDMKASGAL NKVTSATVKK CLPFGTKKPF SKNCLSIMTI
SGAKGSLVNF SQIAAALGQQ ELEGRRVPRM PSGKTLPSFE PFDISSRANG YIACRFFSGL
DPAEYFFHCM AGREGLVDTA VKTARSGYLQ RCLVKNLESM RVHYDFTVRD SDGSIVQFQY
GEDCVDVTRS GYLEKFEFLA ENPELILLNN EAAISMLPKL NKKKVSVLET IGRKEELPRF
CMENFGDQLG VLPEKFGDEL KTFIDSRPKG YWAEEKGKKN ESSLAAKSGL TAEEFAMLMN
MRYLTYVAPP GEAVGVIAAQ SVGEPSTQMT LNTFHFAGRG EANVTLGIPR LRELLMAASK
KLLTPVMILP LKPGCRTKEN AETLCRRLRR VILAELITKL CIKVKDYGIG PDEGLSRLYT
VVIQMREGQN DDDPNEVTFA EFLHAVKRKF AKMLVARIGS DMRKSKSNNG VIVKNAKNGP
MQGTTDPRKD DDEEEDDEKE KNSAARAKNV KFENGDASDE DEDDEEDEEG AKTENRKIDE
TEVDSDSDGS SSSGDDDSDA SSDDASAKTP KSKKKRVPSS EMTDCGVQLT EDEVVETIVC
DEKSRTIEFT VPAGINSPHV LVLEIAQEVA VKTIIRETPG IKQTFVVGKT DDEDPTGSQP
LSIQTDGINF GAAWANSDLI EVNSMKSNSV WDIMQTFGVE AGRSTLVSEV QAVFGVYGIG
VDTRHLSLIG DFMTQQGEYR PCNRSGIEKS TSPFLKMSYE TATAFLTDAT IRGETDDLSS
PSSRIVVGRT VDLGTGSFSL KHDIVRAAQY QEANKATGKH IRL