Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43496 |
Symbol | |
ID | 5006565 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 158624 |
End bp | 163846 |
Gene Length | 5223 bp |
Protein Length | 1663 aa |
Translation table | |
GC content | 54% |
IMG OID | 640421986 |
Product | predicted protein |
Protein accession | XP_001422669 |
Protein GI | 145356916 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0622972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCG ATGCGACGGC GACGACGGCG CGACACCGGG TCAAACAGCG CGGACTGGTG CAGAAAGAGA TCGCGAGCGC GAGCTTCAGC TTTTACGACG CCGAGGATGT GCGCAAGATA TCGGTGAAGA GGATCACGAA CCCGGTGCTG TTCGACGGGT TGAACAACGC GGTGGCGGAT GGTCTGTACG ACCCCGCGCT CGGACCGACG GACTCGAAAA CGACGTGCGT GACGTGTAAA TTTCCGGGAG GGATGTGCGC GGGACACTTT GGACACCTCG AGCTCGTCGT GCCGGTGTAC AACCCGTTGA CGTTCGGCAC GGTGGTGCGG TTGCTGAAGA CGACGTGTTT TCATTGTCAC AAGTTTCGCT TGCACGCCAG TCGCGTGCGA AGGTTTCGAG AGCGGTTAGA GATGCTCATG GATGGTGACA TGGAAGCGGC CGAGGGAGTA CTTCCGGAGA TTAGCAAGAA GGCGAAGGAA GAGATGAGCT CAGTGTTTAA AGAGGTCGAG GGCGACGGCG ATGCGGAGGA GATGGATTTA GACAACATCC ACGACGTCTT GCCGAGATTG AAAACGCGTG GTCGAGGTGA ACCGGTCGTA TGGACTTCGA TCACGTCCAC GGCAGCGCGA AATTTAATCA AGGAGTTTCT CGCGATACAA CCGAAGAAGT GTGAGAACTG TGGGGCTATG AATCCGAAGG TGTCGCCCGA GGGACACAAT AAGATCTTCA GAGGTGCACT TCCGAAGGCG CATCACGAGA ACAACTTAGC CAAGGGCATT GACATCAACG ACGATATGGC GTACCTTGCT CGCGAAGCTA GCGCCGAGAG CGCAGATTCA CACGCTGGAG CGACGAAGTT GGCCGGCGCA GCTGTGGAAC CGAAGCTCGT GCCGGCGAAA AAGAAGGCGC GCAAGAGACT CGGCGAAGGG GATGAGTCGG ATAGATCGAC GGATGACGTG GGTGCGGCTG AAAAGAGTAG TGAAGCCCGA GACGTCGACG CGCAGTCTGA TAGCAGCGAC GATAGCGATA GTAGCGACTC AGAGACGATG TCCGTGGATG AGCGCGTGCA GGAGGCGGCA AAGTCGCTTT ACATCACCCC AATTGAGGCT CGAGCTTTGC TTAAACGTTT ATGGATGTAC GAGTACGATT TTTGTTCCAT GATCTGGGCG ACGACCCCAC CGAATAAGTG CACGAAGCGA GGCGAAGAGC GTCGAAGTGA TCCGGCGAGG TTTTTCATTC AAACACTCCT CGTGGCTCCG TCAAAGTTCC GGCCGCCAAG CAAGATGGGC GATATGATCT TTGATCACCC GCAAAATACG GCGCTCACGA CGATTATCCA GGCCAATTTA AGTCTGGCCG AGCTCTTTAG GACGCCTCCA ACGGTCCCAG AGCCGCCTGA GGTGCGAGCA GGCCGAGCCG TACGCGCTTG GTTGGCTTTG CAATCCGGCG TGAATCGGCT CATCGACGCC ACGAAGGCTG ACACCCAAGA AGCCAAGCAG GCTATTGGGA TTCGTCAGCA GCTTGAGAAG AAAGAGGGTT TATTTCGCAT GAACATGATG GGTAAGCGCG TGAATTTCGC CGCACGGTCG GTGATTTCCC CCGATCCATA CCTGGGCACG AGCGAAATTG GCGTTCCTCC GGTGTTCGCG AAAAAGCTCA CGTTTCCCGA GCTCGTCACC CCACATAACG TTGACTTGAT GCGCACACTC GTTGAAAACG GACCTGAAAT CCATCCAGGG GCGAACGCGA TCGAAGACGA ACGAGGGCGT GTGATTCACT TGGACAAGTT CACCGCCGAA AAGCGAGCTG CCATAGCGAA GACTCTCTTA GCAACGACGG CCGCTGGATC TGCGGACGGG CCGGCGAGAC CGCTCGCAAA GACGGTGTAT AGACATTTAC GCGACGGCGA CGTGATGCTC GTTAATCGTC AGCCAACGTT GCACAAGCCT GGTATTTTGG CGCACACTGC GCGGGTTCTA CCGGGGCAGC GAGTCGTCCG TATGCACTAC TCCAACTGTT CCACCTTCAA CGCCGATTTC GATGGGGATG AAATCAACCT TCACTTTCCG CAGGATCACT TAGGACGAGC CGAAGCGTAT GAAATCATGC ACGGCGATCG TCAGTTCACC GTACCAACGG ACGGGAAGCC TCTCAGAGGG TTGATCCAAG ACCACATCTG TTCCGGGTTG TTGCTCTCCA TGCGAGACAG CTTCTTCGAT CGATCCGAGT TCACGCAGCT TCTTTACAGC GGTCTCGTGG ACTACTGCGG TGATGAGCAC GGGAAAATCG ACGTCCCGGC GCCTGCTCTC CTCAAGCCAA AAGCACTTTG GACCGGCAAA CAAGTCATCG CCGCGGTTTT ATCGCACATC ACTCGAGGGC GACCACCGCT AACGTTCAGC GCACCGTGCA AGATCCCAGC CACGTTTTTC GGCGGTGAAG ACTCTGGCGA AGATCGACTG ATTATTAGAC GAAACTACTT TTGCTCTGGG GTCGTAGACA AGAATATGTT CGGCAAGTAC GGCCTTGTGC ACGCCGTGGC CGAGCTCCAC GGACGGTCCA CAGCCGGCGC TTTATTGTCA ATTTTTTCCC GACTTTTTAC CAACTTTTTG CAAAAGCATG GCTTTACGTG CGGTATCGAT GATTTGATTC TCACCGCTGA TGCAGAAAAA GATCGCGTAG TGGAGCTAAA CAAGGCAGAC GAGATGTGTA AGACGGCTAC AGCAGATGTC GCAGAAGCGA GCGGGAAATC GGATGAGGAA GTGATGACGG CTATCGCGGC GAAGTTGCTC GAAAATCCCG AATGGGGCGC TCAATTGGAC ATGAAAGCGT CAGGCGCTTT GAACAAGGTG ACTTCGGCGA CCGTGAAGAA GTGTCTCCCG TTCGGTACAA AGAAGCCATT CTCAAAGAAT TGTCTCTCCA TCATGACCAT CTCTGGCGCA AAGGGTTCGC TTGTGAACTT TTCTCAAATC GCCGCCGCCT TGGGCCAGCA AGAGCTCGAG GGTCGCCGTG TACCTCGCAT GCCGAGTGGT AAGACGCTCC CTTCATTCGA GCCTTTCGAC ATCAGCTCGC GGGCGAACGG TTACATCGCC TGTCGGTTTT TTAGCGGATT GGATCCCGCC GAGTACTTTT TCCACTGCAT GGCTGGTCGC GAAGGTTTGG TCGATACCGC CGTGAAGACT GCGCGCTCTG GTTACTTGCA GCGTTGTTTG GTGAAAAACC TAGAATCGAT GAGAGTGCAT TACGATTTTA CCGTGCGTGA CAGCGATGGA AGCATTGTGC AGTTTCAATA CGGGGAGGAT TGCGTCGACG TCACGCGTTC TGGGTACTTG GAGAAGTTTG AGTTCCTTGC CGAAAACCCG GAACTCATTT TGCTCAACAA CGAAGCCGCG ATTTCGATGT TGCCCAAGCT CAACAAGAAA AAAGTAAGTG TTCTTGAGAC CATAGGTCGT AAAGAAGAGC TACCGCGATT CTGCATGGAA AACTTTGGCG ACCAGTTGGG AGTGTTGCCA GAGAAATTCG GCGATGAACT TAAAACGTTC ATCGACTCGC GACCGAAAGG TTACTGGGCC GAAGAGAAAG GAAAGAAGAA TGAAAGTTCA TTGGCGGCAA AGTCCGGTTT GACCGCTGAA GAGTTTGCCA TGCTGATGAA TATGCGTTAT CTCACGTACG TTGCGCCCCC CGGCGAAGCC GTCGGCGTCA TTGCCGCGCA GTCTGTTGGA GAACCATCCA CACAAATGAC GTTGAATACC TTCCACTTTG CCGGCCGCGG CGAAGCAAAC GTCACGCTTG GTATTCCTCG TCTTCGAGAG TTGCTCATGG CTGCATCAAA GAAGCTTTTG ACGCCCGTCA TGATACTTCC GTTGAAACCT GGGTGCAGAA CTAAGGAAAA TGCTGAGACG CTCTGTCGCC GACTCCGACG AGTGATCCTC GCCGAACTCA TCACAAAGCT TTGTATCAAG GTGAAAGACT ACGGTATAGG TCCAGATGAG GGACTCTCAC GATTGTATAC CGTTGTCATT CAAATGCGCG AAGGTCAAAA TGACGACGAT CCAAATGAGG TGACGTTTGC AGAATTTCTG CACGCCGTCA AGCGCAAGTT CGCCAAAATG CTCGTGGCGA GAATTGGCTC TGACATGAGG AAGAGCAAAA GCAACAATGG TGTCATCGTG AAAAACGCGA AAAATGGCCC GATGCAAGGA ACCACGGACC CTCGCAAGGA CGATGACGAA GAAGAGGATG ACGAAAAAGA GAAGAACTCC GCCGCGCGAG CGAAAAATGT CAAATTTGAG AATGGCGACG CGTCGGATGA GGACGAGGAC GATGAGGAAG ACGAGGAAGG TGCAAAGACG GAGAATCGCA AAATAGACGA AACTGAGGTC GACAGCGATT CTGATGGCTC ATCTTCTTCC GGTGACGATG ATTCCGATGC TTCTTCGGAC GACGCTAGCG CCAAGACGCC AAAGTCAAAG AAGAAGCGAG TTCCTTCGAG TGAGATGACG GACTGTGGCG TACAGCTGAC GGAGGATGAA GTTGTCGAGA CAATCGTGTG TGACGAAAAA TCTCGAACCA TAGAGTTCAC GGTTCCAGCT GGTATAAACT CTCCTCACGT ACTCGTGCTC GAAATCGCAC AAGAAGTTGC CGTGAAAACG ATCATTCGCG AAACGCCGGG AATCAAGCAA ACTTTCGTCG TCGGCAAGAC GGACGATGAA GATCCCACTG GCTCGCAACC TTTATCCATT CAAACCGATG GTATCAATTT TGGCGCCGCT TGGGCGAACT CTGATCTCAT CGAGGTCAAC TCCATGAAAT CAAATTCTGT ATGGGACATC ATGCAAACCT TTGGGGTCGA GGCGGGCCGA TCGACGCTCG TGAGCGAAGT GCAAGCTGTA TTCGGCGTGT ACGGCATCGG CGTTGATACG CGTCATCTTT CTCTCATCGG TGACTTCATG ACGCAACAAG GTGAATACCG ACCCTGCAAC AGATCTGGCA TCGAAAAGAG CACTTCGCCA TTCCTGAAGA TGTCTTACGA GACGGCGACG GCATTCTTAA CGGACGCAAC AATTCGCGGA GAAACGGACG ACTTGAGCTC GCCATCGTCG AGAATCGTCG TTGGTCGCAC TGTTGATCTG GGCACGGGCT CGTTTTCTCT CAAACACGAC ATCGTGCGGG CTGCACAGTA CCAAGAGGCT AACAAAGCCA CGGGCAAGCA CATTCGTCTC TGA
|
Protein sequence | MPRDATATTA RHRVKQRGLV QKEIASASFS FYDAEDVRKI SVKRITNPVL FDGLNNAVAD GLYDPALGPT DSKTTCVTCK FPGGMCAGHF GHLELVVPVY NPLTFGTVVR LLKTTCFHCH KFRLHASRVR RFRERLEMLM DGDMEAAEGV LPEISKKAKE EMSSVFKEVE GDGDAEEMDL DNIHDVLPRL KTRGRGEPVV WTSITSTAAR NLIKEFLAIQ PKKCENCGAM NPKVSPEGHN KIFRGALPKA HHENNLAKGI DINDDMASDS ETMSVDERVQ EAAKSLYITP IEARALLKRL WMYEYDFCSM IWATTPPNKC TKRGEERRSD PARFFIQTLL VAPSKFRPPS KMGDMIFDHP QNTALTTIIQ ANLSLAELFR TPPTVPEPPE VRAGRAVRAW LALQSGVNRL IDATKADTQE AKQAIGIRQQ LEKKEGLFRM NMMGKRVNFA ARSVISPDPY LGTSEIGVPP VFAKKLTFPE LVTPHNVDLM RTLVENGPEI HPGANAIEDE RGRVIHLDKF TAEKRAAIAK TLLATTAAGS ADGPARPLAK TVYRHLRDGD VMLVNRQPTL HKPGILAHTA RVLPGQRVVR MHYSNCSTFN ADFDGDEINL HFPQDHLGRA EAYEIMHGDR QFTVPTDGKP LRGLIQDHIC SGLLLSMRDS FFDRSEFTQL LYSGLVDYCG DEHGKIDVPA PALLKPKALW TGKQVIAAVL SHITRGRPPL TFSAPCKIPA TFFGGEDSGE DRLIIRRNYF CSGVVDKNMF GKYGLVHAVA ELHGRSTAGA LLSIFSRLFT NFLQKHGFTC GIDDLILTAD AEKDRVVELN KADEMCKTAT ADVAEASGKS DEEVMTAIAA KLLENPEWGA QLDMKASGAL NKVTSATVKK CLPFGTKKPF SKNCLSIMTI SGAKGSLVNF SQIAAALGQQ ELEGRRVPRM PSGKTLPSFE PFDISSRANG YIACRFFSGL DPAEYFFHCM AGREGLVDTA VKTARSGYLQ RCLVKNLESM RVHYDFTVRD SDGSIVQFQY GEDCVDVTRS GYLEKFEFLA ENPELILLNN EAAISMLPKL NKKKVSVLET IGRKEELPRF CMENFGDQLG VLPEKFGDEL KTFIDSRPKG YWAEEKGKKN ESSLAAKSGL TAEEFAMLMN MRYLTYVAPP GEAVGVIAAQ SVGEPSTQMT LNTFHFAGRG EANVTLGIPR LRELLMAASK KLLTPVMILP LKPGCRTKEN AETLCRRLRR VILAELITKL CIKVKDYGIG PDEGLSRLYT VVIQMREGQN DDDPNEVTFA EFLHAVKRKF AKMLVARIGS DMRKSKSNNG VIVKNAKNGP MQGTTDPRKD DDEEEDDEKE KNSAARAKNV KFENGDASDE DEDDEEDEEG AKTENRKIDE TEVDSDSDGS SSSGDDDSDA SSDDASAKTP KSKKKRVPSS EMTDCGVQLT EDEVVETIVC DEKSRTIEFT VPAGINSPHV LVLEIAQEVA VKTIIRETPG IKQTFVVGKT DDEDPTGSQP LSIQTDGINF GAAWANSDLI EVNSMKSNSV WDIMQTFGVE AGRSTLVSEV QAVFGVYGIG VDTRHLSLIG DFMTQQGEYR PCNRSGIEKS TSPFLKMSYE TATAFLTDAT IRGETDDLSS PSSRIVVGRT VDLGTGSFSL KHDIVRAAQY QEANKATGKH IRL
|
| |