Gene OSTLU_49109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49109 
Symbol 
ID5001075 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp366527 
End bp369817 
Gene Length3291 bp 
Protein Length994 aa 
Translation table 
GC content53% 
IMG OID640416496 
Productpredicted protein 
Protein accessionXP_001416923 
Protein GI145344821 
COG category[C] Energy production and conversion 
COG ID[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.808055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.27646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAG CGGTGTTCGC GCGATCGAAG GCGAGCGCGA GCGTGGCGCC GCCGAAACCG 
ACGCCGAATC GAGAGATGAG GGATGACTTT TTAAACGCCT CGAGCGCGGC GTACTTGGAG
GCGATGGAGG ACGAGTACAG GAAGGATCCG AAGAGCGTAC CGGAATCGTG GGCGTCGTTG
TTGAGACAGA TGGGTACGTA GCGATCGTCG ATGGATGCGC GAGACGGCGA GGATGAGGTT
TTGAAGGCGA ACTGGAACGG CGCGCGATTC AATTCCCTCA ATTCGGTTGG CGGACCAATC
CAAATTCGGA GCGAACGAAG ACTGACGAGA ATTCCGTGCA TTTTTTCGCG TAAACAGATG
CGGGCGTGAG TGGAGCTGAA ATATCTGAAA TGTACACCGC GATGTCGACG GGTACGGCGC
CTATGGCTGT GGGGCGACCG TTGGACGCGC AAACGATTCA GGAATCCATG CGGTTGATGA
TGTTGATCCG CTCGTATCAA ATTAGCGGAC ATAGCATCGC CAATCTCGAT CCATTGGCGC
TTGATGAGCG TGAGATGCCA ATCTCTCTCG ATCCAAGCCT GTACGGGTTC ACGGAAGACG
ATATGGATCG AGATTTCTTT ATTGGAACTT GGAAAATGAA GGGCTTCTTG AGCGAAGACC
GTCCGGTGCA GACGTTGCGT CAAATCTTGA CCAGGCTCAA AGAGACGTAC TGCGGCACGG
TGGGCTACGA GTACATGCAT ATTCAAGATC GTGAGCAATG TAACTGGCTT CGAGCGAAAA
TCGAGACCGA GCGCAAGAAG CAGTATTCGC CGGAACGGAA ACAAATTATT TTGGACCGCT
TAGCTTGGGG CGAACTTTTC GAAGGTTTCT TGTCCAACAA GTACAGCGCA GCGAAGCGAT
TCGGTTTGGA AGGTTGTGAA AGCCTCGTGC CTGGATTTAA GGAAGCGATC GATAAGGCGG
CCGAGATGGG GGTTGAAGCT ATCACGATCG GTATGCCGCA CCGCGGTCGT TTAAATGTGC
TCGCGAACGT CGTGCGCAAA CCGCTTCAAA CGATTTTCAA CGAGTTTAAG GGTGGTCCAA
AACTCGTGGA CGAGTTACCG AACACGGAAT CGCAGTATAC TGGATCAGGC GATGTGAAAT
ATCATCTCGG GACTTCGTTC GATCGTCCGA CGTTACGCGG CGGTCAAATC CATCTTTCCT
TGGTTGCGAA TCCATCGCAT CTTGAGGCGG TAAACACGGT GGTTACTGGA AAGACTCGTG
CGAAGCAGTT CTACACGAAA GATCCGAACG GCGATCGTTC TATGCCTATC TTGCTGCACG
GTGACGGTGC TTTCAGTGGT CAAGGGATTG TGTACGAGAC GCTTGACATG AGCAAATTGC
CCGAGTACAG CGTCGGTGGG ACTTTGCACA TCGTCGTGAA CAACCAAGTC GCCTTTACGA
CTGATCCAAA GTACTCACGC TCGAGCGCTT ACTGCACCGA TGTAGCTAAG GGCATGGAAG
TTCCAGTGTT TCACGTTAAC GGTGATGACG TGGAAGCGGT GGCGTGGGTC ATGGAACTCG
CCACCGAATG GAGAATGAAA TGGAAGACGG ACGCCGTAGT CGACATCGTG TGCTATCGCA
AATACGGTCA CAACGAGATT GATGAACCCA TGTTCACGCA ACCTCTTATG TACAAGGTGA
TTCAGCAACA TCCCAGTGTT TTGACGAAAT ATAGCGCCAA ACTCGTCAAC GAAGGCATCA
TCACCCCGGA AGACTTCGTC AGCATGAAGG AAAAGATCAA TAACATTATG GAAGAGGAGT
TCACGGCGTC AAAGGACTAT GTTCCGAAGC AACGTGACTG GCTCGCATCG CATTGGCAAG
GTTTCAAGAG CCCGGATCAG CTTTCGCGCA TCGCAGACAC TGGCTTACCG ATGGATCACA
TAAAGAATCT CGGACAACTT ATCACCGCGA TTCCTGCGGG ATTTACACCA CATCGAGTCG
TGAAGCGCGT TTATGAAAAT CGTCGCGCCA TGATTGAAAA CGGCAACGGT ATCGATTGGG
CCATGGGCGA AGCTTTGGCG TTCGCTTCTT TACTCGATGA AGGCAACCAT GTGCGTTTGT
CTGGACAAGA TGTCGAGCGT GGCACTTTTT CTCATCGTCA CGCGCTGATT CATGATCAAA
TCACGGGCGA GCGTTTCATT CCTTTGCGAA ACGTCTACAG CGGCAACCCG GGTCGAGGTC
AGAACTTCTT CACGGTTTGC AACTCTTCTT TGTCCGAGTA CGGTGTGCTC GGTTTCGAAC
TTGGCTACTC TCTTGAGCAT CCCAACTGTT TGATCTTATG GGAAGCGCAG TTTGGTGATT
TCTCGAATAC GGCTCAAGTC ATCATCGATC AATTTATCAG CAGCGGTGAG GCAAAGTGGT
TACGTCAAAG CGGTCTCACG CTACTTCTTC CGCATGGTTA CGACGGACAA GGTCCCGAGC
ACAGTAGCGC TCGACTTGAA CGTTTCTTGC AAATGGCTGA CGAAGATCCG ACGCAGATCC
CTGAGATGGA AATGGAACGC CGCACGCAGT TGCAAGAATG CAACTGGCAA ATTTGCAACG
TCACCACGCC CGCAAACTAT TTCCACATGT TGCGCCGACA GGTTCATCGT GAATTCCGCA
AGCCGCTCGT CGTGATGAGC CCGAAGAATC TATTGCGTCA TCCGAAAGCG GTCTCCGATA
TCAGCGAGTT CGATAACAGC GACGACAACG ATTCACTTCA AGGCGTTCGC TTCAAGCGAC
TTATCATGGA CAAGACTTCG AAGTCGCGCA GCATGGACTC TCCTGCGGAG AATGAGGTGG
AAAGAGTCAT CTTCTGCTCC GGAAAGGTTT ACTACGATCT CGACGACGAA CGCGATGCGG
CGAAGAACAT CGACAGGGTG AAAATCTGCC GCATCGAACA ACTCGCGCCG TTCCCGTGGG
ATCTCGTCAA GCGCGAATTG AAGCGTTACC CGAATGCCGA AGTTGTTTGG TGCCAAGAGG
AACCGATGAA CATGGGCGCG TGGTGGCACG TCCAACCGAG AATGAGTACG TTGTTCAAAG
ACCTCGGTCG ATCGGGCGAA ACGCGCTACG CTGGTCGCAA ACCCGCGTCT TCGCCCGCAA
CCGGGTACGC CGCCGTGCAC GCGCAAGAGC AAGCCCAACT CGTCGCCGAC GCCATTCGGT
AAACTTCCAC CCGCGCCGTG TACAACGGCA AATTGACAAT CGAATCGAGC GCATCGCTCG
CGCGCGTCTT AGTCTCAACC GCACGCGTTC GCGATCATTT ACCAACTCAA T
 
Protein sequence
MAEAVFARSK ASASVAPPKP TPNREMRDDF LNASSAAYLE AMEDEYRKDP KSVPESWAGA 
EISEMYTAMS TGTAPMAVGR PLDAQTIQES MRLMMLIRSY QISGHSIANL DPLALDEREM
PISLDPSLYG FTEDDMDRDF FIGTWKMKGF LSEDRPVQTL RQILTRLKET YCGTVGYEYM
HIQDREQCNW LRAKIETERK KQYSPERKQI ILDRLAWGEL FEGFLSNKYS AAKRFGLEGC
ESLVPGFKEA IDKAAEMGVE AITIGMPHRG RLNVLANVVR KPLQTIFNEF KGGPKLVDEL
PNTESQYTGS GDVKYHLGTS FDRPTLRGGQ IHLSLVANPS HLEAVNTVVT GKTRAKQFYT
KDPNGDRSMP ILLHGDGAFS GQGIVYETLD MSKLPEYSVG GTLHIVVNNQ VAFTTDPKYS
RSSAYCTDVA KGMEVPVFHV NGDDVEAVAW VMELATEWRM KWKTDAVVDI VCYRKYGHNE
IDEPMFTQPL MYKVIQQHPS VLTKYSAKLV NEGIITPEDF VSMKEKINNI MEEEFTASKD
YVPKQRDWLA SHWQGFKSPD QLSRIADTGL PMDHIKNLGQ LITAIPAGFT PHRVVKRVYE
NRRAMIENGN GIDWAMGEAL AFASLLDEGN HVRLSGQDVE RGTFSHRHAL IHDQITGERF
IPLRNVYSGN PGRGQNFFTV CNSSLSEYGV LGFELGYSLE HPNCLILWEA QFGDFSNTAQ
VIIDQFISSG EAKWLRQSGL TLLLPHGYDG QGPEHSSARL ERFLQMADED PTQIPEMEME
RRTQLQECNW QICNVTTPAN YFHMLRRQVH REFRKPLVVM SPKNLLRHPK AVSDISEFDN
SDDNDSLQGV RFKRLIMDKT SKSRSMDSPA ENEVERVIFC SGKVYYDLDD ERDAAKNIDR
VKICRIEQLA PFPWDLVKRE LKRYPNAEVV WCQEEPMNMG AWWHVQPRMS TLFKDLGRSG
ETRYAGRKPA SSPATGYAAV HAQEQAQLVA DAIR