Gene OSTLU_32286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32286 
Symbol 
ID5002527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp610141 
End bp613486 
Gene Length3346 bp 
Protein Length1034 aa 
Translation table 
GC content61% 
IMG OID640417948 
Productpredicted protein 
Protein accessionXP_001418530 
Protein GI145348173 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.414837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.522406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGCGACGCG CGCGCGCGCG AAGGAACGAA CGCGCGACGG ACGAGGGACG AGCGACGCGA 
AGACGACGCC GAACGCGCGG TGCATGCTTC GAGCGGTCGC GCGACGGTCG ACGACGACGG
CGACGACGAC GACGCGCGGA TTCGCGAGGA CGAGACTCGG CGAACGACGC GGGCGAAGAG
GACGAGACGC GACGCGAGCG ACGCGAGCGA CGCGGCGAGA GGTGATGGTC GGGACGAGCG
CGGCGTCGAT GTCGGCGATC GGGGCGGGAC GGGTCGGGGG GGCGAACGCG ACGGCGACGG
CGACGAAACG GGCGACGAGC GAGGACGCGA AGGCGACGCT CGACGCGCTG GAAAATTTGA
AACCGGGGAC GAGGCTCGCG GGAGGGGCGT TCGAGGTGAC GTCGACGAAG CGAGTGATGC
CGTACGACGT CGTGGCGGTG GAGCTGGAGC ACGTGAAGAC GGGGGCGAAG GTGCTGCACG
TGGGCGCGGA CGACTCGAAC GCGGGATTCA ACGTGGCGTT TCGGACGACG CCGCGGGATT
CGACGGGCGT GGCGCACGTG TTGGAACACA CGGTGTTGTG CGGAAGCGAA AAGTTTCCGG
TGAGAGATCC GTTCTTTAAC ATGCTGCGAC GATCGTTGAG CACGTTCATG AACGCGATGA
CGGCGAGCGA TTTCACGTGC TACCCGTTTT CGACGATGAA CCGCGTGGAT TACAAAAACT
TGCTCGACGT CTACTTGGAC GCCGCGTTTT TCCCCAAGAT CGCCGCGGAG GATTTCTCGC
AAGAAGGTCA CCGCTTCGAG TTCGCGAAGA TGGACGACCC GACGAGCGAT TTGATTTACA
AGGGCATCGT GTTCAACGAG ATGAAGGGCG CGATGGGGTC GCAGAGCGCG CGATACGGCA
GAGCGCTCGG CGAGAACTTG TTCCCGACGT CCACGTACCA TTGGAACAGC GGTGGAGATC
CGGTCAACAT TCCCGACTTG ACGTACGAAC AGCTCAAGGC GTTCCACGCG TTGCATTATC
ACCCGTCAAA CGCGAAGTTT TACACGTACG GCGACTTGCC GCTCGAGGAG ACTTTGCAGC
AAATCGAAGA CTCGGCGTTG CACCGCTTTG ATAAACTCGA CGTGAGCAAG TTGATCGTCG
AAGACGAAAA GCGTTTCACC GCGCCGAAGC GCGTCGAAGC CACCGTACCC GCGGACGCGG
TGGTGGCTGA CGCGAACAAG CAGTCGCTCA TCTCTCTCGC GTGGCTCATG GTGAACCAAA
TCGAGGATCC AGTTTCGTTG GACAACTTCG CCTTGGGCGT CGCCAGTGAC TTGCTCACGA
GCGGACCGCA ATCGTACCTT TACGAAGCGC TCCTCGAGCC CGGCCTCGGG AGCGGTTTCG
CCCCGGGCAC TGGCTACGGT GGCTCTCGCC GAGAGACGTC CTTCGCCGTC GGCTTGAAGG
ATGTCGCCGA CGCGGATATG GACAAGATTG AAAAGACTAT TCTCGACGTT CTCGAGCGCA
TCTCTCGCGA AGGGTTCCCG CGCGAGCGCG TCGAAGCGGT GATGCACCAG CTCGAACTCG
ACTCCGCGGC GGTTACGACG CAGTTCGGAT TGTACACCGG TTTCGGAGCC TTCTCGACGT
GGGTGCACGA CGGCGACTCG CTGCGCGCGT TGCGCACGCC CGAGCTCGCG GCCAAGCTCA
ACGCCGCGCT CGACGCGGAT CCGCAGTACT GGCAAAAGCT CATCAAAAAG TGGTTCTTAG
ACAACACGCA CCGTCTCACG ATCACCGCGC GCACTGATCC TGATTACGAC AAAAAGCTCG
ACGAAGCGGA GAAGGCAAAG CTGAAGAGCA TCGAAAAGAC GTTGACCGAG GACCAGAAGA
AGAAGATCGT CGCCGACGCG CTCGTGCTCA AAGAGAATCA AGATAAGAAG GAAGACGTAT
CGGTGTTACC CACCCTGATC GTCGCCGAGG CCGTGCCGAA AGACATTAAG CGCTGGGGGT
CGAAGAATAT GAAAATCGCC GGTAACATTC CGCTTCAGTA CGACGAGCAA CCGACCAACG
GTGTCGTCTA CTTTAGCACG CACTTTGATC TGGATGGTCT TCCGCAGCGG TTGGTGCCTT
ATTTAGACAT GTTCATGGAT TTCATCGATC AGCTCGGCAC GGAAAAGATG AAGTACAAGG
ATTTGGCGGA ACAAATCAAG CTTCGCACAG GCGGCTTTTC CGTCGGCTCC GTCGTTCGTA
CTCCCACCGA CGGCAAAGGT ACGCCGACGA TGTCGCTGTC CATCAGCGGT CACGCGCTCG
AACGCAACGT CGACGCGATG TTCGATATTC TCACCGATTT GCAAACGGCG AAGTGGCGAG
GGGAAGAAGA GCGCGTCAAG TTGTTGTTGA CACGTCGTGC CGCCGCACTC GGCGCCTCCG
TTGGACAGCA AGGCATGCAA TACGCGAGAA ACCTCGCAGG CGCTCAAATC AGCGCCACGA
GTGCGTTGTC GAACGAAACG AGTGGATTGC CTCACGTCGG CCTCGTCTCG CGCTTGTCCA
AGGAAGGCGC GATCGATGAA GTTGAAACCG CGATGGCCGA AATCGCCGCG TTCGCGCTTC
GCCCCGAGCG CGTACAGCGA TGCCGCATCG CGTGCCAAAA AGAAAGCTTT TCCGCCACCG
AACGTCGATT CGCAAAGTTC TTGAAAGATA TCAAGCCGGT TGCCGCCAGT CCGAGCGACA
AGGACACCGT GGCGACGAAG TTGAAGACGT TCAAACCCGA GCTTTCCAAG GTGTTTGTCT
CCATCCCGGG GCAAACCAAT TACTGCTCCG CCGCGCTTCC GGCGTTGCCG TACAGCCACC
CCGACGCTCC GGCATTATTC TTGCTCGCGC AAGCGTTGAG CGCGGGCTAC CTTCACCGTG
AAATTCGCGA AAAGGGCGGC GCTTACGGCG GTGGTTGTGC CTCGGACCCG ATGTCCTCGC
TCTTCACCTT CTTCTCCTAC CGCGATCCGA ACACGACGGA GACGTTGGAC ACTTTCACCA
AATCCATCGA ATGGGCCACG AACTCGGAGA ACATCACCAC AAAAGAGCTC GAAGAGGCTC
AACTTCGCGC GTTCAAGCAA CTCGACGCCC CGCTCGCTCC CAGCGCGCGC GGAAACTCCG
GTTTCTTAAC CGGCGTCACC GACGAAGAGC GTCAGCGTTT CCGCGACGGG TTGCTCGCCG
CGTCGCCGGC GGATTTATCT CGCGTCGCCG CCGCCCACCT CCGCGGCGTC GCCCCCGCGA
TCGCCATCAT CGGTTCATCC GAGAAGGCTC CGCTCGCGGA CGCGAGCTGG GTGAATCTTG
ACGCCCAGGG CGCGCCTCGC GTCGCCTAGT CGTCGCCGCG CCCTCG
 
Protein sequence
MVGTSAASMS AIGAGRVGGA NATATATKRA TSEDAKATLD ALENLKPGTR LAGGAFEVTS 
TKRVMPYDVV AVELEHVKTG AKVLHVGADD SNAGFNVAFR TTPRDSTGVA HVLEHTVLCG
SEKFPVRDPF FNMLRRSLST FMNAMTASDF TCYPFSTMNR VDYKNLLDVY LDAAFFPKIA
AEDFSQEGHR FEFAKMDDPT SDLIYKGIVF NEMKGAMGSQ SARYGRALGE NLFPTSTYHW
NSGGDPVNIP DLTYEQLKAF HALHYHPSNA KFYTYGDLPL EETLQQIEDS ALHRFDKLDV
SKLIVEDEKR FTAPKRVEAT VPADAVVADA NKQSLISLAW LMVNQIEDPV SLDNFALGVA
SDLLTSGPQS YLYEALLEPG LGSGFAPGTG YGGSRRETSF AVGLKDVADA DMDKIEKTIL
DVLERISREG FPRERVEAVM HQLELDSAAV TTQFGLYTGF GAFSTWVHDG DSLRALRTPE
LAAKLNAALD ADPQYWQKLI KKWFLDNTHR LTITARTDPD YDKKLDEAEK AKLKSIEKTL
TEDQKKKIVA DALVLKENQD KKEDVSVLPT LIVAEAVPKD IKRWGSKNMK IAGNIPLQYD
EQPTNGVVYF STHFDLDGLP QRLVPYLDMF MDFIDQLGTE KMKYKDLAEQ IKLRTGGFSV
GSVVRTPTDG KGTPTMSLSI SGHALERNVD AMFDILTDLQ TAKWRGEEER VKLLLTRRAA
ALGASVGQQG MQYARNLAGA QISATSALSN ETSGLPHVGL VSRLSKEGAI DEVETAMAEI
AAFALRPERV QRCRIACQKE SFSATERRFA KFLKDIKPVA ASPSDKDTVA TKLKTFKPEL
SKVFVSIPGQ TNYCSAALPA LPYSHPDAPA LFLLAQALSA GYLHREIREK GGAYGGGCAS
DPMSSLFTFF SYRDPNTTET LDTFTKSIEW ATNSENITTK ELEEAQLRAF KQLDAPLAPS
ARGNSGFLTG VTDEERQRFR DGLLAASPAD LSRVAAAHLR GVAPAIAIIG SSEKAPLADA
SWVNLDAQGA PRVA