Gene OSTLU_31794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31794 
Symbol 
ID5001777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp541475 
End bp543736 
Gene Length2262 bp 
Protein Length753 aa 
Translation table 
GC content54% 
IMG OID640417198 
Productpredicted protein 
Protein accessionXP_001418044 
Protein GI145347161 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0561113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG AGAAGAAGAA AATGGGAAAG AAGGAGCGAC GAGCGGCGTT CGGACCGAGC 
GTGGCGTTGA ATAAAGAGCT GATGTCGTGT GAGACGTTGG AGGCGTTGGC GGAGACGTCG
CGTCGCGTCG CGCGGGACAT GTCGGCGGTA AACGTCGCGA CGACGTACGG AAAATTGGCC
AGGTTCGCGC GAGGGGGACG AGGGAAAGTT AGTGAAGAGA TCAAGCGGTC GGAATGGTTC
GCCGAACTCG AAGAGCGCGC GATGGAGAAG CTGGGGGAGT GCGAAACCAA GGCGCGCGCC
GTCGCGCAAA TTGCGTGGGC GTGCGGGTAC TTGGATAGGG GTCGACATCT GCATGGTGAA
GATGCGTTTT GGGATGAATT AGAGCGGGCT ATTGAGAGAG AAATTAGTAA ATGCGAGCCA
CAAGGTGCGA GTAACGTCGC GTGGGCGTAC GCAAAGCTTG AGATGCGCAT GCCGAACGGC
ATTCGCAACG CCATAGAAAA GCACATTGTG CAAAATGCGA GTGCGTACAA ACCGTTCGAA
TTGAGCATCA CGTTTTGGGC GCTCACGAAG TTTGGAGACA TTCCGACGGA GGAAATGCTA
GATGTATTCG AGCGCGAGAT GCGACTTCAA TCGTGTGGTT CACAAGAGCT TACGAACATC
GCTTCGGCGT ACGCGCGCAT CAGCGGACGG AGAGTGCGTC AGGGGACTCA AGGTTTCCTG
AAAGAGTTAG CCTCGAATGC TTTCACCATT CTGCACGAAT TCGACAACAC GGAACTTGCG
ATGTTTCTCT GGGGGTTGTC AAACGCGGGT TACTACCTGG ACGACGACGA TGCGATGGAG
ATATTCAAAG TAGTTGAGAG GCGAGCGAGC GGCGCCAAAC GATTGGAGCC ACAACAAATT
GCCTTGATCA CGGGATCGTT TGCGACGTTC ACGGACGACG TCACGGTGAC GCACTACAAT
TGCGAGACCG GAAGTCCGCT TTCGCATCTT CGAATGAGTG CGCAAACACG AGCGGCGCTC
AAATCCGCGA TCGTAGCTCT GGAGAACACC TTTCTGGCCT CCATCGCGAA GTGCAACATG
GACGATTTGA GCTACGTAAT GTGGGCGTTC GCGCACCTGG AGCATCGGCC CAGTGACGAG
TTTGTTCGAC GGCTCGAGGA AGAAGCGATC GATAAAATCG AAGAAGCGAG CGCAAAAAAT
CTATCAAACC TATTGTACGG ATTTGGAACG CTAAACTTGG CAGGACTCGG AGTGTTTACG
CACGCCATGT TTTGCGTCTC GCAAAAACTG GAAGAGTTTA CACCAGTTGG AATCTTCATG
GTTTGCTCGG CTTTGGCGAG TAGTAACTAC GATCCGGGAC CACAGATGAT GTTACAGTTC
GAAAACAAGC TCATGAAATC GGCGCATGCG TTTGAATCGC AAGATTTCAC CGAATTCTTG
CGCGTATTCG CGAGGCTTAG GTACATGCTC GCGGACGAAA CTTTCGATTT CATCGGCGTA
AGCTCGGCGA AGACGCTTGA TCGCTTCGAC TCATACAGGA TAAGCATGAC TTTATGGTCG
CACGCGACGC TGTGCGCACA GCCTCATGAT GCTTTACTCG CGCGGATTGA AGACGAGATT
CGAGGTTCTG CGTCCCAGTT CAAACCGCAA AACTTTGTGT TGGCTTTGTG GTCGCTCGTT
TTGCTCGGCT CACTGGAAGA CGCGCGAGAC AGTGTCGTGC GTGTCTTGCA CGCGCTCGTC
AAACTTCAAG GTGGTGCTTT GACGAGTTCG GAAGACTTGG AGGATGCGCA ACTATGCTCT
CTCTACATGG CGCGTCTCAC TTCTATGGGA AAGCCGTTCG AAGAGCTCAT CCTTGGCGTC
ACGGATGGCG TCGCCGACGA ATGCGAGCGT GCCTGGCTCC GCGCAAAGGC TCAAGATCCG
ACGATTAGCA AAGTTCAGCA TCACATCGGC GAAGTACTCC GCGAAATTGG CGCGCAAGAT
TTTGAGGTTG AAGCGCTCGT GGAGGGCGGC AAGATTCGTT CTGATATCGT ATTCCCGAAC
TCGCGAATCG TCGTTGAAGT CGACGGTCCG CATCACTACA GCCGCGACGC ATCCGGTCGT
CTTCGCGAGC TCGGTCAAAC CGTCATGCGC AACAATCTTT TAAAATCATG GGGTTGGCGT
GTCGTCATCG TTCCCTACGC CGATTGGGGC GACATGCTCA CCATTGAAGA GAAAGCGTCG
TATTTACGTT CTCTCCTCGG CGACGAAGTC TTCGTCGCGT AG
 
Protein sequence
MTAEKKKMGK KERRAAFGPS VALNKELMSC ETLEALAETS RRVARDMSAV NVATTYGKLA 
RFARGGRGKV SEEIKRSEWF AELEERAMEK LGECETKARA VAQIAWACGY LDRGRHLHGE
DAFWDELERA IEREISKCEP QGASNVAWAY AKLEMRMPNG IRNAIEKHIV QNASAYKPFE
LSITFWALTK FGDIPTEEML DVFEREMRLQ SCGSQELTNI ASAYARISGR RVRQGTQGFL
KELASNAFTI LHEFDNTELA MFLWGLSNAG YYLDDDDAME IFKVVERRAS GAKRLEPQQI
ALITGSFATF TDDVTVTHYN CETGSPLSHL RMSAQTRAAL KSAIVALENT FLASIAKCNM
DDLSYVMWAF AHLEHRPSDE FVRRLEEEAI DKIEEASAKN LSNLLYGFGT LNLAGLGVFT
HAMFCVSQKL EEFTPVGIFM VCSALASSNY DPGPQMMLQF ENKLMKSAHA FESQDFTEFL
RVFARLRYML ADETFDFIGV SSAKTLDRFD SYRISMTLWS HATLCAQPHD ALLARIEDEI
RGSASQFKPQ NFVLALWSLV LLGSLEDARD SVVRVLHALV KLQGGALTSS EDLEDAQLCS
LYMARLTSMG KPFEELILGV TDGVADECER AWLRAKAQDP TISKVQHHIG EVLREIGAQD
FEVEALVEGG KIRSDIVFPN SRIVVEVDGP HHYSRDASGR LRELGQTVMR NNLLKSWGWR
VVIVPYADWG DMLTIEEKAS YLRSLLGDEV FVA