Gene OSTLU_31874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31874 
Symbol 
ID5001998 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp684984 
End bp688354 
Gene Length3371 bp 
Protein Length1060 aa 
Translation table 
GC content54% 
IMG OID640417419 
Productpredicted protein 
Protein accessionXP_001418083 
Protein GI145347243 
COG category[R] General function prediction only 
COG ID[COG1204] Superfamily II helicase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.630707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGCGC GTCCGCGTCG GGACGCGCTG CGCGCGACGA CGGACATAGA ACCTTTAGAC 
ACGGCCAGAG CGTTTGATTT CAAGTGCGTT CGCAACGGTC GCGCGGGAAC GACGGTGTCG
CGCGCGACGC GACGGACTGA CGCGAAGGCG TCGCGACGAC GCGCGCGTCG TAGGTATTTC
AACAGCATTC AGAGCGAGAT GCTCGAGTTT TTGGTGAACA CGCGACGGAG CTTCGTGATG
AGTGCGTGAC GCGACGCGAA CGCGAGACGC GCGGCGAGTG CCTCGTTTTG GAAAGGACTG
ACGAATGAAC GCGTGATCGC GCGTCGAACG CGAACGCAGG CGCGCCGACG GGGAGCGGGA
AAACGACGGT GTTTGAGTTG GCGATGTTGG CGGCGTTGCG AGGACCGAAC GCGACGGAGG
GGGGATTGAG CGCGGCGCGA GGGCGGAGAA AGGTGATTTA TCTCGCGCCG AGTCGGGCTT
TGGTGAGCGA AAAGGCGCGC GAATGGCGAG AACGATTAGG CCGGATCGGA ATCACGTTTG
CGGAGTTGAC GGGAGATAAC GATTTTGGTG GGGACGTGTG GGGGGAGATT GAGAATGTCG
ATGCGATTTT GGCCACGCCG GAAAAATTCG ACCGCGTGAC TCGTCTGGAC GCGAACCGCG
GCGGGATGTC GTTCTTCAGC GACGTCGCCG CGGTGTTGAT TGATGAAGTG CATCTCATCG
GTGACATTCG TGGAGGGTGT TTGGAAGCCA TTGTTAGTCG TTTAAAGTTG CTGAGCAAGT
CGTCGGCTCT ATTGCAATCG CACTTGCGAA ACGTTCGCTT TGGCGCCGTG AGTGCGACGA
TTCCGAACAT TGAAAACCTC GCGAATTGGC TCGGCGCCAA TCGCGATGGC ACGTTTGTGT
TCGGTGAAGA GTTTCGACCT GTCAAGTTAC AGACGTACGT GCGCAGTTTT CCGGACACAA
CGAGCGATTT TTTATTCAAC AAGTATCTCA AGCAAAAAGT GTTTGCGGTG ATTCGGGAAT
TCTATCGCGG CAAGCAGACG CTCGTGTTTT TGGGCAGTCG CAATGACGCG CAGCAGACGG
CGAAGCAGCT CGTGGTGGAC TCTCGTAGAC AGTTCGTAAA CCCACAACTC TCCCAATTCT
TGCTCGAAGC GTCCATGCAA GCGCAGAACA AACATCTTGC TGAGTGTATC ACCGCCGGCG
TAGCGTTTCA TCACGCCGGG CTGGAACGAG GCGATCGCGA GTTGGTCGAG GGATTGTTTT
GCTCTCGCGC TATTATGGTT TTGTGCAGTA CGAGCACTTT AGCTGTGGGC GTTAATCTTC
CAGCGTACTT GTGCGTCGTC GCTGGTACGG ATATTTACGA CGGTGGTGGC GCGTACAAGG
AAATATCCAT GGACACCCTA CTTCAGATGA TCGGCCGCGC GGGGCGACCT CAATTCGACA
CCGAGGGCGT GGCGGTGGTG ATGACGAAGA ACTCTTTGCG ATCGCGCTAC GAGGGATTAG
TTCATGGCAA GTATCCGCTC GAGTCAAGTC TCGGTTCATC GCTGCCTGAG TACTTAAACG
CTGAAATAAG CTGTCGAACT GTGAACACGG TGGACGACGT CCTGGAGTGG GTGCAAAGCA
CGTATTACTA TATTCGCGTT TGCGCGGAAC CAAGAAAGTA CGGAATTAAG AACGATGATA
CGGTGACAGA ACACGTGAAG CGGTTAATCA AGGCGACTTT GGATGAGCTT ATTGCGTCAG
GAATGTGCGC CGTGGTAGGA AATAGCGCGC TTCAACCGCT CAAAGCCGGA GACATCATGT
CTCTTCGTTA TTTGCGCTTC AAAACGATGA AGAACATCAT GCGAAACGTC GCGACACCGT
CGTACGCGGA TCTTTTAAGA ATGTTGTGTG AGAGTTACGA GTTCAAAGAC ATCAAGCTTC
GCCGAGATGA AAGGAAGTTG TTGAAGCAGT TGAACACAGA TCAAAAGATT ATTCGCTTTC
CCGTGCAAGA GACTACTGGA AAGTCGCAAA AGTTGTCCTT AGCGAAGGTA ATTCGTACGC
CGGGCGAAAA GCTGTATCTT TTGGCGCAGT ATATTTTAAC TGACGTGGTC GAACCGAGCA
TCACGCTCTC GCATCCGTCG ATGCGAATGG AAGGTGATAA AGTTCTTCAG CTCGGAACTC
GCATCATGCG CGCGGCGTCA GAGTACTATC AATCCACGTT GACGTTTACC GCTGCAGCGA
ACGCCTTTTC GCTCGCCAAA GGTCTCGACG TGCACATGTG GCCGGACACG AAAGTGCAAA
TTCGACAGCT CAAGCACTCG CGCATGAAAA AAGTTATCGA AAAGCTCATT GAAGCTAGAA
TTATGACGCT CGAAGACGTT GAAGACGCAG ATCCGCGTCG CATCGAAATG AAGCTCGGCA
AGTCGTTTCC GTTTGGCAAC ACCCTTCAAG GTGACGTAAA GGATTTTCCA TCCGAGCTCA
AGGTTGAAAT GACGCATCAC GCCATGTCGA AGGAGTCCTA CAGCGTCGAC GTCAAGCTGT
CGTTTAAGAA TACTTCTGAG AGCCGCTTGA GCACGAATCA TCTACCGGCA AAATATCCGG
GGATGCTCTT CATCGGCTCA GAGCACGACG ACCGGTTGTT GTACGTTCAG CGTTTGCCGA
CGCGAGAGTT TGATTTGGAT GCTAACATGG ACGCATCCGC GAACATCGTC TTCCACGGCC
GGTTCACGTG TACGAACGTG TCGAGCGGAA ACGTCCCGCT TTGTTTCGTG GCGCGTGTGA
TTTTCGAACG TTGCATCGGT CGCGACGTCG TCGAATACCA CGTTGTCAAT CGCGGCGGCG
CGAGTCCGCG AACTCCGGAC AAATCAGGTG TTTCTCCCGC GAAGTCGCCC ACGTCGACGA
CGCCCGCGTC AACGATGAAA CAAACAAAGA TTATCGCTGA CGCGCACACC AAAAAATTCA
GACTGATGAA TTCGGACGAC GAAGTTCGCC GGAAGGTGGC ATTCGAAGAC TTTAAGCTGT
CTAACGAATC GTCGCCGAAC TCGAGTCCGC GTGAACGCAA AGCTACGTCG CGTTCAGAAG
ATGTTTGCAA TAGGTGCGGT GTCAAAGGCC ATTGGGCAAA GGACTGCTTA TATCCCGACA
ATCGACCCGA AGAGCTGCGA CCAGGGCCGA AACCGACGGA CAAATGTCGT AGATGCGGTG
AGCTTGGGCA CTTCGCTCGC GATTGTTCGT TCGACGAGGA CACATGCAAA ATTTGTCAGC
AGCACGGCCA TCGGGCTCGA GATTGTCCAA GCGTGGCCGA CGTCTTTGCT TCGCTTGACG
ACACGACAAC GACGGTGAAC GACGCATCCG ATTCGGACAA GGAAGAAAAC TGGGAGTTCG
TCTTTGGTTG A
 
Protein sequence
MYARPRRDAL RATTDIEPLD TARAFDFKYF NSIQSEMLEF LVNTRRSFVM SAPTGSGKTT 
VFELAMLAAL RGPNATEGGL SAARGRRKVI YLAPSRALVS EKAREWRERL GRIGITFAEL
TGDNDFGGDV WGEIENVDAI LATPEKFDRV TRLDANRGGM SFFSDVAAVL IDEVHLIGDI
RGGCLEAIVS RLKLLSKSSA LLQSHLRNVR FGAVSATIPN IENLANWLGA NRDGTFVFGE
EFRPVKLQTY VRSFPDTTSD FLFNKYLKQK VFAVIREFYR GKQTLVFLGS RNDAQQTAKQ
LVVDSRRQFV NPQLSQFLLE ASMQAQNKHL AECITAGVAF HHAGLERGDR ELVEGLFCSR
AIMVLCSTST LAVGVNLPAY LCVVAGTDIY DGGGAYKEIS MDTLLQMIGR AGRPQFDTEG
VAVVMTKNSL RSRYEGLVHG KYPLESSLGS SLPEYLNAEI SCRTVNTVDD VLEWVQSTYY
YIRVCAEPRK YGIKNDDTVT EHVKRLIKAT LDELIASGMC AVVGNSALQP LKAGDIMSLR
YLRFKTMKNI MRNVATPSYA DLLRMLCESY EFKDIKLRRD ERKLLKQLNT DQKIIRFPVQ
ETTGKSQKLS LAKVIRTPGE KLYLLAQYIL TDVVEPSITL SHPSMRMEGD KVLQLGTRIM
RAASEYYQST LTFTAAANAF SLAKGLDVHM WPDTKVQIRQ LKHSRMKKVI EKLIEARIMT
LEDVEDADPR RIEMKLGKSF PFGNTLQGDV KDFPSELKVE MTHHAMSKES YSVDVKLSFK
NTSESRLSTN HLPAKYPGML FIGSEHDDRL LYVQRLPTRE FDLDANMDAS ANIVFHGRFT
CTNVSSGNVP LCFVARVIFE RCIGRDVVEY HVVNRGGASP RTPDKSGVSP AKSPTSTTPA
STMKQTKIIA DAHTKKFRLM NSDDEVRRKV AFEDFKLSNE SSPNSSPRER KATSRSEDVC
NRCGVKGHWA KDCLYPDNRP EELRPGPKPT DKCRRCGELG HFARDCSFDE DTCKICQQHG
HRARDCPSVA DVFASLDDTT TTVNDASDSD KEENWEFVFG