Gene OSTLU_52090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_52090 
Symbol 
ID5007018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp312631 
End bp315897 
Gene Length3267 bp 
Protein Length1088 aa 
Translation table 
GC content58% 
IMG OID640422439 
Productpredicted protein 
Protein accessionXP_001422868 
Protein GI145357321 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.0580796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACG AGCCTTTGAC GCCGCACGAA GAGATTCGTC GAGGAACGCT CGCGAATGGG 
TTGAAGTACG TCATCTTACC GAACAAAGTT CCCGAGGGGA GATTTGAGGC GCACTTGGAG
ATGCACGTCG GGTCCGTGGA CGAACGCGAA GACGAGCAAG GGCTGGCGCA CCTCGTCGAG
CACGTCACGT TTTTGGGTTC TCGAAAACGA GATCAGTGGC TCGGGAGCGG TACGCGAGGG
AACGCGTACA CGGATTTCCA CCACACCGTG TTCCACATCC ACGCACCGAC GACGAACAAG
GATGGACATT ACATGCCGCC CAACGTGCTG GACATTCTCT ACGACGTCGC GTTTGCGCCG
CAACTCTTGG ACACTCGCGT GGCGAAGGAG AAGAAAGCCG TGCTCGCCGA GGCTCAAATG
ATGAACACGA TCGAGTACAG AGTGGACTGT CAGCTTCTAG AGCATTTGCA CTGGGACAAC
CTCTTGGGCA CGCGATTTCC GATCGGTAAG CTCGATCAGG TGGAAGCGTG GCCGGCGCAG
GCGGTGCGAG ACTTTCACGC GCGCTGGTAT TTCCCAGCCA ACGCCACGCT CTACGTCGTG
GGCGACTTCG ATGCCTCCGT GGACGAAGTC GAGGGTATGA TTTCCGCCGC GTTCGACGAA
GCGGCGCCCG CGGAAGGCGC GGAAGAAGCC GAGTCTCCTC TCAAACGTCA CGCGGTGCGT
CCTCCGGTGA AGCACGCGTA CGGGGCGCCG TCGTACGAGC TCGACGAAAT TCAACGCTCG
AACGACGAGG CGAAGGCGTC GGGCAAAGAC GACCCGTTCT TGCCTTTCGT CGCCCCCGAG
GGTAAAGTGT CCATGTTCCA GCACGAACAC TTGTCCAACG CGAGTTTCAA CATATTTTCA
AAATTGGCCG TCAAGCCGCT CGAAAAGATG GGCGACTTGC ACCGGACGAT TTTGCAGCGC
ATTGTGTTGT TGGTGTTGCA GAGCCGCATT CAGGCGAGAT ACGCGGAAAC TAATGCGGAT
TACAAACGCA TTGAACTCGA TCATAGCGAC AGCGCGAGAG AGGGGTGTTG CGTCAGCACC
GTCACCGTGA CGTGCGAGCC GAGAGAGTAC GAATTCGCCT TGCAAGTCGC CGTCGAGGAA
TCTCGACGAT TGCAAAAGTT TGGATTGACG CCGAGCGAGC TCGATCGTTT TAAGGCGGCG
ATGTTGAGAG ATTCTGAACA GCTCGCGCAG CAGGCGGGAT GCGTGCCGAG CCTCGAAAAT
CTCGACTTTG TCATGGAGCA GGACGCCTGC GGACACGTCG TCATGGACCA GGTCGCGGGT
CACGAGGCCT TGGTGCGCAT GTCTGATTAC ATCACGCTCG AGGCGTGCAA CGGCGCGTGC
GATGAGCTTT TAGGTTTTAT CGGTGAGTAC GGCGTGGAGA ACAATCGCAA ACCGAACAGC
GGTAAGTGCA CCGCCATCGT CGCGTGCGTC CCAGCGACGA TGACGAACGT CGACGGCGAG
ACGGTGCCGT TCGATATCAC GCCCGAGCGC ATAGAACAAG TTTTGGCGGC GGATTACGGT
GAAATCACAG AGCCCGAGGA TATCTTTGTG CCGGAGGTGT TGATTGCGGA TGACGAAATC
AACGCCCTCA TCGAGCAAAC GGCGCCGACG TTTACCGAGG CGACGTATCA CGAGCCGACC
GGCGTGTACC AGCGCACGCT GAGTAACGGT ATTCGCGTCA ACTACAAGGT GCTCGACGCC
GAGCCTGGGA GCGCGTTCTT ACGACTCGTC GTCCCGGGTG GTCGTTCGGT TGAGAGCCCG
AACATCGGGC CCGGCGGAAT CGGTTCGTCG GCGGTCGGTT TGCGCACCGT GCAAGAGGCT
GGCGACGTCG GCGAATGGTC GCGCAAGCAA ATCGAGCTCT TGACGATGCA ACACTTGCTC
GTGTACGACG TCGAGCCCGA GGTCGAGTAC ATGTTTTTGG ATTCCGCGTT CGCCACGGAT
GGAGGCTTGA GGACTATTCT CGAGATCATG CACCTCACGC TGACGAAGCC GACGTGGGAC
GAGCAAGCGC TCGAGCGCGT CAAGGACATT TACCGCATGT TTCAGATTAA CACGAACAAG
AACATCGAGT TGCTCACGCA TGACACCGTC AACGCCGTGA TTTACGAAGA CCGAAGAATC
ATGGACCCGA ACAAGGAGGC ACTCCAGGCG CTGACGCTGG ATGGCGTTCG AGACATGATC
GAAGCTCAAT TCGCGAGCGG GGCTTTGGAA CTCAACATCG TCGGCGACAT CATCCCCGAA
GAAGTCGACG AAATGGTTCT CGCGTTTATG GGATCGATTC AGACCAAGCC CGCGCCGCCG
TTGCCGCAAG TGCCCCCGCT GAAATTGAAA GAGGTGCCAA AGGATGATCC GGTGCGCGCG
CAGCGGTTGT GGTTGAAGGA CAGCGACGAA AGAGCGTGCG CGGTGGCCGC CGGACCGGGG
CCATCCATGT GGGCGCCGAT GACGAGTCTG TACTCTGAAA TTCCGAGCTT TGTCAATCAA
GACGCGTACA CGCCGGAACA GATTGCAGCC TTGCGCGATC CCGTGAACGA GGTGGCGTCG
GCGAAGGGGA ACCCATTCGC ACAGCAGTCT GCGCGACGCG CGAATTCTTT GGCGACGTAT
TGCGCGGGGA TGATGCTCGC AGAAGTTATC GGTGGTAGAT TGTTCACAAC GGTGCGCGAC
GCGTTGGGTC TGACGTACGA CTGCAACTTT ACAATGTCGT TTGGATTACA AAACAACGAC
GCGACGACGT ACAGGTTGTT GGTGACGTCG ACGCCGGAAA AAATTGATGA AGCTCTCAAC
GCCGGCGTGC GCGTGTTGCG TGGATTTCAG ATGCAGCGCG TGAGCCAACG CGAGGTCGAT
CGCGCGCGTT TAACGCTGTT GAGTCGTCAT GAGATGGATT TGAAGACGAA CAACTACTGG
GCCGATTTGA TGCAGTGCAC CAACACTCCA GATTTAGCGC CGCTGAAGAA GATTCAGTGC
GTCGCTGATT TACCCCTCAT GTACGACGCG ATGACCGTGG ACGATCTTCA AGAAGTTTAC
GATCGTCTCG GCTTGAGCGA AGGCGAGATC TTCACCAGCG TGACGATCGC CGGGCAGACA
GAGCCGCCTC CATTCCTGTC GAAGAAGGAC GTCGACGCCG CCGCGCTCGC CGCCCAGACG
CTGACCGCCG CGCTTGGTGG CATCAACATC GCCGAAGCGA TCAAGAGATT GAAGAAAGAG
CAGTCTCCGT CCGCGAGCGC GGAGTAG
 
Protein sequence
MLDEPLTPHE EIRRGTLANG LKYVILPNKV PEGRFEAHLE MHVGSVDERE DEQGLAHLVE 
HVTFLGSRKR DQWLGSGTRG NAYTDFHHTV FHIHAPTTNK DGHYMPPNVL DILYDVAFAP
QLLDTRVAKE KKAVLAEAQM MNTIEYRVDC QLLEHLHWDN LLGTRFPIGK LDQVEAWPAQ
AVRDFHARWY FPANATLYVV GDFDASVDEV EGMISAAFDE AAPAEGAEEA ESPLKRHAVR
PPVKHAYGAP SYELDEIQRS NDEAKASGKD DPFLPFVAPE GKVSMFQHEH LSNASFNIFS
KLAVKPLEKM GDLHRTILQR IVLLVLQSRI QARYAETNAD YKRIELDHSD SAREGCCVST
VTVTCEPREY EFALQVAVEE SRRLQKFGLT PSELDRFKAA MLRDSEQLAQ QAGCVPSLEN
LDFVMEQDAC GHVVMDQVAG HEALVRMSDY ITLEACNGAC DELLGFIGEY GVENNRKPNS
GKCTAIVACV PATMTNVDGE TVPFDITPER IEQVLAADYG EITEPEDIFV PEVLIADDEI
NALIEQTAPT FTEATYHEPT GVYQRTLSNG IRVNYKVLDA EPGSAFLRLV VPGGRSVESP
NIGPGGIGSS AVGLRTVQEA GDVGEWSRKQ IELLTMQHLL VYDVEPEVEY MFLDSAFATD
GGLRTILEIM HLTLTKPTWD EQALERVKDI YRMFQINTNK NIELLTHDTV NAVIYEDRRI
MDPNKEALQA LTLDGVRDMI EAQFASGALE LNIVGDIIPE EVDEMVLAFM GSIQTKPAPP
LPQVPPLKLK EVPKDDPVRA QRLWLKDSDE RACAVAAGPG PSMWAPMTSL YSEIPSFVNQ
DAYTPEQIAA LRDPVNEVAS AKGNPFAQQS ARRANSLATY CAGMMLAEVI GGRLFTTVRD
ALGLTYDCNF TMSFGLQNND ATTYRLLVTS TPEKIDEALN AGVRVLRGFQ MQRVSQREVD
RARLTLLSRH EMDLKTNNYW ADLMQCTNTP DLAPLKKIQC VADLPLMYDA MTVDDLQEVY
DRLGLSEGEI FTSVTIAGQT EPPPFLSKKD VDAAALAAQT LTAALGGINI AEAIKRLKKE
QSPSASAE