Gene OSTLU_46533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46533 
Symbol 
ID5003704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp5895 
End bp9190 
Gene Length3296 bp 
Protein Length1088 aa 
Translation table 
GC content58% 
IMG OID640419125 
Productpredicted protein 
Protein accessionXP_001419673 
Protein GI145350565 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.363787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GATCCCGCGG CGACGACGGC GATGCTGGAC GAGCCTTTGA CGCCGCACGA AGAGATTCGT 
CGAGGAACGC TCGCGAATGG GTTGAAGTAC GTCATCTTAC CGAACAAAGT TCCCGAGGGG
AGATTTGAGG CGCACTTGGA GATGCACGTC GGGTCCGTGG ACGAACGCGA AGACGAGCAA
GGGCTGGCGC ACCTCGTCGA GCACGTCACG TTTTTGGGTT CTCGAAAACG AGATCAGTGG
CTCGGGAGCG GTACGCGAGG GAACGCGTAC ACGGATTTCC ACCACACCGT GTTCCACATC
CACGCACCGA CGACGAACAA GGATGGACAT TACATGCCGC CCAACGTGCT GGACATTCTC
TACGACGTCG CGTTTGCGCC GCAACTCTTG GACACTCGCG TGGCGAAGGA GAAGAAAGCC
GTGCTCGCCG AGGCTCAAAT GATGAACACG ATCGAGTACA GAGTGGACTG TCAGCTTCTA
GAGCATTTGC ACTGGGACAA CCTCTTGGGC ACGCGATTTC CGATCGGTAA GCTCGATCAG
GTGGAAGCGT GGCCGGCGCA GGCGGTGCGA GACTTTCACG CGCGCTGGTA TTTCCCAGCC
AACGCCACGC TCTACGTCGT GGGCGACTTC GATGCCTCCG TGGACGAAGT CGAGGGTATG
ATTTCCGCCG CGTTCGACGA AGCGGCGCCC GCGGAAGGCG CGGAAGAAGC CGAGTCTCCT
CTCAAACGTC ACGCGGTGCG TCCTCCGGTG AAGCACGCGT ACGGGGCGCC GTCGTACGAG
CTCGACGAAA TTCAACGCTC GAACGACGAG GCGAAGGCGT CGGGCAAAGA CGACCCGTTC
TTGCCTTTCG TCGCCCCCGA GGGTAAAGTG TCCATGTTCC AGCACGAACA CTTGTCCAAC
GCGAGTTTCA ACATATTTTC AAAATTGGCC GTCAAGCCGC TCGAAAAGAT GGGCGACTTG
CACCGGACGA TTTTGCAGCG CATTGTGTTG TTGGTGTTGC AGAGCCGCAT TCAGGCGAGA
TACGCGGAAA CTAATGCGGA TTACAAACGC ATTGAACTCG ATCATAGCGA CAGCGCGAGA
GAGGGGTGTT GCGTCAGCAC CGTCACCGTG ACGTGCGAGC CGAGAGAGTA CGAATTCGCC
TTGCAAGTCG CCGTCGAGGA ATCTCGACGA TTGCAAAAGT TTGGATTGAC GCCGAGCGAG
CTCGATCGTT TTAAGGCGGC GATGTTGAGA GATTCTGAAC AGCTCGCGCA GCAGGCGGGA
TGCGTGCCGA GCCTCGAAAA TCTCGACTTT GTCATGGAGC AGGACGCCTG CGGACACGTC
GTCATGGACC AGGTCGCGGG TCACGAGGCC TTGGTGCGCA TGTCTGATTA CATCACGCTC
GAGGCGTGCA ACGGCGCGTG CGATGAGCTT TTAGGTTTTA TCGGTGAGTA CGGCGTGGAG
AACAATCGCA AACCGAACAG CGGTAAGTGC ACCGCCATCG TCGCGTGCGT CCCAGCGACG
ATGACGAACG TCGACGGCGA GACGGTGCCG TTCGATATCA CGCCCGAGCG CATAGAACAA
GTTTTGGCGG CGGATTACGG TGAAATCACA GAGCCCGAGG ATATCTTTGT GCCGGAGGTG
TTGATTGCGG ATGACGAAAT CAACGCCCTC ATCGAGCAAA CGGCGCCGAC GTTTACCGAG
GCGACGTATC ACGAGCCGAC CGGCGTGTAC CAGCGCACGC TGAGTAACGG TATTCGCGTC
AACTACAAGG TGCTCGACGC CGAGCCTGGG AGCGCGTTCT TACGACTCGT CGTCCCGGGT
GGTCGTTCGG TTGAGAGCCC GAACATCGGG CCCGGCGGAA TCGGTTCGTC GGCGGTCGGT
TTGCGCACCG TGCAAGAGGC TGGCGACGTC GGCGAATGGT CGCGCAAGCA AATCGAGCTC
TTGACGATGC AACACTTGCT CGTGTACGAC GTCGAGCCCG AGGTCGAGTA CATGTTTTTG
GATTCCGCGT TCGCCACGGA TGGAGGCTTG AGGACTATTC TCGAGATCAT GCACCTCACG
CTGACGAAGC CGACGTGGGA CGAGCAAGCG CTCGAGCGCG TCAAGGACAT TTACCGCATG
TTTCAGATTA ACACGAACAA GAACATCGAG TTGCTCACGC ATGACACCGT CAACGCCGTG
ATTTACGAAG ACCGAAGAAT CATGGACCCG AACAAGGAGG CACTCCAGGC GCTGACGCTG
GATGGCGTTC GAGACATGAT CGAAGCTCAA TTCGCGAGCG GGGCTTTGGA ACTCAACATC
GTCGGCGACA TCATCCCCGA AGAAGTCGAC GAAATGGTTC TCGCGTTTAT GGGATCGATT
CAGACCAAGC CCGCGCCGCC GTTGCCGCAA GTGCCCCCGC TGAAATTGAA AGAGGTGCCA
AAGGATGATC CGGTGCGCGC GCAGCGGTTG TGGTTGAAGG ACAGCGACGA AAGAGCGTGC
GCGGTGGCCG CCGGACCGGG GCCATCCATG TGGGCGCCGA TGACGAGTCT GTACTCTGAA
ATTCCGAGCT TTGTCAATCA AGACGCGTAC ACGCCGGAAC AGATTGCAGC CTTGCGCGAT
CCCGTGAACG AGGTGGCGTC GGCGAAGGGG AACCCATTCG CACAGCAGTC TGCGCGACGC
GCGAATTCTT TGGCGACGTA TTGCGCGGGG ATGATGCTCG CAGAAGTTAT CGGTGGTAGA
TTGTTCACAA CGGTGCGCGA CGCGTTGGGT CTGACGTACG ACTGCAACTT TACAATGTCG
TTTGGATTAC AAAACAACGA CGCGACGACG TACAGGTTGT TGGTGACGTC GACGCCGGAA
AAAATTGATG AAGCTCTCAA CGCCGGCGTG CGCGTGTTGC GTGGATTTCA GATGCAGCGC
GTGAGCCAAC GCGAGGTCGA TCGCGCGCGT TTAACGCTGT TGAGTCGTCA TGAGATGGAT
TTGAAGACGA ACAACTACTG GGCCGATTTG ATGCAGTGCA CCAACACTCC AGATTTAGCG
CCGCTGAAGA AGATTCAGTG CGTCGCTGAT TTACCCCTCA TGTACGACGC GATGACCGTG
GACGATCTTC AAGAAGTTTA CGATCGTCTC GGCTTGAGCG AAGGCGAGAT CTTCACCAGC
GTGACGATCG CCGGGCAGAC AGAGCCGCCT CCATTCCTGT CGAAGAAGGA CGTCGACGCC
GCCGCGCTCG CCGCCCAGAC GCTGACCGCC GCGCTTGGTG GCATCAACAT CGCCGAAGCG
ATCAAGAGAT TGAAGAAAGA GCAGTCTCCG TCCGCGAGCG CGGAGTAGTT GCTGAT
 
Protein sequence
MLDEPLTPHE EIRRGTLANG LKYVILPNKV PEGRFEAHLE MHVGSVDERE DEQGLAHLVE 
HVTFLGSRKR DQWLGSGTRG NAYTDFHHTV FHIHAPTTNK DGHYMPPNVL DILYDVAFAP
QLLDTRVAKE KKAVLAEAQM MNTIEYRVDC QLLEHLHWDN LLGTRFPIGK LDQVEAWPAQ
AVRDFHARWY FPANATLYVV GDFDASVDEV EGMISAAFDE AAPAEGAEEA ESPLKRHAVR
PPVKHAYGAP SYELDEIQRS NDEAKASGKD DPFLPFVAPE GKVSMFQHEH LSNASFNIFS
KLAVKPLEKM GDLHRTILQR IVLLVLQSRI QARYAETNAD YKRIELDHSD SAREGCCVST
VTVTCEPREY EFALQVAVEE SRRLQKFGLT PSELDRFKAA MLRDSEQLAQ QAGCVPSLEN
LDFVMEQDAC GHVVMDQVAG HEALVRMSDY ITLEACNGAC DELLGFIGEY GVENNRKPNS
GKCTAIVACV PATMTNVDGE TVPFDITPER IEQVLAADYG EITEPEDIFV PEVLIADDEI
NALIEQTAPT FTEATYHEPT GVYQRTLSNG IRVNYKVLDA EPGSAFLRLV VPGGRSVESP
NIGPGGIGSS AVGLRTVQEA GDVGEWSRKQ IELLTMQHLL VYDVEPEVEY MFLDSAFATD
GGLRTILEIM HLTLTKPTWD EQALERVKDI YRMFQINTNK NIELLTHDTV NAVIYEDRRI
MDPNKEALQA LTLDGVRDMI EAQFASGALE LNIVGDIIPE EVDEMVLAFM GSIQTKPAPP
LPQVPPLKLK EVPKDDPVRA QRLWLKDSDE RACAVAAGPG PSMWAPMTSL YSEIPSFVNQ
DAYTPEQIAA LRDPVNEVAS AKGNPFAQQS ARRANSLATY CAGMMLAEVI GGRLFTTVRD
ALGLTYDCNF TMSFGLQNND ATTYRLLVTS TPEKIDEALN AGVRVLRGFQ MQRVSQREVD
RARLTLLSRH EMDLKTNNYW ADLMQCTNTP DLAPLKKIQC VADLPLMYDA MTVDDLQEVY
DRLGLSEGEI FTSVTIAGQT EPPPFLSKKD VDAAALAAQT LTAALGGINI AEAIKRLKKE
QSPSASAE