Gene OSTLU_18224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18224 
Symbol 
ID5005250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp681792 
End bp683003 
Gene Length1212 bp 
Protein Length403 aa 
Translation table 
GC content59% 
IMG OID640420671 
Productpredicted protein 
Protein accessionXP_001421532 
Protein GI145354522 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0263] Glutamate 5-kinase 
TIGRFAM ID[TIGR01027] glutamate 5-kinase 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCCGC GACCGAACTC GCCCGCGGGC GCGAATCAAG CGCCGATCGT GGTGGTGAAA 
GTCGGGACGT CGTCGCTGCT GAAAGGGGGG ACGAGCGGAC ACCTGCACCT GTCGAATTTC
GGGATGCTCG CGGAGACGTG CTGCGAGCTG CAGCGACGAG GGATGCGGTG CGTGGTGGTC
ACGAGCGGGG CCGTCGGCGT GGGATGTCAG GTGCTGAATA AGAAGAAACC GCAGGATTTA
GCGGGAAAAC AAGCTATGGC GGCGGTCGGG ATGGTGCGCC TCATGCGAAT GTATGATGAT
TTTTTTCAAA GCGTCGGGCA GCCCGTGGCG CAGGTGCTCA TCTCGCTCGA TAACATCATG
GATAGGCAAC AGTATATGAA CGCGCAGAGC ACGTTTCGGT CGTTGCTGGC GCAAGATATT
ATTCCAATCG TGAATGAAAA CGATACGGTG GCGGTGCAGC ACACGAAGTT TGGGGACAAC
GATACGCTGA GCGCGCACGT GGCGGCGCTC GTCGACGCCG ATTACTTGTT CTTGCTCACC
GACGTCGATG GTTTATACAC CGCGAACCCG AACACGAACC CGGACGCGAC GAGAATCTCT
GTGGTTGAAA ATATCGACGA TTTGGAAGTC TCCACCGACG ACGCCGGGGC GAGTGGTTTG
GGCACGGGCG GGATGGCGAC CAAGCTCTCC GCCGCGCGTC TCGCCGCAGC GAGCGGATGC
AACACCGTCG TCATGCACGC CAACGCGTTG CCGGACCTTT CGGATATCAT TTTACACCAG
AAGTCGGTCG GCACGCTCTT CCTCGCCATG CCTCGGCCTC TGCGCGGGCG AAAGCGCTGG
ATCTTGTTGC TTCCGCCTTC GGGCGACTTG GTCGTCAACG CCAACGCGGC GCGCGCCATG
GAAACCAATA AGTCGCTCTT CTCTACGGGC ATCGTCGCCT GCCGCGGCGA CTTTGTCGCG
GAAGACGGCG TGAGAATCTT GACTATCGAT TCCGAAACTG GCGACGAGCG CGATCTTGGC
CGCGCTATCA CCAACTACTC GTCGGAGGAA ATCGAAACCT TCATCGGCAA GTCAGCGGAT
GAATTCTACG AAATAGTCGG TTACGCCGGC GCTGAAAGCA TCGCCCACCG CAACAACATC
TGTTGCTGGA TTCCGTTCGG TCAAGAGCCA AGCTCGCTCA ATTTGGCTGG CATGACCGGC
GGAAGCGACT GA
 
Protein sequence
MVPRPNSPAG ANQAPIVVVK VGTSSLLKGG TSGHLHLSNF GMLAETCCEL QRRGMRCVVV 
TSGAVGVGCQ VLNKKKPQDL AGKQAMAAVG MVRLMRMYDD FFQSVGQPVA QVLISLDNIM
DRQQYMNAQS TFRSLLAQDI IPIVNENDTV AVQHTKFGDN DTLSAHVAAL VDADYLFLLT
DVDGLYTANP NTNPDATRIS VVENIDDLEV STDDAGASGL GTGGMATKLS AARLAAASGC
NTVVMHANAL PDLSDIILHQ KSVGTLFLAM PRPLRGRKRW ILLLPPSGDL VVNANAARAM
ETNKSLFSTG IVACRGDFVA EDGVRILTID SETGDERDLG RAITNYSSEE IETFIGKSAD
EFYEIVGYAG AESIAHRNNI CCWIPFGQEP SSLNLAGMTG GSD