Gene OSTLU_51965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51965 
Symbol 
ID5006620 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp348418 
End bp353638 
Gene Length5221 bp 
Protein Length1562 aa 
Translation table 
GC content59% 
IMG OID640422041 
Productpredicted protein 
Protein accessionXP_001422721 
Protein GI145357021 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.738666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0653658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGTCGCGATC GTGCGCGCGC CGGAAGGGCG CGCGAGCGAG CGAAGGCGAT GCAAGCGATG 
GGCGCGCGCG TAAGCCCCGT CGCGGTGACG GATGCGCGCG CGCGTCGCCA TCGCGCGCGC
GTGGGGGCGA AAAATGGTGC ACACGCGCAG GCGATGCGTA ACAAAAACGT CGCGAGCGCG
AGCGCGGGCA CGATGCGCGC GTTGGGGAAC CAAGCGGTGC GTAAGACGCG CGCGACGACG
CGCGAAGGGG GTTAGGGCTT GTTTTCCAAG AGTCACCCCG CGACGAATCA CGAACGGCGA
ATGCGCGTTT AAAAGACGGG CCGAAGGCGA GGCGCGAAGG CGTCGATCGA ACGTCGAACG
AAGAAGACTG ACGGTAGAGT GTGATCATTT ACATCGCGCA GAAGTTTACT CCGTCCCCGG
TGACGTTGCG TAAGCCCGCG GCGTTGCCCA AGTGCTCGAC GGTGCGCGTG CCGGGACGCG
ACCGCGCGAA GACGGTGTGC CTGGCGGAGC GAGGGTCGGT GTACGGCGAT TGGCCCGTGT
ATGAAGGCGG TGCGGTGGAT AACACGCAGC TTTTGGAAGA GCATGATGCG TGCGGGGTCG
GTTTCATCGC TTCGTTGAAG GGTGAGCGCA CGCACAAGAC TGTGAAAGAT TCCTTGATGG
CTGTCGGGTG CATGGAGCAC CGCGGGGCGT GCTCGGCGGA CAACGACTCT GGTGATGGTG
TCGGTGTCAT GACGCACATT CCGTGGAAGC TTTTGGACAA GTGGTGCGCG GCGAATGGTA
TCAGTGGATT CTCTGAAGGC TCCTCGGCGG TCGGTATGGT TATGTTGCCC ACCGACGGTG
CCAAGGCGGC TGAGGCGAAG AAGATTCTTG AGGCGAGCTG CGTCGCCGAA GGCCTCAAGG
TGCTCGGTTG GCGTGCGGTT CCGGTGGACA ACTCCGTCGT CGGTCCGCTC GCGAAGATGA
CGTGCCCGGT GCACGAACAG ATTCTCGTCG ACGGGGCGGG CTTGGAACGC GAAGAGCTCG
AGCGCAAGTT GTTCATCGCG CGCAAGACGT GCGAAAAGTC GGCGAGCTCC GACGCCGTGT
TAGCTGAGAG CTTCTACATT TGCACCTTGA GCTCCCGCAC GATTGTGTAC AAGGGTATGC
TTCGATCCGC GGTGTTGGGC AAGTTCTACA AGGATTTGGA AGACCCCGAC TACGAGTCGC
AGTTCTCGAT TTACCACCGC CGTTTCTCCA CGAACACGAC GCCGAAGTGG CCACTCTCGC
AACCGATGCG TTTCTTGGGA CACAACGGTG AGATCAACAC CCTTCAAGGT AACTTGAACT
GGATGGCGTC TAAGGAAGCT GATATGGAGA ACCCGATCTG GGGCGGTCGT GAACCGGAAT
TCCGCCCGAT CTGTAATCCG GCTGCCTCCG ATTCCGCCAA CCTCGACAGA GTCGCGGAGC
TCCTGGTGAG AACCGGCCGC GCGCCGGCGG AGACTATGAT GTTGCTTGTG CCAGAAGCGC
ACCGCAACCA CCCCGAACTC GATGCGACGT TCCCGGAAGT TCATGATTTC TACGATTATT
ACGCTGGGAT GCAAGAAGCG TGGGATGGTC CAGCGTTGCT CGTCTTCTCC GATGGCAAGC
AGCTCGGCGC CCGCCTCGAC CGCAACGGGT TGCGCCCGGC GCGCTTCTGG CGCACGTCCG
ATGATTACAT CTACGTCGCC TCGGAAGTTG GTGTCCTCGG TGATGTCATG TCCAACGCGT
CCAATGTCGT CTCCAAGGGC CGTCTCGGCC CGGGTATGAT GATTTACGCT GATTTGGAGA
CGGGCGAGTT CAAGGAAAAT ACGGAAATCG CGAAGGAAGT CTCCGCGCGC CTCCCGTACG
GCGAATGGAT GAAGGCCATC GATCGCGTCA AGGGCATCGA ACCCATTGGC GCGACGCAAC
TGGACCCGAT CCAACTCATC GAGTGCCAAG CTCGCGCCGG TTACGCCGCC GAAGACATCA
CGATGATCAT TGAATCGATG GCGTCCGATG CCATCGAACC CACTTGGTCG ATGGGCGACG
ACACCCCGAT GCCCGTCTTG TCTGGCCGAC CGCGCTTGCT TTATGACTAC TTCAAGCAAC
GCTTCGCGCA AGTTACCAAC CCCGCCATCG ACCCTCTTCG CGAGGGTCTC GTCATGTCCT
TGGCCATGAC TCTTGGTGCG AAGGGCAACT TGCTCGACAC GCAAGGCAAG GAAACGCCGC
CGGTCATGCT CGACTCCCCG GTCCTCTTCG ACTCTGAGTT GGAGCACATT AAGAACCACC
CGAAGCTGAA GACGCAAACT ATTGCCGCGC GTTACGCCGC TGGTGGTGCC GCTGGTGCCC
TCAAGGCTGG CCTTGACAAG CTTTGCGAAG AGGCCGCCGC GGCGATTCGC GCCGGCAGCG
AGTGCATCGT CATCACGGAT CGTCCGGATC AAGGTCCGGA CTCGCCCGCG ATTCCCTCGC
TTCTCGCTGT TGGTACCGTG CACCACTACT TGATCGCGCA AGGTCTTCGA ACCCGCGCGT
CTATCGTCGT GGAGTCTGCT TCGGCGTTCA GCACGCACCA CATTGCCACC TTGGTTGGTT
TCGGCGCACA CGCTGTGTGC CCGTGGTTGG CTTTGGAAAC CTGCCGGTCA TGGAGAAAGT
CCCCGAAGGT CGAGACCGCC ATCCAGCGCG GTAAGATGGG TGATGTCTCT GTGGAAGGTG
TGCAAGTCAA CTTCAAGAAT GCCCTCAACA AGGGTCTCAA GAAGATCTTG TCTAAGATGG
GTATCTCTTT GATCACCTCG TACCAAGGCG CGCAAATTTT CGAGTGCTAC GGTCTTGGGC
CTGAAGTCAT CAACACCGCC TTCAAGGGCA CCGTTTCCCG CATCGGTGGT CTCACCATGG
ATGAAGTTGC CGCGGAGACG CACATGTTTG TCCAGTCCGC TTTCCCGGGT GAGGCTGAAG
AGATGGCCAA GGTTGAGGCG CGCGGTATGT TCCAAGTCAA GCCGGGATTG GAATACCACG
GCAACAACCA AGAGATGTCT AAGCTTCTTC ACAAGGCTGT TGGCCTCGGT GGTGGTGAAA
AGAATGATGA GTTCTGGAGC GCCTACCAAG CGCACCGCAA CGATCGTCCG TACACGTGCT
TGCGCGATCA ACTCGAAATC AAGTCTGACC GCCAACCGAT CTCCGTCGAT GAGGTCGAAT
CCGTCGCTGA CATTTGCACG CGCTTCTGCA CGGGTGGTAT GTCTCTCGGT GCTATCTCCC
AAGAGTGCCA CGAATCTATC GCCATCGCGA TGAACCGCAT CGGTGGTAAA TCCAACTCTG
GTGAAGGTGG CGAAGACCCG AAGCGATTCG AAACCATCAC TGACGCCACC GCGGATGGCA
AGTCTGAAAC GTTCCCGTAC CTTCGAGGCA TGGAGAATGG CGACGTCGCG TCTTCCGCTA
TCAAGCAAGT CGCTTCCGGT CGCTTTGGTG TCACGACGTC GTTCTTGATG TCTGCCAACC
AGACCGAAAT CAAGGTGGCG CAAGGTGCCA AGCCGGGAGA AGGTGGTCAG CTTCCGGGTA
AGAAGGTTTC CCCGTACATT GCCTGGCTCC GCCGATCCAA GGCTGGTGTC CCGCTCATCT
CCCCGCCGCC GCATCACGAC ATCTACTCCA TTGAGGATCT CGCGCAGCTC ATCTATGACT
TGCACATGGT CAACAAGAAC TCGAAGGTGT CCGTGAAGCT CGTGTCCCAA GCGGGCATCG
GCACGGTGGC GTCCGGCGTC GCCAAGGCGA ACGCCGACAT CATCCAAATT TCGGGTGGCG
ATGGTGGTAC CGGCGCGTCT CCTTTGTCGT CCATCAAGCA CTGCGGTGGT CCGTTGGAGA
TGGGTCTCGT CGAATCGCAC AGAACTCTCG TTGAGAATGG CCTTCGCGAG CGCGTCGTCT
TGCGCGCCGA TGGTGGCTGC CGCTCCGGTC TTGACGTCAT CCAAACCGCT CTCATGGGTG
CCGATGAATA CGGTTTCGGT ACCGTTGCGA TGATTGCCAC TGGCTGCGTC ATGGCTCGTA
TTTGCCACAC CAACAACTGC CCCGTTGGTG TTGCGTCCCA GCGCGAAGAG CTTCGCGCGC
GCTTCCCCGG TGCGCCAAGC GATCTTGTCA ACTTCTTCAT GTACGCCGCG CAAGAAGTGC
GCGAGATCCT CGCCCAAATG GGTTACAGAT CTCTCGATGA GATCATCGGT CGCAACGACT
TGCTCAGCCA AATTGACAAG GCGCCGGCGA AGACTTCGTC TCTCGACTTG TCCTTCCTCA
CCACGTCCTC TGGCGAGGCT GGCGCTTCCT CGGACCGCAT CGCGCAACCG GTGCACAACG
ACGGTATCGT TCTCGATGAC AAGATCCTTA GCGATCCGGA AGTCCAAAAG TGCATCGAAA
CCGAAGGCAC GTACACGAAG AAGGTGGAGA TTGTCAACGT CGACCGTTGC GCGACGGCGC
GCGTCGCCGG TCAAATCGCC AAGAAGTACG GCGACAATGG CTTCGCTGGT TCTCTCACCT
TAGACATCGA GGGTTCCAGC GGTCAATCTT TCGGTGCTTT CGTTGTCGGT GGCCTGAAAG
TGCGACTTGT GGGTGAAGCG AACGATTACG TGGCGAAGAG CATGAGTGGC GGTGAGATTG
CCATCATGCC TCCTCCGAAC TCTCCGTTCG CGCCGGAGTC GGCGAGCATC GCGGGTAACG
CGTGCTTGTA CGGCGCCACT GGTGGTCAAG TGTTTATCAG CGGTCGCGCT GGTGAACGCT
TCGCCGTCCG TAACTCGCTC GGTGAAGCGG TCGTTGAAGG CACTGGCGAC CACTGCTGCG
AATACATGAC GGGTGGTTGC GTCGTCGCGA TCGGCAAGGT TGGCCGCAAC GTTGGCGCGG
GTATGACTGG TGGCATCGGT TACTTCCTCG ACGAAGACGG TACGTTCGAA TCCAAGGTGA
ACGGCGAGAT TGTCGCCATG CAGCGCGTGA TCACGCCGGC GGGTGAGGCC CAACTCAAGG
GTCTCATCTC CGCGCACGCC GAGAAGACGA ACTCGCCGAA GGCGAAGGCT ATCCTCGCTG
ACTGGGCCAA CTATTTGCCC AAGTTCTGGC AGTTAGTTCC GCCGTCTGAG GCGAACACGC
CGGAGGCGAC GAACGATGTT AAGGCTGGAG TTGAAGCCAC TGCTTAAATC CATAGCGCGG
CGCGGCGTCG TACGCAAAAA ATGCTTGCAA GCGTTCATGA GTTGAGGAAT TTAGAAACAG
T
 
Protein sequence
MQAMGARVSP VAVTDARARR HRARVGAKNG GAVDNTQLLE EHDACGVGFI ASLKGERTHK 
TVKDSLMAVG CMEHRGACSA DNDSGDGVGV MTHIPWKLLD KWCAANGISG FSEGSSAVGM
VMLPTDGAKA AEAKKILEAS CVAEGLKVLG WRAVPVDNSV VGPLAKMTCP VHEQILVDGA
GLEREELERK LFIARKTCEK SASSDAVLAE SFYICTLSSR TIVYKGMLRS AVLGKFYKDL
EDPDYESQFS IYHRRFSTNT TPKWPLSQPM RFLGHNGEIN TLQGNLNWMA SKEADMENPI
WGGREPEFRP ICNPAASDSA NLDRVAELLV RTGRAPAETM MLLVPEAHRN HPELDATFPE
VHDFYDYYAG MQEAWDGPAL LVFSDGKQLG ARLDRNGLRP ARFWRTSDDY IYVASEVGVL
GDVMSNASNV VSKGRLGPGM MIYADLETGE FKENTEIAKE VSARLPYGEW MKAIDRVKGI
EPIGATQLDP IQLIECQARA GYAAEDITMI IESMASDAIE PTWSMGDDTP MPVLSGRPRL
LYDYFKQRFA QVTNPAIDPL REGLVMSLAM TLGAKGNLLD TQGKETPPVM LDSPVLFDSE
LEHIKNHPKL KTQTIAARYA AGGAAGALKA GLDKLCEEAA AAIRAGSECI VITDRPDQGP
DSPAIPSLLA VGTVHHYLIA QGLRTRASIV VESASAFSTH HIATLVGFGA HAVCPWLALE
TCRSWRKSPK VETAIQRGKM GDVSVEGVQV NFKNALNKGL KKILSKMGIS LITSYQGAQI
FECYGLGPEV INTAFKGTVS RIGGLTMDEV AAETHMFVQS AFPGEAEEMA KVEARGMFQV
KPGLEYHGNN QEMSKLLHKA VGLGGGEKND EFWSAYQAHR NDRPYTCLRD QLEIKSDRQP
ISVDEVESVA DICTRFCTGG MSLGAISQEC HESIAIAMNR IGGKSNSGEG GEDPKRFETI
TDATADGKSE TFPYLRGMEN GDVASSAIKQ VASGRFGVTT SFLMSANQTE IKVAQGAKPG
EGGQLPGKKV SPYIAWLRRS KAGVPLISPP PHHDIYSIED LAQLIYDLHM VNKNSKVSVK
LVSQAGIGTV ASGVAKANAD IIQISGGDGG TGASPLSSIK HCGGPLEMGL VESHRTLVEN
GLRERVVLRA DGGCRSGLDV IQTALMGADE YGFGTVAMIA TGCVMARICH TNNCPVGVAS
QREELRARFP GAPSDLVNFF MYAAQEVREI LAQMGYRSLD EIIGRNDLLS QIDKAPAKTS
SLDLSFLTTS SGEAGASSDR IAQPVHNDGI VLDDKILSDP EVQKCIETEG TYTKKVEIVN
VDRCATARVA GQIAKKYGDN GFAGSLTLDI EGSSGQSFGA FVVGGLKVRL VGEANDYVAK
SMSGGEIAIM PPPNSPFAPE SASIAGNACL YGATGGQVFI SGRAGERFAV RNSLGEAVVE
GTGDHCCEYM TGGCVVAIGK VGRNVGAGMT GGIGYFLDED GTFESKVNGE IVAMQRVITP
AGEAQLKGLI SAHAEKTNSP KAKAILADWA NYLPKFWQLV PPSEANTPEA TNDVKAGVEA
TA