Gene OSTLU_26284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26284 
Symbol 
ID5004183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp539295 
End bp542542 
Gene Length3248 bp 
Protein Length976 aa 
Translation table 
GC content59% 
IMG OID640419604 
Productpredicted protein 
Protein accessionXP_001420206 
Protein GI145351701 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0122884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.247759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGC TCATCGCATC CGGCGCGCGG GCGGTCTCGA CCGAAGCGCT GAAGCCGCTG 
GACACGTTCG AGCGACGACA TAACTCCGGA ACGACGCAAG AAGTGGCGGA GATGTGCGCG
GTGATCGGGT TCAAGGACAT CGACGCGCTG ATCGACGCGA CGGTGCCGGA AAATATTCGC
CTGAAGAAGA CGATGGACAT GGGCGAGTAC ACGCAGCCGC TCACGGAGAG CGAATTCTTG
ACGATGATGA AGAACATGGC GAGCAAGAAT AAGGTTTTTA AGAACTACAT CGGGACCGGG
TATCACGGCA CGCACGTGCC GACGGTGATT TTGCGTAACA TTTTGGAAAA TCCGGGGTGG
TACACGCAGT ACACGCCGTA CCAGGCGGAG GCGTCGCAAG GACGCCTGGA ATCGTTGTTG
AACTTTCAAA CGATGATTAC CGACTTGACC GGCATGCCGC TGTCTAACTC GTCGTTGTTG
GACGAAGGCA CGGCTGCGGC GGAGGCGATG ACGATGTGCT CTGCGTTGAA CCGCGGTAAG
AAGCCGAAGT TCTACGTGTC GAACAAGTGC CACCCGCAAA CCATCGCGGT GGTGCAGACG
CGCGCCGAAG GTTTGGGTCT CGAAGCCGTC GTGGGCGATG AGAACTCTTT CGATTACACC
GCGAAGGATG TCTGCGGCGT TCTCGTGCAA TACCCGGCCA CGGATGGTTC GATCATCGAC
TACAAGCCCA TCGTGTCCCA GGCGCAAGCC AACGGCATTC GCGTCGTCGC CGCCGCCGAC
TTGTTGTCGC TCACCATGTT ACAACCTCCG GGTGAATGGG GTGCTGACAT CGTCATCGGT
TCATCTCAGC GATTCGGCGT GCCCATGGGC TACGGTGGTC CGCACGCTGC CTTCTTGGCG
ACGACGCACG ACTGCAAGCG TTTGATGCCG GGCCGCATCA TCGGCGAGTC CATCGACGCC
GAAGGCAAGC CGGCGCTTCG CATGGCGATG CAAACGCGCG AGCAACACAT CCGTCGTGAC
AAGGCGACTT CGAACATTTG CACCGCGCAA GCGTTGTTGG CCAACATAGC TGCCATGTAC
GGTGTTTACC ACGGTCCGGA GGGCTTGAAG CAAATCGCCA AGCGCTCGCA CGACTTTGCC
GCCGTCTTCG CCGCCGGTGC CGAAAAGCTT GGCTTCAAGA ACACCACCCC GGAGTTCTTC
GACACCGTCA CGCTGAAGTG CCCGAGTGGC GCGGATGCCA TCGTCAAGGC GTGCGCGTCC
GCTGGCATCA ACATTCGCAA GATGGACGCC GACCACGTCT CTTTGGCGTT TGACGAAACC
ACAGAAATCG CCGACGTCGA CGCTCTCTTC AAGGTGTTCG CTGGTGGCGC TGCCGCGCCC
ACCGTCGCGC AAGTTGCGCC GTCTGTGAAC ACGACCATGC CGATGGCGCG TAAGTCTGAA
TTCATGACCC ACCCGGTGTT CAACCAGTAC CACAGTGAAC ACGAGATGGT GCGCTACCTC
AAGCGCTTGG AAGAGAAGGA TCTCTCCTTG GTTCACTCCA TGATCGCTCT CGGCTCTTGC
ACGATGAAGC TCAACGCAAC GACTGAAATG ATTCCGATCA CGTGGCCGGA GCTTGCGAAC
ATTCACCCGT TCGCGCCGAA AGATCAAACG CTTGGTTACC AAGAGATGTT CCGCGGTCTC
GAAAAGCAAC TCTGCGAGAT CACCGGCTTC GACGCCATGT CCCTCCAGCC GAACTCTGGT
GCGTCTGGTG AGTACGCTGG TTTGATGGGT ATCCGTGCCT ACCACCAATC TCGCGGTGAC
CACCACCGTG ACGTGTGCAT CATCCCGGTT TCCGCGCACG GTACCAACCC GGCGTCCGCC
GCGATGTGCG GCATGAAGAT CGTCGTCATT GGCACCGACG CCAAGGGTAA CATCAACGTC
GCCGAGCTCA AGGCTGCGGC CGAAAAGCAC TCCGCGAACT TGGCGGCTCT CATGGTTACG
TACCCGTCGA CGCACGGTGT CTACGAGGAA GACATCAAGG AAATTTGCGA AGTCATTCAC
CAACACGGCG GTCAAGTGTA CATGGACGGC GCCAACATGA ACGCCCAAGT CGGTTTGACT
TCTCCGGGTT TCATTGGTGC GGATGTGTGC CACTTGAACT TGCACAAGAC TTTCTGCATT
CCGCACGGTG GTGGTGGCCC GGGTATGGGC CCGATCGGTG TCAAGGCGCA CTTGGCGCCC
TTCATGCCGG ATCACCCGTC CATGAAGGAT GGTGCCGTCG CCGTCGGCGG GGACAAGCCC
TTCGGTGTCG TCGCGGCTGC CCCGTACGGA TCCGCGCTCA TTTTACCGAT TTCCTTCTCT
TACATCGCTA TGATGGGTTC CGAAGGTTTG GCGAACGCGT CTAAGCGCGC CATCTTGAAC
GCCAACTACA TGTCCAAGCG CTTGGAGGAT TACTACCCGG TGCTCTTCAG CGGCAAGAAC
GACACGTGCG CGCACGAGTT CATCCTCGAC ATGCGCCCGA TCAAGGATGC CACCGGCGTC
GAAGTCGCGG ACATCGCGAA GCGCTTGATG GATTACGGTT TCCACTCGCC GACGATGTCT
TGGCCGGTCG CCGGTACGTT GATGATTGAG CCGACCGAGT CCGAGTCCAA GGCGGAGCTC
GATCGATTCT GCGACGCTCT CATCGCGATT CGTGGTGAAA TCCGCGACAT TGAGGACGGT
AAGGTGGACC GCGAGAACAA CGTTCTCAAG AACGCCCCGC ACACCGCGGA GGTCGTCACC
GCGAAGGAGT GGAACCGCCC GTACCCGCGC GATCTCGGTG CGTTCCCGGT TGAATGGACT
CGCTCTCACA AGTTCTGGCC GCAAACCTCT CGCATCGACG ACGTCTACGG CGACAGAAAC
CTCGTCGCGA GCCGCGCGGC TGTGGAAGTC GCCGTCGCTC AAACCGCTTA AAAATCACAT
TTTGCACTCG TTAGGAGAAA TGTAATTCAT CACGCCTTAA TTCACTATTC TGACAAGCTA
AAAATTCTAA ATCAACCGCA AGTGTTCTTG TGCGTCGTCC CGGCGCCCAT CCGTCGGTGC
GTAAAACGTC CCCGCGGCTA ACGCCGAGGC CCACACCACC ACACACACGC ACGCCGCTAG
TTGTAAGTAC ATCGTGGACG AGCTATTCTC ATCCACTGAT GTGTCCTCTA CGTCAGAGTA
TTCCTCCAGA CCGGCGCGAC CGCCCTTGGT CCACGACTCG CCCTCGCCGT CTAAACCTTG
TGAGTCGT
 
Protein sequence
MTSLIASGAR AVSTEALKPL DTFERRHNSG TTQEVAEMCA VIGFKDIDAL IDATVPENIR 
LKKTMDMGEY TQPLTESEFL TMMKNMASKN KVFKNYIGTG YHGTHVPTVI LRNILENPGW
YTQYTPYQAE ASQGRLESLL NFQTMITDLT GMPLSNSSLL DEGTAAAEAM TMCSALNRGK
KPKFYVSNKC HPQTIAVVQT RAEGLGLEAV VGDENSFDYT AKDVCGVLVQ YPATDGSIID
YKPIVSQAQA NGIRVVAAAD LLSLTMLQPP GEWGADIVIG SSQRFGVPMG YGGPHAAFLA
TTHDCKRLMP GRIIGESIDA EGKPALRMAM QTREQHIRRD KATSNICTAQ ALLANIAAMY
GVYHGPEGLK QIAKRSHDFA AVFAAGAEKL GFKNTTPEFF DTVTLKCPSG ADAIVKACAS
AGINIRKMDA DHVSLAFDET TEIADVDALF KVFAGGAAAP TVAQVAPSVN TTMPMARKSE
FMTHPVFNQY HSEHEMVRYL KRLEEKDLSL VHSMIALGSC TMKLNATTEM IPITWPELAN
IHPFAPKDQT LGYQEMFRGL EKQLCEITGF DAMSLQPNSG ASGEYAGLMG IRAYHQSRGD
HHRDVCIIPV SAHGTNPASA AMCGMKIVVI GTDAKGNINV AELKAAAEKH SANLAALMVT
YPSTHGVYEE DIKEICEVIH QHGGQVYMDG ANMNAQVGLT SPGFIGADVC HLNLHKTFCI
PHGGGGPGMG PIGVKAHLAP FMPDHPSMKD GAVAVGGDKP FGVVAAAPYG SALILPISFS
YIAMMGSEGL ANASKRAILN ANYMSKRLED YYPVLFSGKN DTCAHEFILD MRPIKDATGV
EVADIAKRLM DYGFHSPTMS WPVAGTLMIE PTESESKAEL DRFCDALIAI RGEIRDIEDG
KVDRENNVLK NAPHTAEVVT AKEWNRPYPR DLGAFPVEWT RSHKFWPQTS RIDDVYGDRN
LVASRAAVEV AVAQTA