Gene OSTLU_31870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31870 
Symbol 
ID5001792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp672390 
End bp674523 
Gene Length2134 bp 
Protein Length686 aa 
Translation table 
GC content58% 
IMG OID640417213 
Productpredicted protein 
Protein accessionXP_001417840 
Protein GI145346737 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.691414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.373209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGACGCGA CGCGCGCGCG CGCGCGACGA CGACGCGAGC TCGATCGCGG ATCGCGTCGC 
GATGTGCGGG ATCTTCGCGT ACAGTAATTG GAACTGCCCG AAGAGCCAGA AGGAGATCGT
CGAGAAGCTG CTCACGGGAC TGAAGCGATT GGAGTACCGC GGATACGACA GCGCGGGACT
GGCGCTCGAG GACGGGGAGG ACGTGTCGCG GACGACGGCG AAGGTGTTTC GCGAGACGGG
GAAGATCGCG AACCTGGAGG GGTTGCTGGA GGCGAGTGAG AAGGATCTGC ACGGGGATTT
GGTGTTCGAG TCGCACTGCG GCATCGCGCA CACGAGATGG GCGACGCACG GACCGCCGGC
GCCGAAGAAT TCGCACCCGC ACACGAGCGA TGAGGAGAAT GATTTTTTGG TGGTGCATAA
CGGGATCATA ACGAATCATC AGGCGCTCAG GGAGACGTTG CAGCGGAAGG GGTACATGTT
TGAGAGCGAT ACGGATACCG AGGTCATTCC AAAGTTGACA AAGTATTTAT TCGATAAATT
TCACGATAAG TGCTCGTTCA GACAGCTGGT GATGGAGGTG TTGAGACAGT TGCACGGGGC
GTACGCGCTG GCGTTTAAAT CGAGGCATTA CCCGGGGGAG TTGGTGGCGG CGAAGCGCGG
GTCGCCGTTG CTCTTGGGCA TCGCCGAGGG ACCGCATCCG GGAGAGCAGC ACGCGTTGGT
GACGAGCGAA GGCTTCGTGC CGACGTCTAA GCGCGCGAAG CGGACGTCGA TGGAATTTTA
CTTCGCTTCC GACGCATCGG CCATGGTAGA GCACACTAAG CGCGTGTTGC ACCTGGAGGA
CGACGACGTG GCGCACATTC ACAACGGAGG GTACGGCATC TATCGCATGG AGAAGATTCA
CACCGAGGGC GAAGATTCGC CGAGTTTGGC GTATGCTCCG ACGGTGAAAT CCGCTGAAGT
CGAGCGTACG ATTGAGACGC TGACTATGGA GGTTGAGCAA ATCATGAAGG GAAACTTTGA
TCACTTTATG AAGAAGGAAA TTCACGAACA ACCGGACGCG ATTCAGCAGA CGATGCGCGG
TCGCGTCGTC TTCGACGCCG ACGGAAACGT GCAACGCGTG TTCCTCGGTG GCATGGTTGA
TTACTTGTCC ACCATTCGAC GGTCACGTAG AATAATCTTG TGTGGATGCG GGACGAGTTA
TAACAGCGCC ATCGCTGTTC GTCAGCTCAT GGAAGAACTG ACCGAGTTGC CGGTGACGCT
CGAGCTCGCC TCGGACGTCC TAGATCGTCA GTGCCCGTTC TTCCGCGATG ATTCCATTAT
TTTCATCTCG CAATCCGGTG AAACCGCGGA TACTTTGCGC GCTCTCGAGT ACGCGAAGTC
CAAGGGGGCG TTGTGCATCG GGATCGTCAA CGTAGTCGGT TCGGCGATTT CCCGCGCCAC
CGATTGCGGT CTCCACATCA ACGCCGGCGC CGAAATCGGC GTTGCCTCCA CCAAGGCTTA
CACGTGCCAA ATCACCTCCA TGGTGCTCCT CGCCTTGGCT CTCAGCGAAG ATTCTCGCTC
TCGCGCTGAT CGCCGCATGG ACATCATGCG CGGCGTCGTC ACATTGCCAG ACACCATGCG
CCGTGCGCTC GAGCTCGATC AGAAAATGCT CGCGCTCGCC CGCACTCTCG TGGACGAGAA
CTCTTTGCTG TTATTCGGTC GTGGTTACAA CTACGCCACC GCCCTCGAAG GCGCCCTGAA
GGTGAAAGAA GTCGCCCTTC TTCACTCTGA AGGCATCTTG GCGGGTGAGA TGAAACACGG
TCCATTGGCG TTGGTCGACG AGACCCTTCC TTTGGTCGTC ATCGCCACGC GCGATTCCTC
CTACCTCAAG CAAAAGTCCG TCATCGAGCA GCTTCGCGCT CGCGACGCGC GCTGCATCTT
GATCGTCAGC GAAGACGATG ATTCTTTGGA CAAATTCGCC TCGAACGAAG ACATGATCAT
CAAGGTTCCC GAGGTGTGCG ACTGCTTGCA ACCTTTGATC AACATCGTCC CCTTGCAGTT
GCTCTCGTAT CACCTCACCG TCTTGCGCGG GCACAACGTC GATCAACCGC GCAACCTCGC
GAAATCGGTG ACGGTAGAAT AGACTGCACC AGGC
 
Protein sequence
MCGIFAYSNW NCPKSQKEIV EKLLTGLKRL EYRGYDSAGL ALEDGEDVSR TTAKVFRETG 
KIANLEGLLE ASEKDLHGDL VFESHCGIAH TRWATHGPPA PKNSHPHTSD EENDFLVVHN
GIITNHQALR ETLQRKGYMF ESDTDTEVIP KLTKYLFDKF HDKCSFRQLV MEVLRQLHGA
YALAFKSRHY PGELVAAKRG SPLLLGIAEG PHPGEQHALV TSEGFVPTSK RAKRTSMEFY
FASDASAMVE HTKRVLHLED DDVAHIHNGG YGIYRMEKIH TEGEDSPSLA YAPTVKSAEV
ERTIETLTME VEQIMKGNFD HFMKKEIHEQ PDAIQQTMRG RVVFDADGNV QRVFLGGMVD
YLSTIRRSRR IILCGCGTSY NSAIAVRQLM EELTELPVTL ELASDVLDRQ CPFFRDDSII
FISQSGETAD TLRALEYAKS KGALCIGIVN VVGSAISRAT DCGLHINAGA EIGVASTKAY
TCQITSMVLL ALALSEDSRS RADRRMDIMR GVVTLPDTMR RALELDQKML ALARTLVDEN
SLLLFGRGYN YATALEGALK VKEVALLHSE GILAGEMKHG PLALVDETLP LVVIATRDSS
YLKQKSVIEQ LRARDARCIL IVSEDDDSLD KFASNEDMII KVPEVCDCLQ PLINIVPLQL
LSYHLTVLRG HNVDQPRNLA KSVTVE