Gene OSTLU_31076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31076 
Symbol 
ID5001623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp260326 
End bp261835 
Gene Length1510 bp 
Protein Length488 aa 
Translation table 
GC content56% 
IMG OID640417044 
Productpredicted protein 
Protein accessionXP_001417460 
Protein GI145345947 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCGCGCGAG ACGGCGAGCA TGGGGACGGA TGATGAGGTG CGAGTGGTGA GCTCGTTAGA 
AGAGATATAT GACGCCGCGT CGCTGGAGAC GCACACGCGC GCGCGGTACG CGACGGTGCG
CGATGCGTTC GTGAAGGCGT ACGGACGCGA GCCGGACGCG TTCGCGAGGT CGCCGGGACG
AGTGAATTTA ATCGGCGAAC ACATCGATTA CGAGGGGTAC TCCGTGCTGC CGATGGCGAT
CGGGCTCGAC ACTATCGTCG CCATCTCCGT GAATGCGAGT TCGGGGAAGA TCGCGGTCGG
GAACACGAAC GAAAAGTACA CGCCGAAGAC ATTCGAGAGC TCGCCCGAAC AAGACGTGGA
CGCGGCGTCT TTGCACTGGA CGAATTACGT CATGTGCGGC TACAAGGGGG TGTTCGATTT
CTTGAAAGAG AGCGATAAGG CATCACCCGC CCCCGTTGGG TTGGACATCA TCGTCGACGG
CACGGTACCC ACTGGAAGCG GTTTGAGTTC GTCCTCGGCG TTGTGCTGCG CCGTGGCGGT
GGCGGTGATG CACGCGCTAG GATTGAATTT CACTCAAGGT GAAATTGCTG ACTTCACGTG
CAAGTGCGAA CGATACTCAG GAACGCAGTC GGGGGGTATG GATCAAGCTA TTTCCATCAT
GGGCGAAGCT GGTGTGGCAA AATTGGTCGA TTTCAATCCC ATAAGCACCA ACGACGTCAA
CCTTCCGGAG GAAGCGGCGT TCATCATAGG CAACTGCCTC GCAGTGAGCA ACAAAGCGGA
GACCGCACAC GAGCGCTATA ATTTGCGCGT CGTAGAGTGC CGTCTTGCGG CGATTATTTT
AGGTTTAAAG CTAGGTATGA ACGCGGAAGA AGCGTCAAAA ATAGAGACGC TCAAGGAAAT
CGAAGACTTT GTCGGCTCCA TGTCTGCCGC TAAGGCTGCG GCCGAGGAAC ATTTGCACGA
GGGATACTAC GATGCAAGAG AGATTGAAGA ACTCATAGGA GTAGAAGCAT TCATGGACGT
CTTCTCTTCA CCAGCGTCGA AGTTGGTCTT GAGTCACAAC GAGAAGGGAT ATAAGCTTCT
GGCGCGGACG TTGCACGTCT ACTCCGAGGC CGGTCGTGTG CACTTGTTCG CTGCGGCGTG
CGCGATGAAG GTCGACCCAA CGGAGCTGGG CGTGTACATG AATGGTAGCC ACGAATCTTG
TAGAGCCCTG TACGAGTGCT CTTGCGCGGA GCTGGATGAA CTCGTGGATG CATTTAGAGC
GGCGGGTGCT CTGGGCGCGC GTCTTACTGG TGCTGGTTGG GGCGGTTGTG CCGTAGCAAT
TGTCGCCAAG GATGCGGTAG AGAGTGTTCT GAAAGCGGTG CACGAGTCTT TCTACTCTTC
TCGCATCGCT GCGGGCCTTA TTTCTGCTGA CAATATGGCG ACGACGCTCT TCGCAACGCT
GCCCAGCTCT GGTGCGGCAA TTTTGAAAGG CGTTTCGTTC GCTTAGATCG GTATTTGATA
CGTGCCGAAC
 
Protein sequence
MGTDDEVRVV SSLEEIYDAA SLETHTRARY ATVRDAFVKA YGREPDAFAR SPGRVNLIGE 
HIDYEGYSVL PMAIGLDTIV AISVNASSGK IAVGNTNEKY TPKTFESSPE QDVDAASLHW
TNYVMCGYKG VFDFLKESDK ASPAPVGLDI IVDGTVPTGS GLSSSSALCC AVAVAVMHAL
GLNFTQGEIA DFTCKCERYS GTQSGGMDQA ISIMGEAGVA KLVDFNPIST NDVNLPEEAA
FIIGNCLAVS NKAETAHERY NLRVVECRLA AIILGLKLGM NAEEASKIET LKEIEDFVGS
MSAAKAAAEE HLHEGYYDAR EIEELIGVEA FMDVFSSPAS KLVLSHNEKG YKLLARTLHV
YSEAGRVHLF AAACAMKVDP TELGVYMNGS HESCRALYEC SCAELDELVD AFRAAGALGA
RLTGAGWGGC AVAIVAKDAV ESVLKAVHES FYSSRIAAGL ISADNMATTL FATLPSSGAA
ILKGVSFA