Gene OSTLU_39789 details


Gene Information

Locus tag: OSTLU_39789
Symbol:
ID: 4999955
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 913186
End bp: 914186
Gene Length: 1001 bp
Protein Length: 301 aa
Translation table:
GC content: 63%
IMG OID: 640415376
Product: predicted protein
Protein accession: XP_001415636
Protein GI: 145341065
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0676] Uncharacterized enzymes related to aldose 1-epimerase
TIGRFAM ID:
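
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon; the function names are hypothetical and are not part of any database API.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values taken from the record above.
gene_len = inclusive_span(913186, 914186)   # matches the listed Gene Length
cds_len = coding_length(301)                # coding bases implied by Protein Length
non_coding = gene_len - cds_len             # bases in the span not accounted for by coding

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span is 1001 bp and the coding portion is 906 bp, so about 95 bp of the span is non-coding, which agrees with the record marking the gene as spliced.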


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCTT CGTTTGCCAC CGTCCAACCG AGCGTCCGCG CGCGCGCGAC GCTCCGATCC 
CGCGCGAGAC GCGCCGATCG ATCGTCGATC GTCGTCCGCG CGGGCAGCGC GGCGCAACAA
AAAGGTCTCG GCGATCTCGA CACCGTCAAG CTCACCGCCG CCGACGGTTC CACCGCGGAC
GTGTACTTGT TCGGCGGCGT CGTGACGAGT TTCAAGCCCA AGGGCGGTGA CGACGTGCTG
TACGTGCGCC CGGATGCGAA GTTTGACAAG GTGCGTCGGC GGCGAAGCGA CGCGCGAGAC
GCGCTCGGAC GATGAGCGAA GCGATGGAAG ACTGACTGCG GTGTGGTAAA CACAATCGCA
CGTAGAGTAA GCCGATTTCT GGTGGTTTGC CGCACTGCTG GCCGCAGTTC GGCCCGGGGG
CGATTCAAGT GCACGGATTC GCGCGCAACG TCGACTGGAC GCTCGTGAGC ACGACGGATG
GCGACGAACC GTCGATGACG ATGGAACTCA CGCCAAATGA TTACACCAAG GCGATGTGGG
ATAAGGATTT CAAGGTGACG GAAACCGTCA CGCTCAAGGG CGGCGCGCTC GAGGCGAAGC
TCGTGGTTGA GAACAAGGGC AAGGAAGCGT TCGATTTCAC TGGTTCGTTC CACACGTACT
TGAGCGCCGA CATCAACGCC GCCGCCGTCG GCGGGTTGAA CGGCTGCAAG ACGTTAGATC
GACTCGCGGA GAAGGAATCC ACCGTCTCTG GTGACGTCAA ATTCCAAGGA CCGATCGACA
GCGTGTACTA CGGCGTTCCG GAGACGCTTA CGCTCGCCAC GGGCAAGCGC ACTGTGAGCA
TCAAGTCGAG CAAGACGTGG ACAGAAGCCG TGGTGTGGAC GCCGTGGACG GACATGGAGG
CGTGCTACAA GGAGTTCGCG TGCGTCGAAA GCGCCGCCGT GACTCCGGTC GTCGTCGCTC
CGGGCGGCTC TTGGACCGCC ACCACGACGA TTTCCGCGTA A
 
Protein sequence
MSASFATVQP SVRARATLRS RARRADRSSI VVRAGSAAQQ KGLGDLDTVK LTAADGSTAD 
VYLFGGVVTS FKPKGGDDVL YVRPDAKFDK SKPISGGLPH CWPQFGPGAI QVHGFARNVD
WTLVSTTDGD EPSMTMELTP NDYTKAMWDK DFKVTETVTL KGGALEAKLV VENKGKEAFD
FTGSFHTYLS ADINAAAVGG LNGCKTLDRL AEKESTVSGD VKFQGPIDSV YYGVPETLTL
ATGKRTVSIK SSKTWTEAVV WTPWTDMEAC YKEFACVESA AVTPVVVAPG GSWTATTTIS
A
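
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the way the database itself computes or rounds GC content is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGTCCGCTT CGTTTGCCAC CGTCCAACCG AGCGTCCGCG CGCGCGCGAC GCTCCGATCC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length matches the listed Gene Length, which is a quick check that nothing was lost in copying.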