Gene OSTLU_46754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46754 
Symbol 
ID5004193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp587932 
End bp589291 
Gene Length1360 bp 
Protein Length390 aa 
Translation table 
GC content58% 
IMG OID640419614 
Productpredicted protein 
Protein accessionXP_001420223 
Protein GI145351736 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGAACGCGC GCGCGGCGCG CGTGCGTGGT GCAGTCACCA TCACGGCGGC GCACCGCCCG 
GACGCGGACG CTGAAGATGC GGGCAAGGGT CAACGCATTA TGATCATCGG CGGCGACGGC
TACTGCGGCT GGGCGACGGC GTTGCACTTG TCCAAGCGCG GCTACGCGGT CTCCATCGTC
GACAACTTGT GCCGTCGTTC TATGGACGAC CAGCTCGGTT TCAACTCTCT GTCTCCGATT
AAGGGTATCC ACGAGCGCGT GCGCAAGTGG AAGGAAGTCA GCGGCCGCGA CATCGACCTC
TTCGTCGGCG ACGTCTGCGA CTACGAATTC TTGGGCGCCG CGTTCAAAAA GTTTGCGCCG
ACGGCGTGCG TTCACTTCGG TGAGCAGCGC TCCGCGCCGT ACTCCATGAT GGACCGCACC
CGCGCCGTGT TCACGCAAAC GAACAACGTC ATGGGTACCA TCAACGTGTT GTACGCCATC
AAGGAATTCG CCCCGGAATG CCACTGCATC AAGCTCGGCA CGATGGGCGA ATACGGCACC
CCGAATATCG ACATCGAAGA AGGTTACATC ACCATCACCC ACAACGGCCG CACCGACACG
TTGCCGTACC CGAAGCAAGG TGGTTCTTTC TATCACTTGT CCAAGTGCCA CGACTCCGCG
AACATGTTGT TCTGCACCAA GGCGTGGGGC ATTCGCACCA CCGACTTGAA CCAAGGCGTC
GTCTACGGTC TCTCCACGGA TGAAACCGAC ATGCACCCGG ACTTGGTGAA CCGCCTCGAC
TACGACGCCG TCTTCGGTAC CGCGCTCAAC CGCTTCTGCA TCCAAGCGGC CGTCGGTCAC
CCGATGACCG TGTACGGTAA GGGTGGCCAA ACCCGCGGTT TCTTGAACAT CAGAGACACC
GTGCGCTGCA TTCAAATCGC GTGCGACAAC CCCGCGCCGC CGGGCGAGAT GAAGATTTAT
AACCAATTCA CGGAGCAATT CTCCGTGAAC GAACTCGCTG CTATGATCAC GGAAGCCGGT
AAGAAGGTTG GACTGAATCC AGAGGTCATC ACCGTGCCCA ACCCGCGCAC GGAGATGGAG
GAGCACTACT ACAACGCCAA GAACTCCAAG CTTCAAGATC TCGGCCTCGA GCCAATCGCG
ATGCGCGGCG AATTCTTGGA GGGCTTGTTG AAGCAAATCA TCACCTACAA GGATCGTGTC
GACCAGCGAC TCATTCTTCC GGGAGTGAAC TGGAAGGAAT CCGCTTCCGT CGCGGAAGTC
ATTGCTAAGT AAGCATTTAG GAGGCCTATA GCGAGTCTCG CGAACCAGAA AAAGTTGTTC
AATCAAGCCG TTGTCAATAT GACCCAACAT CATTCATCGA
 
Protein sequence
MIIGGDGYCG WATALHLSKR GYAVSIVDNL CRRSMDDQLG FNSLSPIKGI HERVRKWKEV 
SGRDIDLFVG DVCDYEFLGA AFKKFAPTAC VHFGEQRSAP YSMMDRTRAV FTQTNNVMGT
INVLYAIKEF APECHCIKLG TMGEYGTPNI DIEEGYITIT HNGRTDTLPY PKQGGSFYHL
SKCHDSANML FCTKAWGIRT TDLNQGVVYG LSTDETDMHP DLVNRLDYDA VFGTALNRFC
IQAAVGHPMT VYGKGGQTRG FLNIRDTVRC IQIACDNPAP PGEMKIYNQF TEQFSVNELA
AMITEAGKKV GLNPEVITVP NPRTEMEEHY YNAKNSKLQD LGLEPIAMRG EFLEGLLKQI
ITYKDRVDQR LILPGVNWKE SASVAEVIAK