Gene OSTLU_36053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36053 
Symbol 
ID5000198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp776334 
End bp777512 
Gene Length1179 bp 
Protein Length393 aa 
Translation table 
GC content62% 
IMG OID640415619 
Productpredicted protein 
Protein accessionXP_001416243 
Protein GI145342547 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.630596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACCG TCGACGCCAA ACCCGCCGTT CTCGCGCGTT GGCGCCGCCC ATCGCCTCGA 
ACGTCGTCGC GCGCCGCGTC ATCGCGCGAT CCGAGCGACG CGCGACCTGC AATTTTGGGT
TTCGGCAGCG CCGGCGTGGA CTACATAGCC AAACTCGACG GCGCTTTCCC GACGCCTGAC
GCCAAAACGC GCGCGTCCGA GCTCGAAATC GTGGGCGGAG GGAACTGCGC GAACGCGCTC
GTCGCCGCTT CGCGCCTTGG CGCGCGCACG GCGCTGGTCT CGAAAGTGGG AACCGACGGG
GTGGGGACGC AGATTTTGAC AGAGCTGGGC GAACGCGAAG GCGTCGACGT GTCTCATGTC
GTACGACGCG GGAACAGGTC GCCTTTCACG TACATCATGG TCACATCGTC GTCAAACGGA
GATGGAGAGT CCACGCGAAC GTGCGTGCAC ACGCCTGGGG AGACGTTGGA GGTGGAAGAG
TTGGGCGACG TCGCCGCGCT GCTGGAAGCG GTTCATCCGG ATGTCGTCTT CTTCGATGGA
CGACTCACAG AGAGCGCTAT CGCGCTCGCG CGCGTCGCGG AAACGCGTGG AATTAGGGTG
CTCGTCGAGT GTGAACGATT GAGAGATGGA CTAGACGAAC TTGTGCGGCT TGCGGACGTC
GTGGTGACGT CGAAGAATTA CCCGCTTGAT AGATTTACGG AGACGAAGAC GCTAGGAGAC
GCGATGACAG AAATGTTTGC GTGTTTGCCG AAGGCAAAAG TGATGGTGAC GACGCTCGGC
GCGCGAGGGG CGGTAGCTTT GGTACGAGAT GGTGTCGAAA CTCCGGAAGT GGGGGAAGGG
ACGGCGTTGG ATGACGTCGT GTCGAGATTG GAGAACGCGG CCCTCCGCGG CGATGACGAG
ACGCCCGGAC CGAGCGTGGA GACGGAATCC TTGGTAATCC GAGACGCAAG CGGCGAACGA
CGTTTCAAGG CAAAAGTTGT GTTCACGCCG GCGAAACGTT TGACGGACAA CCAAGTGGTC
GACACCACCG GGGCAGGCGA CGCGTTCATA GGCACGCTCG CAATGTCGGC GTGCTCGGAG
GATTTCAACG TCGCCAGCGC GATGCGCCTT GGCGCATACG TTGCGGCGAC GAAATGCGGT
GGCATTGGAG CGCGAAGCGC ATTGCCGCAT CGCAAAGAT
 
Protein sequence
MRTVDAKPAV LARWRRPSPR TSSRAASSRD PSDARPAILG FGSAGVDYIA KLDGAFPTPD 
AKTRASELEI VGGGNCANAL VAASRLGART ALVSKVGTDG VGTQILTELG EREGVDVSHV
VRRGNRSPFT YIMVTSSSNG DGESTRTCVH TPGETLEVEE LGDVAALLEA VHPDVVFFDG
RLTESAIALA RVAETRGIRV LVECERLRDG LDELVRLADV VVTSKNYPLD RFTETKTLGD
AMTEMFACLP KAKVMVTTLG ARGAVALVRD GVETPEVGEG TALDDVVSRL ENAALRGDDE
TPGPSVETES LVIRDASGER RFKAKVVFTP AKRLTDNQVV DTTGAGDAFI GTLAMSACSE
DFNVASAMRL GAYVAATKCG GIGARSALPH RKD