Gene OSTLU_2583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_2583 
Symbol 
ID5004510 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp412890 
End bp414290 
Gene Length1401 bp 
Protein Length453 aa 
Translation table 
GC content56% 
IMG OID640419931 
Productpredicted protein 
Protein accessionXP_001420331 
Protein GI145351968 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.824744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0253902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTCCCACCGT CGTTCGCGTT CGGCGTCGGA ACGTCGGCGT GGCAGATCGA AGGAAACGGT 
GGTGATCGCC CGCGATCGGT GTGGGACGCG TTCGTGAGCG AACTAGGAGA GGAGAAGAGA
GTGGAGGCGG AGCGCGGGAT CGGCTTTCAC GAGCGATACG CGGCGGATGC GCAAATGATG
GCGGATGCGG GAGTGAAACA TTTCAAAATG TCCCTGAGCT GGCCGAGGTT GATGCGCGCC
GACGGGAGCG CGATCGATGA AGGGTTCGAG TATTATCAGA ACGTTTTCGG GGCGCTACGC
GAGCGAGGCG TGGAACCGCA CGTGACGCTG TTTCACTGGG ACACGCCGAT GTGGTGCTGC
GCCAACGAAA CGATCGCGAG CGGGCGTGGG AGCGTGTGCG AAGGAGCTTG GGTGAAGGAT
GAAATCTTGA AAGATTTTGA GAAGTACGCG GATGCGGTGT TTAGTAGACT CGGGAAGGGG
ATAAAATATT GGACCACAAT TAGTGAGCCA AAAACCGTCG CAGAGATGGG CTACGGTGCG
GGCCTTCACG CGCCTGGGCG TCGGAGCGTG GAAGAACAAC TTAAAGTGGG GCATAATATG
TTGCGTGCAC ACGCTTTGGC GGTGGCGCTC TATAGGGAGA AATATTCCCA GTTTGGAGGG
AAACTTTCAA TCAATTTGAA TAGCGCTTGG GTCGAGCCGG CGTCGGATTC GCCGGACGAC
GTGCGTGCGG CGGCAAACGC GATGGATGAA GAGCTTGGAT GGTTTGCCGA TCCTATTTAT
AAAGGTGACT ATCCGGCGAG CATGCGGGCG AGATTGGGGA GCTTTCTTCC GGAGTTCACC
GAGGAGGAGC GCGTGCTCGT GAAAGGGAGC GTCGATTACT TTGCGCTCAA TCACTACACG
TCCTACTTCG CCAAGCATGT GACCGACGCG CAAGCTTCGT CGCAGCTTGG TTTGAGCGGC
AGACCTCAGC CGTGGGAGAT CACACTAGAG TCAGAAAAGA GCAAGAAACC AATCGGCAAG
GAGGCGCAAA GCGACTGGTT GCACATCGTG CCGTGGGGAT TAGAAAAGGT TTTGCTGCAC
ATCAAGGACA GATACGACGA TCCAGCGATC ATGATCTCGG AGAATGGCGT CGATATCGCC
GAGAGGGGCG ATATCGCGGA AACTCTGGAC GACACAACGA GAGTCAAGTT TATCGATGCC
TATCTCGGAG CCGCTCGCGA GGCGATGCGT AAAGGCGCAA ACGTTGTGGG GTACTTTTAT
TGGTCGATGT TCGACAACGT CGAGTGGGTG GATGGGCGAT CGAAACGATT TGGTTTGGTT
TATGTCGATT ACGACGGCAA GTACGGCGAA AAGATGAAGC GCTATCCAAA GAAATCTCTC
GAGCACTTCT CCTCCTACAT G
 
Protein sequence
FPPSFAFGVG TSAWQIEGNG GDRPRSVWDA FVSELGEEKR VEAERGIGFH ERYAADAQMM 
ADAGVKHFKM SLSWPRLMRA DGSAIDEGFE YYQNVFGALR ERGVEPHVTL FHWDTPIVCE
GAWVKDEILK DFEKYADAVF SRLGKGIKYW TTISEPKTVA EMGYGAGLHA PGRRSVEEQL
KVGHNMLRAH ALAVALYREK YSQFGGKLSI NLNSAWVEPA SDSPDDVRAA ANAMDEELGW
FADPIYKGDY PASMRARLGS FLPEFTEEER VLVKGSVDYF ALNHYTSYFA KHVTDAQASS
QLGLSGRPQP WEITLESEKS KKPIGKEAQS DWLHIVPWGL EKVLLHIKDR YDDPAIMISE
NGVDIAERGD IAETLDDTTR VKFIDAYLGA AREAMRKGAN VVGYFYWSMF DNVEWVDGRS
KRFGLVYVDY DGKYGEKMKR YPKKSLEHFS SYM