Gene OSTLU_92070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_92070 
Symbol 
ID4999493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp172445 
End bp174757 
Gene Length2313 bp 
Protein Length770 aa 
Translation table 
GC content62% 
IMG OID640414914 
Productpredicted protein 
Protein accessionXP_001415751 
Protein GI145341300 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGGC ATCGCGGGAC GAGAGGCGGA TGGAACGCGA CGACGACGGA GGGTGGAGAT 
GGACGCGAAC GCGAACGCGC GAGCGAGCGG GACGCGAACG CGGCGACGCG AGGCGAAGGC
GAAGGCGAGG CGACGGGGAG AGCGTCGATC GCGGCGACGC TCGCGGGAGA GACCGCGTCG
TCGATGACGT CGAGTTATAG CGCGTGGGAG GGCGAGGAGA CGAAGACGCA CCCGAGCTCG
ACGTTCGACG CGGGGAGATC GACGGGGGCG GCGTTGCACG GGTGGTTACA TTTAGAGGAG
TGGTTCTTCG CGCAAGGCGC GTCGCACGAC GTGAGCGCCG ATCGCAGGGA CGAGAACGGG
GTGTGTTTTC CGCCGATGTT TCCCGACGCC GCGTCGTTGG GGTTCACGTG GTCGTCGGAG
GGAGATTTAG TCAACAAGCT CGTCAAGCAC TTTGGCGCGA AGTCGGCGGT GACGAGTTTC
TCCGCGCATC GGGCGCAGTA CATCACGGAC GAAGACTTAT TAGAAATAGC GTCGCAAGGG
ATCGAAATGG TTCGATTGCC ACTGTCTTGG AGCATTTTCG CGAGAGATCC GGCCACGGTG
CCTCGCGGCG GCGAGCGCAT CCTCGTCGAT CCCGTGTACC CGGATCGCTT GTTCGTCAAC
ATGGCGGGCG CCGATTTAGA CGCGGTGATC GAACGAATTC GCAACGCCGG GCTGAAGGTG
TTGATTGATT TGCACAGCAT GCCCGGCGGT GCGGCGGCTG GGACGTACAA CGGCGTTTTC
CCGCATCCGC CGATGATGTT CGCTCGCGAC GATCTCAGTC GCACGGGGTT GCTCATCGTT
CGAAACATGC TCAACTGGTT CAAATCGTTG CCGGAAGACT CTCGTCTCGC CGTACACGGA
ATCACGCTAC TGAACGAGCC CGGGCACAAC CTTCCGGCGC AAAGGTCGAA GGTTCTGCGT
TGGCTCGCCA AGGCTGTGCA AACGTACAAG GATGAAATCG TCAACGACGG CGTGCCCGAC
GGCGAGCGCG TCCCGTACCT GTACGTCAAC TTGATCGAGA CGCTCGATCT GAACGTCGCG
GACATGGCGG CTTGGATGCG TTCGCAATTC ACGGTGAAAG AGCTCGAGTC ATGGGCGGTG
TTAGACGTGC ATCATTACTT TGCGTGGTCG TACACAGGAT GCATGGGCGG CACCAACGCG
GGGTGCGCGT TTAGTTGCGA CGACAAACCA TCCGTCGTCG CGCAAAAAGT CGGCGAGCGC
GCGGGCGAGT GGGCCGGTAC TTTCCGAGCG GCGGCGCAGA CGTACGGCGT ACAAAATCTC
GCCGTATCGG AGTGGTCGCT GGCGACGTTT TCGGACTCAT CTCGCTCGTG CTCGAACAGG
GAAGTCTTGG ACATTATGTT TGAGCATCAA GAGCAAGCGT ATCGGGGCGC CGGCATCCAG
AGTTTCTTTT GGGGATGGAA GATGCCCCAC GGTGGTTCGC ACATCAAAGC GTGGTCGTTG
AGCGATTACT TGGCCGGGCG AAGCGCGAGC GAACGTAAGC AGCACGCGCT GGTCCCGCAC
GGGTACTCGA TCGAAGACAC GCTCGAGTTA CCGCAAGGCG CTTCCAAGGA GGATGCGGCG
CGTCTCATTC GCGCGTACAG CCAGTTCCCG GACAAGCCCG CCGAGGATAA AGATAGCCCA
GACACGATGA AGATTCAAAC CAAGGAAATG TTGAAACTTT TGAGTGCAGT GGCGGGGCAG
TCGACGGCGG CTGTGGGGGA GAGCGATCAC GCCCGCCATC GACGCAGTCG CGGCGGGCCG
TCCGCCGCCG ACGCCACGGA GGATTATTTG CGCGCGAGTT CCAAGCTCAA GTCAATAATC
GACACTGACG CGTACGACGA CGACGACAAC GACGATGACA AGGAAATCAT CGTTGTTCGC
CACTCGCATG TATCTGACGT AGAAAAGACG CCGGTTGGTG ATCCGATCCG GCTGGACTTG
CCGTCTCCGC CGCCGTCTCC GAACGTCACC GCGCCGGCGG ACGTCGCGCT CGCCGCGGCG
CAGCCGTCGC CATCGCCCGA GCCAGTCGAC GATGGTTCGA TCGACTCTTT GCTCGATCCG
GCGAAAGATG TTCCTCCGCC ACCGCCGGCG CCACCGCCGA TGCCACCGCC GCCGCCACCG
ATGCCGCCGC CGAAGAGCGT GCTGCGCGCG CAACAACGAG CCGCGGAGAC GCTCTCCGAT
TTGTCGCTCA CCTCCGGGGA AGGGTTCTCG ATTCCCGACG ACGGCTTCGA CGAGTCATCG
AGCGTGAGTG CGTCCACAAT CGATGGCTTC TAG
 
Protein sequence
MARHRGTRGG WNATTTEGGD GRERERASER DANAATRGEG EGEATGRASI AATLAGETAS 
SMTSSYSAWE GEETKTHPSS TFDAGRSTGA ALHGWLHLEE WFFAQGASHD VSADRRDENG
VCFPPMFPDA ASLGFTWSSE GDLVNKLVKH FGAKSAVTSF SAHRAQYITD EDLLEIASQG
IEMVRLPLSW SIFARDPATV PRGGERILVD PVYPDRLFVN MAGADLDAVI ERIRNAGLKV
LIDLHSMPGG AAAGTYNGVF PHPPMMFARD DLSRTGLLIV RNMLNWFKSL PEDSRLAVHG
ITLLNEPGHN LPAQRSKVLR WLAKAVQTYK DEIVNDGVPD GERVPYLYVN LIETLDLNVA
DMAAWMRSQF TVKELESWAV LDVHHYFAWS YTGCMGGTNA GCAFSCDDKP SVVAQKVGER
AGEWAGTFRA AAQTYGVQNL AVSEWSLATF SDSSRSCSNR EVLDIMFEHQ EQAYRGAGIQ
SFFWGWKMPH GGSHIKAWSL SDYLAGRSAS ERKQHALVPH GYSIEDTLEL PQGASKEDAA
RLIRAYSQFP DKPAEDKDSP DTMKIQTKEM LKLLSAVAGQ STAAVGESDH ARHRRSRGGP
SAADATEDYL RASSKLKSII DTDAYDDDDN DDDKEIIVVR HSHVSDVEKT PVGDPIRLDL
PSPPPSPNVT APADVALAAA QPSPSPEPVD DGSIDSLLDP AKDVPPPPPA PPPMPPPPPP
MPPPKSVLRA QQRAAETLSD LSLTSGEGFS IPDDGFDESS SVSASTIDGF