Gene OSTLU_39745 details


Gene Information

Locus tag: OSTLU_39745
Symbol:
ID: 4999994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 565157
End bp: 568345
Gene Length: 3189 bp
Protein Length: 1062 aa
Translation table:
GC content: 58%
IMG OID: 640415415
Product: predicted protein
Protein accession: XP_001415537
Protein GI: 145340863
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases
TIGRFAM ID: [TIGR02103] alpha-1,6-glucosidases, pullulanase-type
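
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Amino-acid count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(565157, 568345)
protein = expected_protein_length(length)
print(length, protein)
```

Under these assumptions the computed values agree with the listed gene length (3189 bp) and protein length (1062 aa).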


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTCGA ACGTCGCGCT CCGAGCGTCC ACGACGTCCG CGCACCGATG GTCTCGCTTT 
TCGTCGTCGC CGTCGTCCAC GAGACGTCGC CAACATCAGC GCCAACATCA CCATCTTGAT
GTCCAACACC GTCCTTCTCG TCCCGTTCGC GTCGCCGCCG CGCCCGCGGA GAAGCCGACG
GTCGAAGAAG ACACCGCGCG CGTGCTTCGC GTGCGATATC ACCGCAAAGA TGGCGCGTAC
GCGCGATGGG GCGCACACGT TTGGGGACGC GGCGCGGCGT CGCCGACGCC ATGGGACGCG
CCGCTTGCGG CGACGACGGA TGACGCGTCG GAATGGGTGA CCTTTGACGT CGAGCTCTCG
GATATCTCTC GCGGTGATGG ATCGGTGAGC GTGCTGATTC ATAAGGGCGA GGCGCAAGAC
TGTCGAGTAG AGGATTTCGA CGCGACGGCG GCGCCAGAGG TGTTTTTGGT GAGCGGATAT
TCGAGCGCGT TCGAGACGGA GCCCGATTTG ACGTCGTTGC CGAAAGGAGA CATAGAAAAG
TATCGCGCGA TCTGGGTCGC CGAGCGCGTG ATCGCGGTGC CGGGGGATTT CGCGAACGAT
GGGGACTCGT TCACGCTCGT GAGCTCGTCC ACGGCGGATT TGAAAGTCAC GGGCGAAGGC
GTCGTCGGCG GCGACGACGT CGTCACCGTG GTTCGTTCGG GTGAGTTACC TTCGAGCGTG
TGCGCAAAGT TTCCACACAT CAAAGCTGCG GGCTATCGCG CGCTCGAAGT TCCTTCGAGC
GTCAACGTGC GAGACGCGTT GAAGCGCCAA ATCGCCGTAG CCGCGGTGGA CGCCGCGGGT
AAGCCGACGG ATGCCACGGG CGTGCAGTTG CAAGGCGCGA TCGATGATTT ATTCGCTTAC
GATGGACCTC TCGGAGCGGA ATTTGGAGTG AACGACAAAG TCACGCTACG CGTGTGGGCG
CCCACGGCCT TGAACGTCGC GTTGGCGTTG TTCGACGAAC CCAGAGGAGA GGAGACGAGA
CGCGTCGTCG CGATGACGCG GGACGAGACG TCGGGGGTGT GGAGTGCGAC CGGTGACGAT
TTCAAGGATA AATATTACAA TTTCGAAGTC ACCGTATTCA ACCCGACGAC TGGGAAAGTG
TCGACGAACG TCGCGTCGGA TCCGTACGCT CGGAGCCTCG CCGCCGACGG TCGCCGAGCG
CACGTGTGCG ACATTTCGCG AGACGACCTC AAACCCAAGG GATGGGAGAC GTTTGAGAAG
CCAAAGTTCA CGCATCCCGT GGATTGTTCA ATCTACGAGC TACACGTGCG AGATTTCAGC
GCGTTGGATG AAACCGTGAG CGCTTCTGCG CGAGGCAAGT ATTTAGCGTT CTGCGAAGAA
TCGAGCGTGT GCGTGTCCCA TCTCAAAAAG CTCGCCGACG CCGGTCTCAC GCACGTGCAC
CTGTTGCCGT CTTACGACTT TGGTTCCGTG CCCGAGCTCC CAGAAAATCA GCTGTCGGTA
GACTTCAAAG AGTTGGCCAA ATTACCACCG AATTCTCGAA AGCAACAAGA AGAAATCAGT
AAAATTGCAT GGTCTGATTG CTTCAACTGG GGATACGACC CCGTGCACTA CGGCGTGCCC
GAAGGCAGTT ACGCCACCAA TCCAGACGGT CCCCGGCGCA TTTTCGAGTA CCGCCAGATG
GTGCATGCGC TCGCCTCGAA CGGATTACGC GTCATATGTG ATGTGGTGTA TAATCACACC
TTGTCGTCCG GACCGAGTGA CGTCAACAGC GTCTTGGATA AAATTGTCCC GGGGTACTAC
CATCGACGCA ACTTTGACGG ATTCATCGAG GCGAGCACGT GTTGCAACAA CACGGCCAGC
GAGCATTACA TGATGGATCG TCTCATTGTG GACGATCTCG TGCACTGGGC GAAAGATTAT
AAGGTTGATG GATTCCGATT TGATTTAATG GGGCACTTGA TGCTGTCCAC GATGCTTCGA
GCGAAGGACG CGCTCCAATC GTTGACGCTC GAGAAGGATG GTGTCGATGG TAAATCACTG
TACCTGTACG GAGAAGGATG GGATTACGCC GAGGTGGAAA AAGGTCGCGT CGGCAAGAAT
GCATCGCAGC TAAATTTGGC TCACACTGGT ATCGGAAGCT TTAACGACCG CGTCCGCGAG
GGTTGCATCG GAGGCAGTCC CTTTGGCGAT CCTCGCATGC AAGGTTTCCT CACGGGCTTG
TACTACACCC CCAACGGCGC CGTCGATCAA GGAGACCAAG ACTCCCAACG CTATAGAATG
ATGGAGGACG GCGAGAAGAT AATCGCCGCG CTCGCTGGAA ACGTCCGTGA TTTTGTCTTT
GTCAATCGCC ACGGCGTCGA GGTTCCGTCG AGTTCGGCGG CTTGGCCAGA CTCAAACGTT
GCCTACGCGG GCGAACCGGA AGAAACGGTG AACTACGTCA GCGCGCACGA TAACGAAACC
CTATTCGATT GCATCATGTT AAGAGCGGCG GCGTCTGTGT CTCTGGAACA AAGGTGCCGA
ATCAATCATT TAGCGACGGC GATTGTGGCG TTGTCGCAAG GCGTGCCATT CTTCCACGCG
GGCGACGAAA TCTTACGGAG TAAATCTTTA GACCGAGACT CGTACTCCAG CGGTGACTGG
TTCAATAGGC TGGACTACTC TGGGGACACG CACAACTTCG GCGTCGGGTT GCCCGGGGAG
CAAAAGAACG GCGATAGATA CGACTTCATC ACGCCCATGC TCGCCGATAC GTCGATGCGC
CCCTCGAAAG AGTTCATCGA AGAGGCGACG AGAAACTTTT GCGAACTTCT GAGCATCCGT
CAATCGACGC CACTGTTGCG TCTTCAAACC ACGCGAGACA TTCAGCGTCG CATGAAGTTT
TATAACCGCG GACCGGCGCA AACGCCTGGG CTCATCATCG CTAGTATCAA CGACGGCGAC
GCGTCCACGC CTGGGTTACC ATCGCTCGAC GCGAACTACA AGCGCGTCGT GCTCGCGTTC
AACGCCACTC CGAACGAAAT TTCACATCAC GAGGCTGGGC TTAAAGTTGA TTTCGCTGGC
GTCGATCTCG AATTACATCC GCTCGTCGGC GGCGTCACCG CGGACGCCGT CGCGATGAGG
AGCGTCTTCA TCGAAGGCGT TCCCACGATT CCTCCGTACA CGTGGACCGT GTTCGTACAG
CATAGATAA
 
Protein sequence
MRSNVALRAS TTSAHRWSRF SSSPSSTRRR QHQRQHHHLD VQHRPSRPVR VAAAPAEKPT 
VEEDTARVLR VRYHRKDGAY ARWGAHVWGR GAASPTPWDA PLAATTDDAS EWVTFDVELS
DISRGDGSVS VLIHKGEAQD CRVEDFDATA APEVFLVSGY SSAFETEPDL TSLPKGDIEK
YRAIWVAERV IAVPGDFAND GDSFTLVSSS TADLKVTGEG VVGGDDVVTV VRSGELPSSV
CAKFPHIKAA GYRALEVPSS VNVRDALKRQ IAVAAVDAAG KPTDATGVQL QGAIDDLFAY
DGPLGAEFGV NDKVTLRVWA PTALNVALAL FDEPRGEETR RVVAMTRDET SGVWSATGDD
FKDKYYNFEV TVFNPTTGKV STNVASDPYA RSLAADGRRA HVCDISRDDL KPKGWETFEK
PKFTHPVDCS IYELHVRDFS ALDETVSASA RGKYLAFCEE SSVCVSHLKK LADAGLTHVH
LLPSYDFGSV PELPENQLSV DFKELAKLPP NSRKQQEEIS KIAWSDCFNW GYDPVHYGVP
EGSYATNPDG PRRIFEYRQM VHALASNGLR VICDVVYNHT LSSGPSDVNS VLDKIVPGYY
HRRNFDGFIE ASTCCNNTAS EHYMMDRLIV DDLVHWAKDY KVDGFRFDLM GHLMLSTMLR
AKDALQSLTL EKDGVDGKSL YLYGEGWDYA EVEKGRVGKN ASQLNLAHTG IGSFNDRVRE
GCIGGSPFGD PRMQGFLTGL YYTPNGAVDQ GDQDSQRYRM MEDGEKIIAA LAGNVRDFVF
VNRHGVEVPS SSAAWPDSNV AYAGEPEETV NYVSAHDNET LFDCIMLRAA ASVSLEQRCR
INHLATAIVA LSQGVPFFHA GDEILRSKSL DRDSYSSGDW FNRLDYSGDT HNFGVGLPGE
QKNGDRYDFI TPMLADTSMR PSKEFIEEAT RNFCELLSIR QSTPLLRLQT TRDIQRRMKF
YNRGPAQTPG LIIASINDGD ASTPGLPSLD ANYKRVVLAF NATPNEISHH EAGLKVDFAG
VDLELHPLVG GVTADAVAMR SVFIEGVPTI PPYTWTVFVQ HR