Gene OSTLU_35780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35780 
Symbol 
ID5002900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp523674 
End bp524975 
Gene Length1302 bp 
Protein Length433 aa 
Translation table 
GC content55% 
IMG OID640418321 
Productpredicted protein 
Protein accessionXP_001418959 
Protein GI145349062 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0448] ADP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR02091] glucose-1-phosphate adenylyltransferase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.653202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACG TGTTGTCCAT CATTCTCGGC GGCGGCGCGG GGACGCGATT GTACCCGTTG 
ACGAAGAAGC GGGCGAAGCC GGCGGTGCCG TTGGGGGCGA ACTACCGCTT GATCGATATC
CCGGTGTCGA ACTGTATCAA CTCTGACATC AACAAGGTGT ACTGCTTGAC GCAGTTTAAC
AGCGCATCTT TGAACAGACA CTTGTCCCAG GCGTACAACA CGAACATCGG GACGTACACG
CGTCAGGGAT TCGTCGAAGT GCTCGCGGCT CAACAGAGCC CGATCAACAA GGCGTGGTTC
CAAGGTACCG CGGATGCTGT TCGTCAATAC TTGTGGTTGT TTGCCGAGAG CGGGTGCGAG
GAATACTTGA TTCTTTCGGG CGATCACTTG TACCGCATGG ATTACCGTCC GTTTATTCGC
GACCACCGCG CGAAGAACGC CGACATCACC GTCGCCGCTT TGCCGACGGA TGAGAAGCGA
GCCAGCTCTT TCGGCTTGAT GAAGATTAAC GAACACGCCA CCATCATCGA GTTCTCCGAA
AAGCCCAAGG GTGACGCTCT CAAGGCGATG CAATGCGACA CCACCATCTT AGGCTTGGAC
GCGGAACGCG CCAAGGAAAT GCCGTACATT GCCTCGATGG GTATCTACGT GTTCAACGCC
AAGGCTATGG AGCAGGTGCT TCAAGATGAT TTCCCGGAAG CGAACGATTT CGGTGGTGAA
ATCATCCCGA TGGCGGCTCA GAAGGGCATG AAGGTGGTCG CTCACTTGTA CGACGGATAC
TGGGAGGACA TCGGTACCGT CGATGCGTTT TTCCACGCCA ACTTGGAGTG CAACGACCCG
AACCCGAAGT TCAGTTTCTA CGATCGCAAC GCGCCGATCT ACACCCAGTC TCGCTTCTTG
CCGCCGAGCA AGGTGCAAGA TTGCGAAATC GAGCGCTCCA CGATCGGCGA CGGCTGCACC
ATCAAGCAAG CCAAGCTCAA AAACGTCATG GTTGGTTTGA GATCGACGGT CAACGAAGGG
TGTGATTTGG AGGACACACT CGTCATGGGT GCTGATTACT ACGAGAGCCT CGAAGAATGC
GACCCGGCTA GCCTTCCGGG CTGCACGCCG ATCGGTATTG GCGCCGGCAC GAAGATTCGC
AAAGCCATCA TCGACAAGAA CGCGCGCATT GGTGAAAACT GCCAAATCCT CAATGAGGCT
GGCGTCATGG ATAAGGATTG CGAAAGCGAA GGTTACATCA TCCGCGATGG CATCATTGTC
GTCATCAAGG ATGCCGTGAT CAAGGCTGGC ACTGTCATCT GA
 
Protein sequence
MDNVLSIILG GGAGTRLYPL TKKRAKPAVP LGANYRLIDI PVSNCINSDI NKVYCLTQFN 
SASLNRHLSQ AYNTNIGTYT RQGFVEVLAA QQSPINKAWF QGTADAVRQY LWLFAESGCE
EYLILSGDHL YRMDYRPFIR DHRAKNADIT VAALPTDEKR ASSFGLMKIN EHATIIEFSE
KPKGDALKAM QCDTTILGLD AERAKEMPYI ASMGIYVFNA KAMEQVLQDD FPEANDFGGE
IIPMAAQKGM KVVAHLYDGY WEDIGTVDAF FHANLECNDP NPKFSFYDRN APIYTQSRFL
PPSKVQDCEI ERSTIGDGCT IKQAKLKNVM VGLRSTVNEG CDLEDTLVMG ADYYESLEEC
DPASLPGCTP IGIGAGTKIR KAIIDKNARI GENCQILNEA GVMDKDCESE GYIIRDGIIV
VIKDAVIKAG TVI