Gene OSTLU_38852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38852 
Symbol 
ID5002087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp732657 
End bp734438 
Gene Length1782 bp 
Protein Length593 aa 
Translation table 
GC content60% 
IMG OID640417508 
Productpredicted protein 
Protein accessionXP_001418096 
Protein GI145347269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.569718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCG AAGACGACGT CGAGGAAGAC GGCGGCGCGG CGACGGACGG CGCGCAGGCG 
CCGTACGCGG CGTTTTGCAC GGCGGCGATA CTGGCGGCGG TGCACCGAGC GGCGATTTGG
CGCGACTCGA AGGATTTCGT GGACACGCGG TCGAGATCAC CGCCGGCGAA GGTGTTCGAG
GCGCTGGCGC AGTCGCGAGC GGCGACGTGT AGCGTGGCGG CGCGGGAATT CTTGAACGAA
CACTTCGAGA GCGGACCGAG GGAGAGGTCT AAGATGCCGG AACTGGCGGA CTGGCGAAGC
GAGCCGGCGG TGGCGAGAGG GGCGAGGTGC GAAAAGTCGA GGGAGTTTGC GACGCACGTG
CACGAGCTGT GGCGCGTGCT CGCGCGGTTG GACGCGGATG ACTACGCTGA GGAAGAAGTC
GGCGCGGAGG GCGAGGCGCG ACGGACGACG AGCTCGAGGA TTCGATTGCC GTACCCGGCG
GTGGTGCCGG GGGAACGTTT TAGGGAAACC TACTACTGGG ATACGTATTG GATCGTTTTG
GGGTTGTTAA CGAGCGAGAT GCCGGCCACG GCGCTGGGAG TGACGAATAA TTTGTTGTAC
ATGGTCACCA CGTACGGATT CGTGCCCAAC GGCGCGCGCG TGTACTATTT GAATCGATCT
CAGCCGCCGT TGTTGTCGTC GTGCGTCGCC GAGGTGTTTC AAGCGACGCG AGACGTCGAG
TGGTTGCGAC AGGCGTTGCC GTTGCTGGTG CAAGAATACG CCTATCTCAC TCGAAGTGAA
CGCACGGTGA CGATTCGTGA CACGGAAACC GGAGAGACGC ACGAGCTGTC GCGATATTTT
GCCAACACCA CGCGTCCGAG GCCAGAGAGC TATCGCGAAG ACGTCGAGGT GGCGCGCCGA
GCGACCAGAA AGGTGGAGGA CGCCGTGGCC AAGCTCGAAG CTAAGCGTAA GATATATCGA
CATCTTGCTA GCGCGGCGGA GAGCGGCTTC GACTTCAGCT CGAGATGGTT CCTAGACGGC
GATAACTTGG AGACGATTCG CACGTGCAAC ATCATTCCAT CCGACTTGAA CGGATTTATG
CTACGAGTGG AGACGCAAAT CGCTTTGCTC GCCCGCGAGG CATTAGTGTC GTTGGAAAAC
GAAGACGAGC TCTTCGCCGA GCGCGTGTAC TTGAACCATT TGCTCGAGAA GTTCTCCCGT
GCGAGCGAGG TGCGGCGGCG CGCGATTGAC GCCGTGCTTT GGGACGACGA CGTCAAGCGG
TGGCGAGACA TGGCGTTCGA ACCGCTCATG GGCGAAGACA CCCGAGGAAT CGTGCGCGAT
CGCGATGATC TCACGGCGGC ATCTGAGAGC CCGTTTACGA GCGATTTTAC TCCGCTTTGG
TGCGGCGCTT GCGATCCCGA CAGCGATCAA GCGTACGAAG TCGTCGAGTC GTTGAAAAAG
TCCAAGTTGG TCACCGACAA AGGCATCGCG ACCTCACTCG TCGAGAGCGG TCAGCAATGG
GATTGGCCTA ACGCTTGGGC GCCGGAGACT CACATGATTG TCGAAGCGAT ACAAATTTTC
GCCCCTCGCG AGGAAGAGTA CGCGAAGACG CTCGCGCACT CGTGGCTCCG CACCGCGCAT
CAAGCGTGGA AGTCAACGGG CTACATGCAC GAAAAGTACG ACGTGCGCTC GACCGAGGAC
GGCGTGGGTA AAGGCGGCGA ATACATCCCT CAACGTGGTT TCGGCTGGAC CAACGGCGTG
ACACTTCGAT TACTCGAACA ATACGGATTC CCTCAAGATT GA
 
Protein sequence
MTSEDDVEED GGAATDGAQA PYAAFCTAAI LAAVHRAAIW RDSKDFVDTR SRSPPAKVFE 
ALAQSRAATC SVAAREFLNE HFESGPRERS KMPELADWRS EPAVARGARC EKSREFATHV
HELWRVLARL DADDYAEEEV GAEGEARRTT SSRIRLPYPA VVPGERFRET YYWDTYWIVL
GLLTSEMPAT ALGVTNNLLY MVTTYGFVPN GARVYYLNRS QPPLLSSCVA EVFQATRDVE
WLRQALPLLV QEYAYLTRSE RTVTIRDTET GETHELSRYF ANTTRPRPES YREDVEVARR
ATRKVEDAVA KLEAKRKIYR HLASAAESGF DFSSRWFLDG DNLETIRTCN IIPSDLNGFM
LRVETQIALL AREALVSLEN EDELFAERVY LNHLLEKFSR ASEVRRRAID AVLWDDDVKR
WRDMAFEPLM GEDTRGIVRD RDDLTAASES PFTSDFTPLW CGACDPDSDQ AYEVVESLKK
SKLVTDKGIA TSLVESGQQW DWPNAWAPET HMIVEAIQIF APREEEYAKT LAHSWLRTAH
QAWKSTGYMH EKYDVRSTED GVGKGGEYIP QRGFGWTNGV TLRLLEQYGF PQD