Gene OSTLU_429 details


Gene Information

Locus tag: OSTLU_429
Symbol:
ID: 5003456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 453996
End bp: 456440
Gene Length: 2445 bp
Protein Length: 815 aa
Translation table:
GC content: 60%
IMG OID: 640418877
Product: predicted protein
Protein accession: XP_001419385
Protein GI: 145349941
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
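
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and the helper name `coordinates_consistent` is hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 453996
END_BP = 456440
GENE_LENGTH_BP = 2445
PROTEIN_LENGTH_AA = 815


def coordinates_consistent(start, end, gene_len, protein_len):
    """Check that the span and the two reported lengths agree.

    Assumption: start/end are 1-based, inclusive coordinates, so the
    span is end - start + 1. The protein length is compared against
    gene_len / 3, i.e. one codon per residue with no extra codon counted.
    """
    span = end - start + 1
    return span == gene_len and gene_len == 3 * protein_len


span_bp = END_BP - START_BP + 1
print(span_bp)
print(coordinates_consistent(START_BP, END_BP, GENE_LENGTH_BP, PROTEIN_LENGTH_AA))
```

Under these assumptions the span equals the listed gene length, and the gene length is exactly three times the listed protein length.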


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0318632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.136701
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTCGCGCTGG CGCTGACGAC GATCGTCGTC CTCGCGACGC GAACGGACGC GCTCAAAGCG 
AACGATTTCA AGACGTGCGC GACGAGCTCG TTTTGCGCGC GAGGGCGAGA CGCGCGAGAC
GCGCGAGGCG CGAGGATCGA AGACGCGCGC GCGCGCGAGC GCGACGCGAG GGCGGTCGAG
TTCGACGTCG TGCGCGGCGA TGAGGGAAGT GCGTCGCTGG TGGGTTCGGT GGAGGGGATG
AAAACGGGCG CGGTGCGCGT GCGGGTGGAC GAAAGAGGCG AGCGGGCGCG ATTCGCGGCG
CGGGAGGTGC TGACGAGGGA GACGGAATTC GAGGACGACG CGAGGGCGCG GGCGCGCGTG
GTGGGAAGGA GCGAGACGAG CGTGGAGGTG GAGTTCGAAG AGACGGGGGC GCGGGCGAAG
ATTTACGCGG ATCCGTTCAA AGTGGAGGTG TACGCGCGGG AGGCGACGAC GCCGGCGGCG
ATTTTTAACG CGCGAGGCGA GTTCGCGTTC GAGACGGGAG GCGACGGGTC GTCGGAGGAT
TGGGCGGAGA CGTTTAACGG GCACACGGAT ACGCGCAAGA ATGGACCCAC GGCGGTGGCG
TTCGACCTGA CGTTTCCGAG CGCGAGCGAC GCGTACGGGC TTCCCGAACG CGCGACGTCG
CTCAGCCTGA AATCGACGCG AAAGTACGAG AGCGGTAAGA GTTGGTTCGG TCGATCTTCG
GTCGATTCGT CGGTGAATGA AACCACGCTC GGGGAACCTT ACAGATTGTA CAACCTCGAC
GTGTTCGAGT ACCTCGATGA TAGCGCGTTC GGGCTTTACG GATCGATTCC GATGCTCACC
GCGCACGGGG TCCGCGACGG CAAATCGACT ACGGCTGGCG CTTACTTTCA CAACCCGAGC
GAGATGTACG TGGATGTCAA CGTCGACGGC GCTAACGGCG TGCACACGAA GTGGATGGCC
GAGAGCGGCG CGATGGATGT TTTCATTCTC CCCGGCGATA CCCCCGCGGA TGTGTTGAAG
CAATACACGG CGCTGACGGG CACGACGTCG ATGCCACCGT TATTTTCGCT CGGCTATCAT
CAGTGCCGAT GGAACTATCG CGACGAGAAC GACGTCAAGG AAGTCGACGC CGGTTTCGAC
GCAAATGACA TACCGTATGA CGTGCTTTGG CTCGACATCG AACACACCGA TGGGAAGCGG
TACATGACGT GGGACAACGG TCCGTTCCCC ACGCCCGAAC GCATGATCAA CGACATCGCC
AGTCGCGGAC GAAAGATGGT CACCATCGTG GATCCGCACG TAAAAATCGA CAATAATTAT
CCCGTCTATA AAGAGGCAAA GGATAAAGGG TTTTACGTCA AGAAGAACGA CGGCACGACG
GATTTTGACG GGTGGTGCTG GCCGGGAAGT TCGACGTACT TGGACGTCAC GAATCCAGAC
GTTCGCGAGT GGTGGGCGAG CAAGTTTTCT TTGGACTCCT ACAAAGGATC GACGAAAGAC
TTGTACATTT GGAACGATAT GAACGAGCCG TCCGTGTTCA ACGGTCCCGA AATTACGATG
CAAAAAGACT TGATTCATCA CGGCGGCGTC GAGCATCGCG AAGTGCACAA CGCTTTCGGC
ATGTACTACC ACATGGCCAC CGCGGAGGGC ATCAAGCGAA GAAACGACGG CGATCGTCCT
TTTGTCCTCT CGCGCGCCTT TTTCGCCGGG ACGCAACGCA TCGGACCGAT TTGGACCGGT
GATAACACCG CTGATTGGCG ACACCTCGCC GTGAGCTTAC CGATGGTGCT CACGCTCGGC
GTGAGCGGGT TGACGTTTAG CGGCGCCGAC GTCGGAGGAT TTTTTGGCAA TCCCGACGCT
GAGCTCATGA CGCGGTGGTA CCAAGTCGGC ACTTACTATC CTTTCTTTCG TGGACACGCG
CATTTGGAGA CGAAGCGCCG CGAGCCTTGG TTGTTCGGTG ACGAGAGCAC GGCAATCATT
CGTCAAGCGA TTCGCGAGCG ATACGCGCTC CTGCCTTACA TTTACACCTT GTTTGAAGAG
TCGCACCGAA CGGGCGCGCC CGTGTTGCGT CCGCTTTGGT ACGAATTCCC AACGGATGAG
AACGTGTTCA AGATTCAAGA TTCATTCATG CTCGGTTCGG CAATTTTAGT GCAGCCAGTG
TTGAAGCAAG GCGCGAAATC CGTCTCGGTG TACCTTCCGG CTGGCGTTTG GTACGAAAAA
CGATCGGGCG CGCGACACGT CGGACCGAAG ACGTTCGACG TCTCCGTCGA GCTCAGCGAC
GTTCCGGTGT TCCTTAGAGG TGGTGCTATT TTCGTGCGAA AGGACAGAGC TCGGCGAAGC
TCGACAGCGA TGAAGGGTGA CCCATTGACC ATCGTCGTCG CGCTCGACGC GAACGGCGAG
GCGAGAGGCT CGTACTACGC CGACGACGGT AAGAGCTACG AGTAC
 
Protein sequence
LALALTTIVV LATRTDALKA NDFKTCATSS FCARGRDARD ARGARIEDAR ARERDARAVE 
FDVVRGDEGS ASLVGSVEGM KTGAVRVRVD ERGERARFAA REVLTRETEF EDDARARARV
VGRSETSVEV EFEETGARAK IYADPFKVEV YAREATTPAA IFNARGEFAF ETGGDGSSED
WAETFNGHTD TRKNGPTAVA FDLTFPSASD AYGLPERATS LSLKSTRKYE SGKSWFGRSS
VDSSVNETTL GEPYRLYNLD VFEYLDDSAF GLYGSIPMLT AHGVRDGKST TAGAYFHNPS
EMYVDVNVDG ANGVHTKWMA ESGAMDVFIL PGDTPADVLK QYTALTGTTS MPPLFSLGYH
QCRWNYRDEN DVKEVDAGFD ANDIPYDVLW LDIEHTDGKR YMTWDNGPFP TPERMINDIA
SRGRKMVTIV DPHVKIDNNY PVYKEAKDKG FYVKKNDGTT DFDGWCWPGS STYLDVTNPD
VREWWASKFS LDSYKGSTKD LYIWNDMNEP SVFNGPEITM QKDLIHHGGV EHREVHNAFG
MYYHMATAEG IKRRNDGDRP FVLSRAFFAG TQRIGPIWTG DNTADWRHLA VSLPMVLTLG
VSGLTFSGAD VGGFFGNPDA ELMTRWYQVG TYYPFFRGHA HLETKRREPW LFGDESTAII
RQAIRERYAL LPYIYTLFEE SHRTGAPVLR PLWYEFPTDE NVFKIQDSFM LGSAILVQPV
LKQGAKSVSV YLPAGVWYEK RSGARHVGPK TFDVSVELSD VPVFLRGGAI FVRKDRARRS
STAMKGDPLT IVVALDANGE ARGSYYADDG KSYEY