Gene OSTLU_956 details

Gene Information

Locus tag: OSTLU_956
Symbol:
ID: 5003488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 634774
End bp: 636504
Gene Length: 1731 bp
Protein Length: 577 aa
Translation table:
GC content: 56%
IMG OID: 640418909
Product: predicted protein
Protein accession: XP_001419443
Protein GI: 145350064
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
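
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention); the function names `gene_length_bp` and `codon_count` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding span of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


# Values copied from the Gene Information table.
start_bp, end_bp = 634774, 636504
length_bp = gene_length_bp(start_bp, end_bp)
codons = codon_count(length_bp)

print(length_bp, codons)  # prints "1731 577"
```

Under that assumption the span works out to 1731 bp, or 577 codons, which equals the listed protein length of 577 aa.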


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00413725
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTCGGGTTGA ATCAGCTTGA TTTCGTCTAC GCCATGGACC CAGACGAGCG CGTGTACGGG 
CTGGGCGAGC AATTTTCTTC GTACAATCAT CGCGGGCGTC GAGTGCCCGT GATTACGGGC
GAACAAGGCA TGGGTCGCGG AGTGCAACCT TTATCATTCA TGTTTAATTC CGTCTTTCCA
GGGTCGGCTG GGTCCTGGCA CACGACATAC ACGGCGATTC CGCATTACAT CACGCACAAG
GCGCGATCGG TGTTTCTGAC GAATTACACC TACAGCGAGT TTGATTTCAC CGAGGAAGAA
AGTGTCGTTA TTCGCGCCGC GGCTCCGAGT GGATTGATCA CCGGACAAAT AATCGGTGGG
TCTAGCATTC CTGACGTTTT ACGGGCGTAC ACTGATTATG CCGGACGCAT GACGTCGCTT
CCGGAATGGG CGATGAACGG CGTCATTCTC GGCATGACGG GTGGTCCACA GAAAGTACGT
CAAGTGTACA AAACATTGGG CGAAGGCGGT GTGAAAGTGG CTGGATTGTG GCTCCAAGAC
TGGGGTGGCG TGCGCAACAC GTCTATCGGG ATTGAACGCG TGTGGTGGAA TTGGCGTCTC
GACGAGACGC ACTACACGGA TTGGGACGCG CTGCGAGAGG AAATCAAACC AAACGGCACG
CACCTTCTGA CATACGTCAA TACCTTCTTG ATGGATGCGA ATTCCGACAA AGGATTACTT
TATAGAGAGG CGAAGGAGAA AAACTACATG GTACGCGATG TCAAGGGTGA AGTATATAGA
CTCGGCTCGG AACCGGGCGT GACATTCGGT TTACTCGACT TGTCCAATCC CGAGTGCGTG
GCGTGGATAG AGGATATCAT TGTCGACATG CTGGAAACGA CGGGCGCGAT GGGCTGGATG
GCGGACTTTG GTGAATATCT CCCTTTCGAC GCGGTTTTGC ACTCAGGGGA GTTGCCCATC
GAAGTGCATA ATCGTTATCC TGAGGATTGG GCAGAGGTGA ATCGACGAGC GATGCGGCGA
GCCGGTCTCG AGGGCACAGG TTTCTTCTGG AGCCGCAGTG CGAGTACGAA GTCCCCGAAA
CATTCCGCGC TTTTCTGGCT TGGCGATCAA ATGGTCTCAT GGGACGCGTA CGACGGCATC
AAAACAGCCG TGCTCGGCGG ATTATCAGGC GGATTGTCCG GTCTTACGTT GACGCACAGC
GACGTCGGAG GGTACACCGC CCACCCGCTC AAGCATCGTT CGGAAGAACT CCTGATGCGA
TGGATGGAGC TGAACGCATT CGCCGACGCG ATTTTTCGCA CACACCAAGG TAATCGCCCG
CATCACAACG CACAGCCATG GAACACGCCA GAGTTGGTGG AACATTTGAA GTTTTGCGTG
GATATTCACG TCGCGCTCAA GCCGTACAAA GTCGAGCTCA TGCGAGAGGC CCAAGCCGTG
GGGCTCCCCA TGACTCGTTC GATGATCATT CACTACCCAT ACGACACCAA CGCCGCCAAT
ATCGCCACGC AATTCCTCCT CGGACGAGAC ATTCTCGTCG CCCCCGTGTT GGACAAAGGC
GCCACGCACG TGCACGTTTA TCTTCCGCCC GGCGACGTGT GGGTCGACGC CTGGACGACG
CAAAGAGCGC CCGTGCAGCC CGACCTCATC GGCTCTGACG AAGGTGGCCG AGGGTCGTGG
ATCACTGTTG ACACTCCCAT GGGTTGGCCC GCCGCCTTCG TCCGCAAATC C
 
Protein sequence
VGLNQLDFVY AMDPDERVYG LGEQFSSYNH RGRRVPVITG EQGMGRGVQP LSFMFNSVFP 
GSAGSWHTTY TAIPHYITHK ARSVFLTNYT YSEFDFTEEE SVVIRAAAPS GLITGQIIGG
SSIPDVLRAY TDYAGRMTSL PEWAMNGVIL GMTGGPQKVR QVYKTLGEGG VKVAGLWLQD
WGGVRNTSIG IERVWWNWRL DETHYTDWDA LREEIKPNGT HLLTYVNTFL MDANSDKGLL
YREAKEKNYM VRDVKGEVYR LGSEPGVTFG LLDLSNPECV AWIEDIIVDM LETTGAMGWM
ADFGEYLPFD AVLHSGELPI EVHNRYPEDW AEVNRRAMRR AGLEGTGFFW SRSASTKSPK
HSALFWLGDQ MVSWDAYDGI KTAVLGGLSG GLSGLTLTHS DVGGYTAHPL KHRSEELLMR
WMELNAFADA IFRTHQGNRP HHNAQPWNTP ELVEHLKFCV DIHVALKPYK VELMREAQAV
GLPMTRSMII HYPYDTNAAN IATQFLLGRD ILVAPVLDKG ATHVHVYLPP GDVWVDAWTT
QRAPVQPDLI GSDEGGRGSW ITVDTPMGWP AAFVRKS
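
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the block formatting could be stripped and a GC percentage computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example uses only the first line of the gene sequence, so its result is not expected to match the 56% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "GTCGGGTTGA ATCAGCTTGA TTTCGTCTAC "
    "GCCATGGACC CAGACGAGCG CGTGTACGGG"
)
seq = clean_sequence(first_line)

print(len(seq))                   # number of bases after cleaning
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Passing the full gene sequence through the same two functions would give the figure to compare against the GC content listed in the Gene Information table.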