Gene OSTLU_45442 details

Gene Information

Locus tag: OSTLU_45442
Symbol:
ID: 5001363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 438729
End bp: 440082
Gene Length: 1354 bp
Protein Length: 430 aa
Translation table:
GC content: 55%
IMG OID: 640416784
Product: predicted protein
Protein accession: XP_001417504
Protein GI: 145346039
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.000628338
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCTCGTCGG TTGGGTATCG CCTAAGCATG CGCACGCAAC GCGAAGACAC ATGCGCGTGG 
CGCTGTGTGC GAACAAGAAC GTCAAATTGG CGACGATCAA CGTCCACAAG ACTCTGCATA
AACCGAAAGC AGCTCCGCCC TCGGAAGTGT TTCGAGCCCC GATTTGGACG ACATGGGCGA
AGATGAAGAC GAACGTTTCG CAAGAAAAGG TTTTGAGCTT CGCGCAAGAA ATTCTTGCGA
ACGGTATGAG CGCCAGCGTC ATCGAGATCG ATGACAAGTG GCAGTGTGGG TACGGTGATC
TCGATTTTGA CGCCACAAAG TTCCCAGATC CGAGTTCGAT GGTGGACGAG CTTCACGCCA
TGGGCTTCAA AGTGACGGTG TGGGTCATGC CGTTCATCGC CGAAGATACA ATGGCGTACA
GAGAAGGGAA GGACAAGGGT TACTTTGTCA ATTCGAACAC GCGAAATGGT TTCTTCAGGT
GGTGGCAAAC GCCGCCAGTC GTCGCGTTAG ACGTCACAAA CCCGGAGGCG GTTGATTGGT
TTGTATCTCG GTTGAAGCGT CTGCAAGAAA AGCACGGTAT CGACGGCTTC AAGTTTGACG
CCGGTGAACC ATGCTTTTTG CCGCGAAGAT TCATCACACA CACACCTCTT TCGCACCCAT
CAGAGTACAC GAGAGCGTGG GTGAACAACG TCGCTTCAAA GTTCGAACTT GCAGAAGTTC
GAAGCGGTCA TAACAGCACA GGGAATTCTT CCCTCGTCCG CATGGGCGAT AGATTCTCCG
ACTGGGGCAT TGAGAACGGG CTAGGGTCGA TTATTCCCGC GCTGCTTACA TCTGGCGTGC
TTGGGTACCC GTTTTGTTTG CCAGACATCA TCGGTGGAAA CGCTTATTTT GGCAAACACC
CGGACGAAGA GCTCCTCGTG AGGTGGGCGC AAGCCAACGC GCTGATGCCG GCGATGCAGT
TTTCCCTCAC TCCTTGGGCC GCAGGTAGCA TGGCGAAAGA CTTATGCATC TCCGCATTGG
AGATGCGCGA TCAGTTCGTG GAGACCCTCA TCGATCACAG CGAACGCGCG GTCGAAACGC
TCGAACCCAT CTGTCGTCCG ATGTGGTGGC TCGATCCCGA GGATAGCGAA ACGTTCCGCA
TAGGAGATCA GTTCGCGCTC GGCGAAGATA TCATCGTCGC CCCCGTCACC ACGCGAGGCG
CGAATGAGAG AGCGATTTAT TTGACCGAGG GTCGATGGCG CGATTTATCT AATGGCAAGG
TCCACCAGGG TCGGCGTTGG ATGCGCGATT TCTCCGCCCC GATCGGCGCG CTGCCCATTT
TCATTCGCGA AAAGTCGTCG TAACGTCGAA GATT
 
Protein sequence
MRVALCANKN VKLATINVHK TLHKPKAAPP SEVFRAPIWT TWAKMKTNVS QEKVLSFAQE 
ILANGMSASV IEIDDKWQCG YGDLDFDATK FPDPSSMVDE LHAMGFKVTV WVMPFIAEDT
MAYREGKDKG YFVNSNTRNG FFRWWQTPPV VALDVTNPEA VDWFVSRLKR LQEKHGIDGF
KFDAGEPCFL PRRFITHTPL SHPSEYTRAW VNNVASKFEL AEVRSGHNST GNSSLVRMGD
RFSDWGIENG LGSIIPALLT SGVLGYPFCL PDIIGGNAYF GKHPDEELLV RWAQANALMP
AMQFSLTPWA AGSMAKDLCI SALEMRDQFV ETLIDHSERA VETLEPICRP MWWLDPEDSE
TFRIGDQFAL GEDIIVAPVT TRGANERAIY LTEGRWRDLS NGKVHQGRRW MRDFSAPIGA
LPIFIREKSS
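
The length and GC content fields above can be cross-checked against the coordinates and the sequence blocks. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the sequences are laid out in space-separated groups of ten as shown; the function names are hypothetical and not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the record above.
print(inclusive_length(438729, 440082))  # 1354

# First line of the gene sequence block: six groups of ten bases.
first_line = "CGCTCGTCGG TTGGGTATCG CCTAAGCATG CGCACGCAAC GCGAAGACAC ATGCGCGTGG"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence block through clean_sequence and then gc_fraction should give a value that can be compared with the reported GC content, and the same cleaning step applied to the protein block gives a length to compare with the reported protein length.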