Gene OSTLU_38421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38421 
Symbol 
ID5001889 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp510322 
End bp512223 
Gene Length1902 bp 
Protein Length559 aa 
Translation table 
GC content59% 
IMG OID640417310 
Productpredicted protein 
Protein accessionXP_001418031 
Protein GI145347132 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0452022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000596343 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGGATT CGGGAGCGGC GGAGTTGACG GTGAAGCTGT GCGATGGAGC GGACGAAGAC 
CACTGGTACG GTGGGTCGCA TTTGCTGAAG CAGCTCTGGC CGCTCGAGCG AGCGAAAATC
GAAATGGGAC CGTTTTACCC GTTCGACCAC GGTCCGAACG GCGTCGGCAA CGTTCTCGGC
ACTCACTGGG TGTCGAGCGG TGGGACGTTG ATATTCGCCG ATCCAAATAG TGATTTATTA
CACGTCGGCT TGAACGCACC GGCATTCGTC CCGCGAGGTC AAGCGCCGCG GTATTTCGGT
GTCGGTATTC AAAACGCTTC CCGGCCGTCG TTGCCGCTGG AAGAGGCTCT GACGTCGGAA
CGCGGCGACG GACTACTTCG ATTGCAATCA CGGGCGAATT ACGCGGACGC GAACATGTTG
CATCCGTGGC AAAGCATCAA TACGAATGAC GCGCGAACGC GCTTGCAGCT ACGCGTCGCC
GTATCTGCGC AGGAAGACGC GAGGAGCGCG ACGAGAGTCG CCTTGGCGCA GCTTCCAAAG
CCGAGCGCCG CGCCGGACAA GTCTCTGATG TTCTATCCAA TCTGGACGAC TTGGGCGACG
TCGCACGCTG ATGTCACGCA AGAGTCAACA CTTGCAGTAG CTAAGGACAT CGCAAAAGCC
AAATCTGACT CGGGATTACC GAACGGTTCG ATCATCGAAA TCGACGACCG CTGGCAAGCG
CGATATGGCG AACTTCATTT CGATCCCGTG AAGTTTCCGG ATCCGAAAGG CATGATCAAG
GAGCTTCACG ACATGGGATT TTTAGTAACG GCGTGGGTGA TGCCGTTCTT GCAAGAGAGT
TCAGCGGCGT GCGAGGAGGC AAAGCGATTA GGGTACTTAC TTGAAGGCTC GCAACCGCCG
AACGAAGTCG TCGAGGTGCT CACGGGAGGC GTCGGTCAAG TCCTCGGGAC TACTGTGAAA
GTGTTCGTCG ATCGCTACGA CTGGCCACCC GGTCACTGGG AAGGCGGCGG TGGTGGGGGC
AACCTTGAGC CTGGACAGTT CCGATGGTGG GGCACGCAGC CCGTCCGCGG CATCGATTTC
ACCAATGACG AGGCGTGCGA ATGGTTCGTG CGTCGACTCA AGAAACTGCA AGAAGAAGTC
GGACTCGACG GTTTCAAGTT TGACGCCGGC GAGCCTTGTT TCATGCCTTA CGGCGCCCGC
CCGCACACGC CCCTGAAACA TCCACAAGAG TACAGTCAGG CATACGTGGA AAAAGTTTGT
TCTAAATTTG CCTTGAGCGA AGTGCGCGTG GCGATGGGAA CGAACAATTA CAACGGCTTA
ATTCGCATGG GCGATAAAGA TACCGTGTGG GGCGTCGATA ACGGTCTACA GAGTCTCATT
CCGAGCTTAC TCACGTCCGC AGTTATCGGG TTCCCGTTCA CGCTTCCGGA CATCATCGGC
GGCAACGCGT ATTGGAATCA AACGCCCGAT ACCGAGTTGA TGATTCGCTG GGCGCAAGTC
AGCGCGTTCA TGCCCGCGGT GCAATGGTCG ATTCCACCGT GGGAAGTGAG CTCAGTGGCG
TACGAGGCGA GTGTCAAGGT GATGCACATG CGCGAACGGC TGCTCTTACC CAAGCTCGCC
GCGCTCGCGG AGGACGCCAA AGCCTCCCTC GAACCCATCT GTCGTCCAAT GTGGTGGCTC
GACCCCAACG ACGCCCAAAC GTTCGCCATC GACGACCAGT TTGCCGTCGG TACCGACATG
ATCGTCGCCC CTGTGGTCGA AAAAGGTGCG ACATCGCGCA CGGTTTACCT CCCCGCCGGG
TCTTGGCGGC GATACGACGA ATCGAACGCC GTCGCCGGCG GTCGCCGCGT CGTCGTCGAC
GCCCCGCTCG ACGTCCTCCC CGTGTTCATC CGCGCCGACT AG
 
Protein sequence
MEDSGAAELT VKLCDGADED HWYGGSHLLK QLWPLERAKI EMGPFYPFDH GPNGVGNVLG 
THWVSSGGTL IFADPNSDLL HALTSERGDG LLRLQSRANY ADANMLHPWQ SINTNDARTR
LQLRVAVSAQ EDARSATRVA LAQLPKPSAA PDKSLMFYPI WTTWATSHAD VTQESTLAVA
KDIAKAKSDS GLPNGSIIEI DDRWQARYGE LHFDPVKFPD PKGMIKELHD MGFLVTAWVM
PFLQESSAAC EEAKRLGYLL EGSQPPNEVP GQFRWWGTQP VRGIDFTNDE ACEWFVRRLK
KLQEEVGLDG FKFDAGEPCF MPYGARPHTP LKHPQEYSQA YVEKVCSKFA LSEVRVAMGT
NNYNGLIRMG DKDTVWGVDN GLQSLIPSLL TSAVIGFPFT LPDIIGGNAY WNQTPDTELM
IRWAQVSAFM PAVQWSIPPW EVSSVAYEAS VKVMHMRERL LLPKLAALAE DAKASLEPIC
RPMWWLDPND AQTFAIDDQF AVGTDMIVAP VVEKGATSRT VYLPAGSWRR YDESNAVAGG
RRVVVDAPLD VLPVFIRAD