Gene OSTLU_119606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119606 
SymbolHcs 
ID5000156 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp636939 
End bp638550 
Gene Length1612 bp 
Protein Length339 aa 
Translation table 
GC content44% 
IMG OID640415577 
Productbiotin holocarboxylase synthetase-like protein 
Protein accessionXP_001416447 
Protein GI145343692 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGCGG TGGAGCTGTG CGTTCGGAAT CCTTCGGTAC AATTTCTTCG AAGCCACGCG 
CTTTATCGTT GAACTTCGTT TAGATGCTCC CAAAGGTGAA AGAGCGGCTT GGTCAAAAGC
CAATCGCCGC GAGGTATTGA AATATGCGTG TTCGAAACAT CACATATTCG TTAACGCATG
CTTTTGTAGT CGAGTACGTG TGCTCGAGAG CCCATCAAAT CTAGCAGGCA CTTTCAACGA
GTTAGAATTC TTTTCCAAGC TTCAAACTCG ACACCTTGGG GCTATTTTGC TGCGATCGGG
AAAGTTGGGT AGCACGAGCG ACTTTTTGCT CAAGTGCGCG TCTCAATTTA CAGTTTTGTG
CTTTTTCTCA CTTGATAATC AGGGAGTTTG AGTCTTTCCC GTCTGGGACT GTATGCGTCA
GTGATGAGCA GACTGCGGGC CAAGGTCGAG GGTATTTAGA CGCCGTTCGT TCATCGTACC
GAATCCTCAC GCATATTCAG CACAAATGTC TGGCAGTCAC CGTACGGTTG CTTGACGTTT
TCCTTTACAT GTGAAACTAG TTTGTGCGTG CAAATAAATC AAAGTATTTA TGTGTGTGAC
TTGGAACGAC AGGGCTGCTG CCGAAATCCC TCTTCTACAG TACATATCGA CATTAGCGGT
CGTTAAGGTT TGTGTGCACT CAGAACATTC CTTCTTGTTT GATCACTGCT GCGCTTAGGC
GATTGAGGAG TCTTGTACTG CTGCTGGTTG CTCACAGTAC AAGTCGCTGG GTATACGGAT
CAAGTGGCCA AATGATATTT ATTATAAGTT GAACAAGGTA TAGGGACACT TAGACCTTCT
TCTATATACT TTAAGCTTCG AACGTAGATT GGTGGTGTTC TATGCAAAGC CATTTATCGT
GAAAATTCCT TCAGCGTAGT GATAGGTATC GGCCTCAATC TTGACAACTC TGTAAGTTTT
ATTTCTTTTA TTTCGACAAA AAGAAGAATA TCTTATCGGT CGCTAGTCGC CTAGTGTGTG
CCTTAACAGC CTGATTAATG AGAACGCTCT ATTCTTCTCG ACTTTTGATG GCACAAAGCG
TGGGGGTGTT CAAGAACCAT CCACTGAGAA GTTCAGCGTA GTGCGTTGCT AACGTATGCT
TTGGGAGTTT ACTCATACTG TGTAGAAATT GAAGAGAGAG GCGCTCGTTC CTAAAATTCT
GGGTCAGTCC GCTCCAAACG AGGAAATATC TTTGCTTACC CCGGTTTAGC TCACTTTGAA
AGTCTACATG ACAAATTGCA AAGAAACGGT TTCTGCGCCA TACGGGTATG AAATGTAGAA
TGTCCAGGAC TCTTGCTTAT GTATTTGTAG GCAGAATACA CGGCTCACTG GCTACACGAT
GAAGCACAAC TTTCGATTTT TAACTCAGAT TCTTCGGGCG CGATGATTTC GGTCACTGTC
ACTGGTCTCA CAGATGCAGG TTTTCTGTTT GCAGGTGAAA TGTGACAACA ACCTAGAACT
CACAAATCTT ACTGCGTAAA CAGTCGATCG TCAGGGAAAC ACGTTCGAAC TCCATCCAGA
TGGTAATAGT TTAGATATCC TTACAGGGAT GATTGGGAAA AAACTCTCTT AG
 
Protein sequence
MHAVELCVRN PSMLPKVKER LGQKPIAASR VRVLESPSNL AGTFNELEFF SKLQTRHLGA 
ILLRSGKLGS TSDFLLKEFE SFPSGTVCVS DEQTAGQGRG TNVWQSPYGC LTFSFTCETS
LAAAEIPLLQ YISTLAVVKA IEESCTAAGC SQYKSLGIRI KWPNDIYYKL NKIGGVLCKA
IYRENSFSVV IGIGLNLDNS SPSVCLNSLI NENALFFSTF DGTKRGGVQE PSTEKFSVKL
KREALVPKIL AHFESLHDKL QRNGFCAIRA EYTAHWLHDE AQLSIFNSDS SGAMISVTVT
GLTDAGFLFA VDRQGNTFEL HPDGNSLDIL TGMIGKKLS