Gene OSTLU_28333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28333 
Symbol 
ID5006297 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp234953 
End bp236566 
Gene Length1614 bp 
Protein Length360 aa 
Translation table 
GC content63% 
IMG OID640421718 
Productpredicted protein 
Protein accessionXP_001422240 
Protein GI145356020 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0642636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.462802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACGCGCGCG ATGGCGCTCA CGGTGCGTCT CGCGCGCGAC GCGCGCGCGA CAAAAATATT 
CGCGCGCTCG ATCGCGTCGC CGTCGCGCGC GCGCGTAGGG ACGCGCGCGC GCGAACGGCG
GGAAAAACGC GACGCGCGGC GATCGACGCG CGGCGATCGC GGGTCGACGG CGATCGCTTC
GACGGGACGC GACGACGAAG GCGAACGCGC GCGAGACGCG CGCGGAGGAC TGACGAATGG
AATGAAATCG TGCGATCGCT TCCACGAGCG CAGATGAACG TTTCCGCGAA GGTGCGCCGA
GCGATGCGCG ATGGACCGAG CGGGCGGGCG ATCGGGCTTT GGAACCCGGC GAGACGCGAG
GCGAGCGCGA CGCGCGCGAC GAGACGCGCG CGGAGGCGCG AGGGGGATGA ATGCGAAGGC
GCGCGATGCG ATGCGCGCGG AGACTGACGA TGTGACGCGC GTGATGACGC GGTGCGATAG
GTTTCCGCGT TCGCCGGCGC GAAGATCAAC GCGGCGCGCC AAACCAAGGC GCGCCGCGCG
ACGGCGGTGG TGCGCGCCGA AGGCACGGAC TACGGTCTCG GCTTGCAGTG CTCGCCGACG
GCGAACAAGA ACATCGACCC GAAGGGACGC GCGAAGGTGC CGCTCGAGCT CGAGGACATG
CCGCTTCCGT TGAACACGTT CAAGAACAAG GAACCGTTCA CGGGTAAGGT GCGCTCCGTC
GAGCGCATCG TCGGCCCGAA CGCGACTGGC GAAACGTGCC ACATCATCAT CGAACACGGT
GGTAAGATGC CGTTCTGGGA AGGTCAATCG TACGGTGTCA TCCCGCCGGG TACCAAGGTG
AACTCCAAGG GTAAGGAAGT GCCGCACGGC GTGCGCTTGT ACTCCATCGC GTCGTCCCGT
TACGGCGACT CCTACGACGG CCAAACCGCG ACCTTGTGCG TTCGCCGTGC GACGTACTGG
GATCCGGAAA TGAACGCCGA AGATCCGGCC AAGAAGGGCA TCTGCTCCAA CTTCTTGTGC
GACGCCAAGC CGGGTGCCGA AGTCATGATG ACTGGCCCGA CTGGTCAAGT CATGCTCTTG
CCGAAGGACC CGGCGACGCC GGTCATCATG GTCGCCACCG GTACCGGTAT CGCCCCGATG
CGCTCCTACA TTCGCCGATT CTTCCTCGAA GACGTCCCGA ACTGGGAATT CAAGGGTCTC
GCGTGGTTGT TCATGGGTGT CGCTAACTCT GACGCCAAGT TGTACGACGA CGAGTTCCAA
GAGTGCGCCA AGCGTTTCCC GGATCAGTTC CGCATCGACT ACGCGCTCTC CCGCGAAGAC
ACCAACAAGA ACGGTGGTAA GATGTACATC CAAGACAAGG TTGAAGAGTA CAAGGACCAA
GTGTTCCAAC TCCTCGACGG CGGCGCTCAC ATGTACTTCT GCGGTCTCAA GGGTATGATG
CCGGGCATCT TGTCCATGTT GGAAGGCGTG TGCAAGGAGA AGGGCATCAG CTACGAAGAA
TGGCTCGAAG GCCTCAAGAA GAAGGGCCAA TGGCACGTCG AGGTGTACTA AGCGCCTTAG
CCCGCGAATC GCAGCGTTTG GTTGATCCCT CAATGCGGAA ACGCGTTGAA GTTT
 
Protein sequence
MALTVSAFAG AKINAARQTK ARRATAVVRA EGTDYGLGLQ CSPTANKNID PKGRAKVPLE 
LEDMPLPLNT FKNKEPFTGK VRSVERIVGP NATGETCHII IEHGGKMPFW EGQSYGVIPP
GTKVNSKGKE VPHGVRLYSI ASSRYGDSYD GQTATLCVRR ATYWDPEMNA EDPAKKGICS
NFLCDAKPGA EVMMTGPTGQ VMLLPKDPAT PVIMVATGTG IAPMRSYIRR FFLEDVPNWE
FKGLAWLFMG VANSDAKLYD DEFQECAKRF PDQFRIDYAL SREDTNKNGG KMYIQDKVEE
YKDQVFQLLD GGAHMYFCGL KGMMPGILSM LEGVCKEKGI SYEEWLEGLK KKGQWHVEVY