Gene OSTLU_50765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50765 
Symbol 
ID5004053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp159641 
End bp161678 
Gene Length2038 bp 
Protein Length633 aa 
Translation table 
GC content61% 
IMG OID640419474 
Productpredicted protein 
Protein accessionXP_001419920 
Protein GI145351091 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0523138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.66186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCTCGGTCG CGAGCGCGCG CGCGTCGACC TCCGGACGCG AGCGAACGCG CTCGAGCGCG 
CGGCGCGTCG AGCGACGCGA GGGCGCGTCG AGCGACGCGA GGACGCGGCG CGAGGGATCG
CGGGACGACG ACGACGACGA TGGACGCGGC GCGAGGGACG GCGACGACGG CGACGACGGT
GAGGCGCGCG GTGCGGGCGC GCGGAGGACG CGCGGTGCGC GAGACGCGCG GCGCGCGGGC
GCGGGACGCG CGACGAGGGA CGCCGCGGGC GGTGGTGGAG CCGGAGGGGG CGTCGGCGGG
CGACGCGTCG GCGTCGGCGT CGGCGGGCGA CGCGTTGAAG TGGCAGGCCG AGGACGCGGC
GAGCCTGAGG GCGCATAACG AGGAAGCGAT CGCGAAATAT GCAAACTTTC CGGAGTTGGA
TAAGCCGGAC GCGCACGTGG CGCGAGATGC GGATGGGTAC TACGTCGTGA AGGAGGAGTG
GCGAAAACCG ACGAATCCCT TTGAAAAGTT GAAGCTCGCG AAGGATCCGA TGCGAGAGTT
GATCGGGATG AACGGGATCG AGGAGATGGC CAAGGCGAGC GCGGCGGATT TCAAGGCTTG
GGACGAGGCG TTGAATGACC CGGACGAGAC CGATCAACGA CCGAAGTGGG CGGGTTTGTT
TCATCGACGC AAAGGACACT ACGGGCGATA CATGATGCGA CTCAAGCTTC CGAACGGACT
CATTAACTCG ACGCAGATGC GATATTTGGC GAGCGTGATT AAAAAGTACG GCGAAGATGG
GTGCTGCGAC ATCACGACGA GACAAAACAT TCAGCTTCGT GGGGTTGAGT TGAAGGATGC
GCCCGAAATC TTGCGCAAAC TCGAAGAGCT CGGTATGTGC TCGTTGCAAA GCGGGTTGGA
CAACGTGCGC AACGCGACGG GGAACCCGCT CGCGGGTTTT GATCCGCAAG AAATCGTCGA
CACGCGACCT TACACGCTCG CCATTCAAGA TTACGTCACC GGTGGTGGGC GCGGGAATCC
AGCCATCGCT AACTTGGGGC GTAAGTGGAA CGTGTGCGTC GTCGGCAGCA GCGACTTTTT
CGAGCATCCG GACATCAACG ACTTGGCCTT CATTCCAGCG ATGAAGGATG GCAAGTTCGG
ATTCAACATG ATCGTCGGTG GCTTCATATC TTCTCAGCGC GCAGCGGAGT CGGTCTCTCT
CGATGCGTGG ATCCCTGAGA ACGAGCTTGT CGCCGCGACG CACGCCGTGT TGACGACGTT
TCGCGATTAC GGCCATCGCG GCAACCGCCA AAAGTGCCGC ATGATGTGGC TCATCGACGA
GATGGGTTTG GAAACGTTTC GCACCGAAGT TGCGTCGCGC ATGCCAACGG GAGACTTGGC
GCGCGGCGCC GAAGTGGATC TCATCGACCG AGAGTCTCCG CGACGCAGTT ACATCGGCGT
GCACGCGCAA AAGCAAGAGG GCTTGAGCTG GGTCGCCGCT GCCGTGCCGG GCGGACGTAT
GCAACCGGAA GATCTCGCGG AGATGGCTGA TCTCGCGGAC AAGTACGGCG AAGGTGAGAT
TCGACTCACC GTCGAGCAAA ACTTTATCAT TCCCCACGTG CCGAACGATA AGATTGACGC
CATTCTCCAA GAGCGTTTGT TTCAAGAGTA CACGCCGTTC CCGGGCAAAC TTGTGTCCAA
CATGGTGGCG TGCACCGGAA ATCAGTTCTG CGGATTCGCG CAAATCGAGA CGAAGCGACA
AGCGCTCGAA ATGGCGGAAC ACTTGGAAAG TTGCTTAGAA CTTTCGAAGG ACGTGCGCAT
GATTTGGACA GGTTGCCCGA ACTCTTGCGC TCCGGTGCAA GTGGCGGACA TTGGCCTCAT
GGGCGCGCAG GTGAAGAATC CGACGGGCGA GAAGGGCATG GTACCAGGGG TGAACATCTT
CATCGGTGGT ACCGTCGGGC CGAACGGCCA CTTGAAGGAA GCGCCAGAAA TCGCAAAGGT
CCCGTGCTCA GAGTTGAAGC CGGTTCTGGA ACAGATTATG ATTGAGAGGT TTGGCGCG
 
Protein sequence
MDAARGTATT ATTVRRAVRA RGGRAVRETR GARARDARRG TPRAVVEPEG ASAGDASASA 
SAGDALKWQA EDAASLRAHN EEAIAKYANF PELDKPDAHV ARDADGYYVV KEEWRKPTNP
FEKLKLAKDP MRELIGMNGI EEMAKASAAD FKAWDEALND PDETDQRPKW AGLFHRRKGH
YGRYMMRLKL PNGLINSTQM RYLASVIKKY GEDGCCDITT RQNIQLRGVE LKDAPEILRK
LEELGMCSLQ SGLDNVRNAT GNPLAGFDPQ EIVDTRPYTL AIQDYVTGGG RGNPAIANLG
RKWNVCVVGS SDFFEHPDIN DLAFIPAMKD GKFGFNMIVG GFISSQRAAE SVSLDAWIPE
NELVAATHAV LTTFRDYGHR GNRQKCRMMW LIDEMGLETF RTEVASRMPT GDLARGAEVD
LIDRESPRRS YIGVHAQKQE GLSWVAAAVP GGRMQPEDLA EMADLADKYG EGEIRLTVEQ
NFIIPHVPND KIDAILQERL FQEYTPFPGK LVSNMVACTG NQFCGFAQIE TKRQALEMAE
HLESCLELSK DVRMIWTGCP NSCAPVQVAD IGLMGAQVKN PTGEKGMVPG VNIFIGGTVG
PNGHLKEAPE IAKVPCSELK PVLEQIMIER FGA