Gene OSTLU_33837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33837 
Symbol 
ID5000576 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp598311 
End bp599801 
Gene Length1491 bp 
Protein Length473 aa 
Translation table 
GC content59% 
IMG OID640415997 
Productpredicted protein 
Protein accessionXP_001416706 
Protein GI145344368 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.40604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCGG GCGCGGTGAA GATCGCGCCC GACGCTCTGA ACGACTTCAT CGCCCCCTCG 
CAGGACTGCG TCGTCGCTCT CGACGGTGGT AAGCTCAAAC TCGACGACGA TGGTGCCGTC
GGCGCGTCTT CCGAAGACGC CTTCTCCACC GGCGAAGTCG CCCTGCGTCG ACGTAAACCG
CGCGAAGACG ACGCCATGGC CGTGGATGCC GAGCCAACGT CGACTTTCAC ACCGACGATG
ACGCAGGGCG ACGCGCTGAA GGTGTCGCTG AGCGATTGCT TGGCGTGCAG CGGTTGCGTG
ACGAGCGCGG AGAGCGTGCT GCTGGAACAA CAATCCGTGG ATGAGTTCGC ACAGGCGTGC
GCGCGCGCGC GGAGCGACGG AACGAGCGTC GTCGTCGCGA GCGTCAGCCC GCAGTCGTTG
ATGAGCCTGA GCGAGGCGTA TGGATTGGGA GTGGAGGAGA CGCGCGCGCG GCTTGGCGGG
CTGTTGAAGG CGGGATTCGG CGCGGCGAGG GCGTTTGATA CGTCATTTAG TCGGGATATA
GCGCTCGTGG AGACGTTTGC AGAGTTTACG GAGTGGATGC GAGACGGCGC GAGGACGCCG
ATGTTGGCGA GTGCGTGTCC GGGGTGGGTG TGTTACGCGG AGAAGACGCA CGGCGAACTC
GCGGTGCCGC ACATGGCAAC GACGAAGAGT CCGCAGCAAA TCATGGGAAG GTTTGTGAAG
AGCGCGGTCG CGCGCGAACT TGGCGTACCA GCACATAACG TGTACCACGT GAGCGTGATG
CCGTGCTACG ACAAAAAGCT CGAGGCGACT CGCGATGATT TCGAGAGCGA CGGTGTCAAG
GATGTCGACG TCGTGCTCAC GACGGGCGAG GTGGCTTTGT TATTAGAAAA GGCTGGTTTG
TGCCATTTGA GAGACGCGCC GGCAAATGAT TTTGACGCAT TCGTGAGCAC AAACGAACAA
GCACCAGAAA GTGTGTGCGC AGCGCCGGCG GTATCGGGAT CTGGGGGATA CGCCGAGTAC
GTTTTCCGGC GCGCGGCGGC GGAGTTGTTC AATGCTCCGA TAACTGGAGA GATTGACTGG
GTCAAGATGC GCAACGCGGA CATGCGTGAG GCCACACTAA CGATCAATGG TGAAGCTGTT
CTACGCGTGG CTGTCGCGTA TGGTTTCAGA AACATTCAAA ATCTTGTTCG AAGCATCAAA
TTAAAAAAGA GCAAGCACCA TTTCGTCGAG ATAATGGCGT GTCCTTCGGG ATGCTTGAAT
GGCGGCGGTC AAATCCCAGC GCGCGAGGGA ACTGCGAACA AAGAATTGAT CGACAGACTG
GATGATACGT ATAGGGAAAA CGCACGCGCA CGACCGATGG CGGATGTGTC GACGCTCTAT
CGCGAATGGA TCGGCGGAAA TCCAGGATCG TCAAACGCTC GCGAAGCGCT TCGAACGCAA
TATCACATTC GCGCAAAATC CGTCGGAGTC GTCCAACTGA ACAGTTGGTA G
 
Protein sequence
MFSGAVKIAP DALNDFIAPS QDCVVALDGV ALRRRKPRED DAMAVDAEPT STFTPTMTQG 
DALKVSLSDC LACSGCVTSA ESVLLEQQSV DEFAQACARA RSDGTSVVVA SVSPQSLMSL
SEAYGLGVEE TRARLGGLLK AGFGAARAFD TSFSRDIALV ETFAEFTEWM RDGARTPMLA
SACPGWVCYA EKTHGELAVP HMATTKSPQQ IMGRFVKSAV ARELGVPAHN VYHVSVMPCY
DKKLEATRDD FESDGVKDVD VVLTTGEVAL LLEKAGLCHL RDAPANDFDA FVSTNEQAPE
SVCAAPAVSG SGGYAEYVFR RAAAELFNAP ITGEIDWVKM RNADMREATL TINGEAVLRV
AVAYGFRNIQ NLVRSIKLKK SKHHFVEIMA CPSGCLNGGG QIPAREGTAN KELIDRLDDT
YRENARARPM ADVSTLYREW IGGNPGSSNA REALRTQYHI RAKSVGVVQL NSW