Gene OSTLU_46735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46735 
Symbol 
ID5003984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp230998 
End bp232491 
Gene Length1494 bp 
Protein Length458 aa 
Translation table 
GC content58% 
IMG OID640419405 
Productpredicted protein 
Protein accessionXP_001420117 
Protein GI145351507 
COG category[C] Energy production and conversion 
COG ID[COG0114] Fumarase 
TIGRFAM ID[TIGR00979] fumarate hydratase, class II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.344273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.29017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCGCTCGCGC CGACGACGAC GGCGAGCGGT GAGGGCGAGC GCGGGTGGCG CGTCGAGACG 
GACACGATGG GAGAGGTGCG CGTGCCCGCG GATAAGCTGT GGGGGGCGCA GACGCAGCGG
TCGCTGCAGA ATTTTAGGAT CGGGGGGGAG AAGATGCCGG TGGCGATCGT GCGAAGCTTG
GCCATCGTCA AGTACGCGGC GGCGACGGTG AACGAACGGG CGGGGGGACT GGAGCCGAGA
TTGGCGGATG CGATTCGAAA GGCGGCGCGG GAGGTGGTGG AGGGACGGTT GGATGATCAT
TTTCCGCTCG CCGTGTGGCA AACGGGAAGT GGGACGCAGA CGAACATGAA TCTGAACGAG
GTCATCTCGA ATTACGCGAA TTCGCGCGTG CTCGGCGGCG CCGTCGGGAC CAAGTCTCCG
ATTCACCCGA ACGATCACGT CAACAAATCG CAGTCGAGTA ACGATACGTT TCCGACCGCT
ATGTCCATCG CCACCGCGCA CGAAGTGCAA GAGCGGCTGA TTCCGGCGTT GAAAATGCTG
CAGGAGGCGT TGCACCAGAA AGTGCTCGCG TGGGGAGGCA TAGTGAAAAT CGGCAGAACG
CATCTTCAAG ACGCCGTCCC GATCACGCTC GCGCAAGAAT TCAGTGGGTA CGAGCAACAG
TGCAAGAATT CTCTCGTGCG CGCCAAAGGT GCGCTCATTC ACTTGCTCGA GCTCGCCATC
GGCGGCACCG CCGTCGGCAC GGGGCTAAAC TCGCCACCAG GTTTCGGCGA GGCGATGGCG
GCGGAAATCT CCCTATTAAC CGGTTTGCCG TTCGTGAGCG CGCCAAATAA GTTTGAGGCG
CTCGCCGCAC ACGACGCGCA GTCCGCGCTG TCCGGTCTGT TGAAGACTAT CGCGATTTCG
TTGTTGAAAA TTGCCAACGA TATTCGTCTT CTCGGCTCTG GACCTCGCGC CGGACTCGGC
GAACTTCAGC TGCCTGAAAA CGAGCCCGGA AGCAGTATCA TGCCTGGTAA GGTGAACCCG
ACGCAGTGCG AAAGTCTGAT GCAGGTGTGC GCGCAAGTCA TCGGTAACGA TCTCGCGGTC
ACCATCGGCG GTTCCGCGAG CTCCCACTTT GAGCTCAACG TAGCTAAGCC TCTGATCGCT
CACAACAACC TGAACTCCAT CGCGCTCCTT TCTGATTCGG TCATCAGCTT CACGCAAAAC
TGCGTCGTTG GTATCGAACC AAACATCGAG CGCATCGACG CGCTCATGCG AAGCAGCTTA
ATGCTCGTCA CCAGCTTGCT GCCGAAGATC GGGTACGACA ACGCCGCAAA AATTTCGAAA
AAAGCACACG CCGAAGGTTT AACTTTACGC GAAGCTGGCA TCGCGCTCGG TTTACTGACG
AACGAACAGT TCGATGAATG GATAAAGCCT GAAGAAATGA CGCGCCCACA GTCAAAACTG
TGAACAGTTT ATTATAATCG AGCGCGGCGC CGAAAGTTGT TGTAAACAAA ATCT
 
Protein sequence
MGEVRVPADK LWGAQTQRSL QNFRIGGEKM PVAIVRSLAI VKYAAATVNE RAGGLEPRLA 
DAIRKAAREV VEGRLDDHFP LAVWQTGSGT QTNMNLNEVI SNYANSRVLG GAVGTKSPIH
PNDHVNKSQS SNDTFPTAMS IATAHEVQER LIPALKMLQE ALHQKVLAWG GIVKIGRTHL
QDAVPITLAQ EFSGYEQQCK NSLVRAKGAL IHLLELAIGG TAVGTGLNSP PGFGEAMAAE
ISLLTGLPFV SAPNKFEALA AHDAQSALSG LLKTIAISLL KIANDIRLLG SGPRAGLGEL
QLPENEPGSS IMPGKVNPTQ CESLMQVCAQ VIGNDLAVTI GGSASSHFEL NVAKPLIAHN
NLNSIALLSD SVISFTQNCV VGIEPNIERI DALMRSSLML VTSLLPKIGY DNAAKISKKA
HAEGLTLREA GIALGLLTNE QFDEWIKPEE MTRPQSKL