Gene OSTLU_43184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43184 
Symbol 
ID5005533 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp91064 
End bp94880 
Gene Length3817 bp 
Protein Length449 aa 
Translation table 
GC content65% 
IMG OID640420954 
Productpredicted protein 
Protein accessionXP_001421384 
Protein GI145354210 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.236516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.074058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCCTCCCC CGCCGCCTCC GCCGCCTCCT CGTCGCCTCG CGTCGCCTCG TCGATTTCTC 
GCAGCGCCGC GGCTTTGGAC TTCAAGATCC CCGGCCGCGT CACCGGCGGC GCCGCGCTCT
TGCGCCCCGT CCGCTTCACC CGCGCCGAGC GCGCGTCGAT CAGCGGTTCC CCGTCGTCGT
ACGACATCCG CGTCGTCGCA TCGCGCGTCG CGCGCGCGGT TTCGTCGTCC GATGCCGACG
TCGCGCGCCG ACGAATCCGC GTCGCGCGTC CTCCTCTCGC GTTGCGTCGC GCGGGGCGTC
GCGCGGCGCG CGGTGACGCG CGACGAACGC GCGCGCTCGC CGTCGCCATG CGCCGCGCGA
CGCGACGCGC GACGCAGATT TCGAGCGCCG ATGTGCGGCG CATCCATCGC CTTCGCGCGC
GCGGACATCG TCGCGCGTCA CGTCGCGCCG CGCGTCGCGT CGCGCGTCGA GACGCGACGC
GCAGCGACGA TGTCCGCGCC GCGCGTGTAC GTGGGAAAGA TCCCAATCGC GCTCGCGGAC
GACGCGATCG TCGAGCGCGC GCTCGAAATC TGCGGCGCGC TGAAAACGTG GAAACGCGTC
CGCGATCCCG CGACGCGCGC GCCGAAGCGG TTCGGATTCG CGACGTTCGA GACGATCGAG
GGCGCGGTGC GGTGCGCGCG CGCGATCGAT GGGCTGCGCG TGAGCGAGGC GGACGAGGCG
ATGACGTGCG CGGCGAACGC GGCGACGAGG GCGGCGACGA CGGCGTACCT GAGACGCGCG
GGAGAGCGCG CGCTGGACGC GGACGGGGAC GCGAGGCGAC GAAGAGAGAT CGCGAGGGTG
CTCGGGCGAG GGATGGAGGG GGAGGAGGTG GAGATCGGGG AGATTTCGGA CGACGAGGCG
ACGGCGAAGG CGAAGGCGGG AAAGGCGGCG AAGGCGGCGA ACGCGAAGGC GGAGGTCGTC
GTGCGTGGGT TACCGCCGCG GTCGGTGGAC GAGGGGAAGG CGCGAGAGTC GACGCGCGGC
GCGGCGGCGA GGGAGGGTGG GCGGAGGTCG ACGTCGAGGG GGCGCTCGCC GGCGAACGCT
TCCGACGCCG GGAGGTTCGA GAGCGGGACG AGCGCGCCGG TGACGAACTC CGCCACCGCG
CGGTTCGTGC GCGGTGGAAG ACACGAATAC GAAGCGACCG AGCGCGCCGA GCGGTTGTTT
CGCGAGCGGG AGAAGGTGAT GGATGATTTA GTCGGCGCCA ACGTCCGCGA ACGCGCGAGG
CGAGCGAAAA TAGAAAAGGA AAAGAAGATG GAACGTCGCG CGGCGATCAA GCGCGACTTA
GCGAGCGATA GCGAAGAGGA CGGCATCCAA ACGCCGCTTT GGGAAAAGTC CGACCGCGAG
CGGGCGCGAC GACGACGGTT TCGAGATTTA GAAATCGAGG ACGACGAACG AGACCGTCGC
GATGAGGAAG AGGAGCTCGA GCGACGCGAT ACTCGCGCGA CGGTTCGCGC GGCGGCGCGT
TCGCGTTCGA CGAGCGATGA TGTCCACGCA GAGGCGTTTC CGATTTCAAA AGCTCGAAAG
TCGACGCACG AGCCCGAGCG CGTTCGAGGC TTTGCCGAAC CGCGCTCGAG CGATTTGGTG
TCGGTGCCGC CGCCACCGCA GCAGCAGCAG CAGGCAAAGA CGTCTTTCGG CTTGACCGCA
CCGAAGCGAC CGACCGGCGG TCTATCCGCC GCGTTCTCCG CATCGCACGC GAAACCACCT
CCGGACGTCT TCGAAGACGC CGCCCGCGAC GTCGATCGCC GTCGCGCCGA GCGACGACCC
GTCGACGTCG CCGCCATCAT CGCCACCGTT CCCACCGCTC GCGACGACAT CTTCGCCCAT
CCCATGGATT GGTCCACTTA CACCACCGCC GAAATCGACG CCGTCGCCTC CAAGTGGATC
TCCAAGAAGC TCACCGACCT CCTCGGCGAA TCCGAGCCCG CTCTCGCGCG CTTCGTCCTC
GAAAAGCTCG ACGCTCGCGT TTCCCCTCTC GACCTCATCG TCGATCTCGA CCCGGTTTTA
GACGCCGAGT GCGAGCCTTT CGTCATTTCT CTTTGGCGTT TGCTCATTTT CGAGATCAAC
AAGGCGACTA TGTCGAGCTG ATTCCGTCGT CCTTTCGTCG GCGCGGCGTT TGTTTGAAGT
GAAATTTCAC GCCGCACCAA ACCTAGTTGA CTGTACGGCG TCATCGCGCG ACGCGCGCGC
GCTCGGACGC CATGCTCGCC TCCGCGACGC GCCAGCGGCT GACAAAAACC CTCACCCTTC
GCGCCGTCGC GCTCGCGGAC GCCTCCGCGG CGCACGGCGT CGATGGCGCG CGCGAAACGA
CGACGAAACC GACGATCCAG GGTCATCGCA CCGTCGCCGC GCGAACGCGA CCGCATCGCG
CGTACTCGAG CGAGCGCGCG CACGGTGATT TGAAGGACCA AGATCGCATC TTCACCAACC
TGTACGGGAA GCACGATCCG TTCCTGAAGG GCGCGATGAA GCGCGGGGAC TGGCACAACA
CGAAGTCGCT CGTGGAACTC GGGGCGGATT GGATAATAAG CGAGATGAAG GCGAGCGGAT
TGCGCGGACG CGGCGGCGCG GGGTTCCCGA GCGGGCTGAA GTGGTCGTTC ATGCCGAGGG
TGAGCGATGG ACGGCCGAAT TATCTCGTGG TGAACGCGGA TGAGAGCGAA CCGGGGACGT
GCAAGGACAG GGAAATCATG CGACACGACC CGCACAAGTT GTTGGAGGGA TGCTTGATCG
CGGGGACGGC GATGCGGGCG CGTGCGGCGT ACATTTACAT TCGTGGTGAG TATGTGAACG
AACGGTTGGC GCTGGAACGG GCGTTGGCGG AGTGTTACGC GGCGGGATAT TTGGGGAAGA
ATGCGTGCGG AAGCGGGATG GATTTCGACG TCAACATTCA CTACGGCGCT GGGGCGTATA
TTTGCGGCGA AGAGACGGCT TTGATCGAGT CTTTAGAGGG TAAACAGGGC AAACCGAGGT
TGAAACCGCC GTTCCCCGCC AACGTCGGCC TGTACGGATG CCCGACGACG GTGACGAATG
TCGAAACCGT CGCCGTGGCG CCGACGATTT TGCGCCGAGG CGCGGATTGG TTCGCCTCCT
TCGGTCGAAA GAATAACGCC GGTACGAAGT TGTTTTGCAT CTCTGGGCAC GTCAACAACC
CGTGCACGGT GGAGGAAGAG ATGTCTATTC CTTTGCGAGA CCTCATCGAG AAGCACTGCG
GCGGCGTGCG AGGCGGTTGG GACAACCTCT TAGCCGTCAT CCCCGGCGGA TCCTCAGTAC
CGCTCATTCC CAAGAACGTG TGCGAAGACG TGTTGATGGA TTTTGACTCT TTGAAAGAAG
CCCAGAGCGG TCTCGGGACC GCGGCGGTGA TCGTCATGGA CAAATCCACC GACGTCATCG
ACGCCATCGC GCGTTTGAGT TACTTTTACA AGCACGAATC GTGCGGCCAG TGCACGCCCT
GCCGCGAAGG GACGTCGTGG TTGTACAACA TCATGCAACG CATGGTCAAG GGCGACGCGC
GTCTCGAAGA GATTGACACA TTACAGGAGC TCACGAAACA AATCGAAGGG CACACTATTT
GCGTACGTCC GTCGACGATT CCTCCGCGCG TTCATTTTTT TTTCCCAAAA AAGGATCGAC
TGACTGACCT CGTGCGCCTC TTTGCGTCGT GATTCCTCAG GCTCTCGGCG ATGCCGCTGC
GTGGCCGATA CAAGGAGTGA TTCGCCATTT CCGCCCTCTC CTCGAAGAAA GAATCTCTCA
GTTTTCTTCT TCCCCGTCGG AAAAAATCAC GGCGTAA
 
Protein sequence
PPPPPPPPPP RRLAERAHGD LKDQDRIFTN LYGKHDPFLK GAMKRGDWHN TKSLVELGAD 
WIISEMKASG LRGRGGAGFP SGLKWSFMPR VSDGRPNYLV VNADESEPGT CKDREIMRHD
PHKLLEGCLI AGTAMRARAA YIYIRGEYVN ERLALERALA ECYAAGYLGK NACGSGMDFD
VNIHYGAGAY ICGEETALIE SLEGKQGKPR LKPPFPANVG LYGCPTTVTN VETVAVAPTI
LRRGADWFAS FGRKNNAGTK LFCISGHVNN PCTVEEEMSI PLRDLIEKHC GGVRGGWDNL
LAVIPGGSSV PLIPKNVCED VLMDFDSLKE AQSGLGTAAV IVMDKSTDVI DAIARLSYFY
KHESCGQCTP CREGTSWLYN IMQRMVKGDA RLEEIDTLQE LTKQIEGHTI CALGDAAAWP
IQGVIRHFRP LLEERISQFS SSPSEKITA