Gene P9303_20971 details


Gene Information

Locus tag: P9303_20971
Symbol: mqo
ID: 4776834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1857664
End bp: 1859154
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 52%
IMG OID: 640087605
Product: malate:quinone oxidoreductase
Protein accession: YP_001018097
Protein GI: 124023790
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID: [TIGR01320] malate:quinone-oxidoreductase
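
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1857664, 1859154)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

With the values from this record, the computed lengths agree with the listed 1491 bp and 496 aa.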


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGTCT CTGATGTTGC TGGATCCCAA TCTCGCTACG ACGCGGTGCT TGTCGGGGCT 
GGAATTATGA GTGCCACTTT GGCGGCCCTG CTGCATGAGC TCGATCCTGA GCTGCGTTTG
TTGATGGTCG AGCGTTTGCA GGCGCCGGGT CTTGAGAGCA GTGCGGCTGA AAACAATGCA
GGCACTGGTC ATGCGGCTAA TTGCGAACTG AATTACACAC CGCTTCAGCC TGATGGCAGC
GTGGCTACGG CTAAGGCTTT GGCCATTAAT ACCGCCTTTG AGCGCTCTTT GGAGTTCTGG
GCTTCGTTGA CGGAAAAAGG CAAGTTGCTA CCGCAGCAAT TTCTACATCT GGTCCCTCAT
ATCAGTGTGG TTTTTGGCGA TGCTGATTTG GCTTTCTTGC ATCAGCGCTT TCAGCAATTG
AGTGCGCTAC CTGCCTTTGC CTCCATGCAA TGGAGTACTG ATGCCGCTGA GCTTGCCGAA
TGGATGCCAT TGGTGATGGA AGGGCGAGCC AATGCAGAAT CTGTTGCTGC AACCTGCATT
AAGCGGGGTA CGGATGTGGA TTTCGGATTG CTGACAAGGG CCTATGTGAA GTCATTGCAA
GCAAGCGGAG CTTTGGAATT GAGTTGCGGC TGCGAAGTCG TTCATTTGCA CCGGCTCGGC
AAGCACCGGT GGAATCTTGA TCTCAAGCAC TCTTCTGGAA GTCGCTCTGT GCAGACACCT
TTTGTGTTTC TCGGTGCAGG AGGGGGGGCA TTGCCTTTGT TGCAGCGATC TGGCATTCCA
GAGGCAGCTG CCTATGCAGG CTTTCCAGTG AGCGGACAGT GGTTGGTCTG CTCTGAGCCA
GGTTTAACGG CAAGGCATCA CGCCAAGGTG TATGGCAAGG CGAAGGTGGG TGCTCCTCCA
ATGTCTGTGC CACATCTTGA TAGCCGTTGG ATTGATGGAT GCCGCTCGTT GCTTTTCGGG
CCTTATGCGG GTTTCAGTAG CAAATTCCTC AAGCAAGGCT CCCGCTTGGA TCTCTTGCGT
TCGGTACGGC GCAGCAATTT TCGCTCCATG TTGGAGGTGG GTTTTAAAAA CTTTGATTTA
GTCACTTATC TCCTCTCAGA GCTACAGCAG AGTGAGAAAG ATCGCTTTGA AACCCTAAAG
CAATTTCTTC CCAATGCGCA GTTGAATGAT TGGAAGCTTT CAGTTGCTGG CCAGAGAGTA
CAAATCATCA AAGGCACAGC CGAGGGGGGG CGTTTGCAGA TGGGTACAGA GGTGGTATCC
GCTGAAGATG GCTCCCTAGC TGCCTTATTA GGAGCTTCGC CTGGGGCTAG TACAGCGGTG
ACGGTCATGC TGGAAGTTTT GCAGCGTTGC TGGAGCGAGC GTATGGCAAG TGAATCTTGG
CAAGAACGAT TGCAAAAACT GTTGCCGAGT TATGGCCATG ATCCTAATTC TGATCCCTTA
CTGCTGATGC AGATGCGCAT ACGCAGCAAT GAATTACTCA GTTTTACTTG A
 
Protein sequence
MAVSDVAGSQ SRYDAVLVGA GIMSATLAAL LHELDPELRL LMVERLQAPG LESSAAENNA 
GTGHAANCEL NYTPLQPDGS VATAKALAIN TAFERSLEFW ASLTEKGKLL PQQFLHLVPH
ISVVFGDADL AFLHQRFQQL SALPAFASMQ WSTDAAELAE WMPLVMEGRA NAESVAATCI
KRGTDVDFGL LTRAYVKSLQ ASGALELSCG CEVVHLHRLG KHRWNLDLKH SSGSRSVQTP
FVFLGAGGGA LPLLQRSGIP EAAAYAGFPV SGQWLVCSEP GLTARHHAKV YGKAKVGAPP
MSVPHLDSRW IDGCRSLLFG PYAGFSSKFL KQGSRLDLLR SVRRSNFRSM LEVGFKNFDL
VTYLLSELQQ SEKDRFETLK QFLPNAQLND WKLSVAGQRV QIIKGTAEGG RLQMGTEVVS
AEDGSLAALL GASPGASTAV TVMLEVLQRC WSERMASESW QERLQKLLPS YGHDPNSDPL
LLMQMRIRSN ELLSFT