Gene P9303_17291 details

Gene Information

Locus tag: P9303_17291
Symbol:
ID: 4777993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1512191
End bp: 1513300
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 55%
IMG OID: 640087236
Product: putative oxidoreductase
Protein accession: YP_001017736
Protein GI: 124023429
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
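
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS is complete with a single terminal stop codon. The function names `span_length` and `expected_protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 1512191
end_bp = 1513300
gene_length_bp = 1110
protein_length_aa = 369


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1110
print(expected_protein_length(gene_length_bp))  # 369
```

Under these assumptions both values agree with the Gene Length and Protein Length fields: 1110 bp is 370 codons, of which the last is the stop codon.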


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCAA TCCGCCCTCC AATTGGTGTT GCTATTGCCG GCCTTGGCTT CGGTGAAAGC 
GTTCATCTTC CGGCACTAAA AGCCAACCCA GACCTACAGC CTGTTGCCCT ATGGCATCCA
CGCAGAGAGC GACTAGAAAA AGCCTCTAAC CAGCATGAAC TTGTCGGCTA TAGCGACTGG
TCTGCCCTAT TGACGGATCC AAAGATCGAG GCCGTGATTC TGGCAACACC ACCAGGCCCG
CGTTTTGAAT TGGCGCTCGA AGCACTGAAA GCTGGAAAAC ATTTGCTTCT GGAGAAACCC
GTGGCGTTGC ACGCCGATCA AGTCGCTGAA CTGCAACGAC TGGCCATCAA GAAAAGATTG
AGCGTGGCCG TGGACTTTGA ATACCGTGCT GTGCCGTTGT TTATGCAAGC CAAGCGTTTG
CTTGATCAAG GCATCGTGGG AACACCATGG CTGGTGAAAC TTGACTGGTT AATGAGTAGT
CGCGCCAATG CTTCAAGACC ATGGAACTGG TATTCCCAAT CCGAGGCAGG TGGCGGCGTG
ATTGGGGCGC TAGGTACTCA TGCCATCGAC ATGCTGCATT GGTTGTGTGG GCCAACTCGC
CAGGTGAGTG CTCTGCTATC CACCTCGATT CAAACAAGGC CTGATCCAAG TAGTGGCGAT
CCGTGTGATG TGAGCAGTGA AGACGTCACC TTGGCTCAGT TAAAGCTGGG TGGAAATGAG
AGGCCTGAAA TTCCTGCACA GGTGAGCCTC ACGGCAATTG CCCTCCAGGG AAGAGGCTGC
TGGCTGGAGA TCTACGGCAG CAACGGCAGC CTCTTGCTTG GCAGTGACAA CCAGAAAGAC
TATGTGCATG GCTTTGGCCT CTGGGCAGCC GCTGCCGGTG AGCCACTACG CAGCATTAGC
GCTGATTTGG ATCTAGCATT CCCAACCACA TGGACTGATG GTCGCATCGC CCCTGTGGCG
AGGCTTCAAG GCTGGTGGGC CGAGAGCATG CGCAGTGGAC AGCCAATGCT GCCAGGCCTT
GCGGAAGGAT GGGCCAGCCA ACAGGTATGC GACAAAATAA GGGATTCGGC TAGATCAGGA
CAGCGACTTG AAATCCAATC GACACTCTGA
 
Protein sequence
MNPIRPPIGV AIAGLGFGES VHLPALKANP DLQPVALWHP RRERLEKASN QHELVGYSDW 
SALLTDPKIE AVILATPPGP RFELALEALK AGKHLLLEKP VALHADQVAE LQRLAIKKRL
SVAVDFEYRA VPLFMQAKRL LDQGIVGTPW LVKLDWLMSS RANASRPWNW YSQSEAGGGV
IGALGTHAID MLHWLCGPTR QVSALLSTSI QTRPDPSSGD PCDVSSEDVT LAQLKLGGNE
RPEIPAQVSL TAIALQGRGC WLEIYGSNGS LLLGSDNQKD YVHGFGLWAA AAGEPLRSIS
ADLDLAFPTT WTDGRIAPVA RLQGWWAESM RSGQPMLPGL AEGWASQQVC DKIRDSARSG
QRLEIQSTL
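
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline that produced the record: it assumes the gene sequence is given in the coding orientation (as the leading ATG suggests) and uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; stop codons are written as '*'."""
    dna = clean(seq)
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 bases of the gene sequence above.
print(translate("ATGAATCCAA TCCGCCCTCC AATTGGTGTT"))  # MNPIRPPIGV
print(translate("CAGCGACTTG AAATCCAATC GACACTCTGA"))  # QRLEIQSTL*
```

The two translated fragments match the start and end of the protein sequence listed above, with the final TGA read as the stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.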