Gene A9601_02951 details


Gene Information

Locus tag: A9601_02951
Symbol:
ID: 4716981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 272134
End bp: 273138
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 36%
IMG OID: 640077996
Product: putative oxidoreductase
Protein accession: YP_001008690
Protein GI: 123967832
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
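
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (the record does not state this, but the numbers are consistent with it). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 272134
END_BP = 273138
GENE_LENGTH_BP = 1005
PROTEIN_LENGTH_AA = 334


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1005
print(expected_protein_length(GENE_LENGTH_BP))  # 334
```

Both results agree with the Gene Length and Protein Length fields reported in the record.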


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACCAA CCTCATCACC CGTAAAGGTT GGAGTCATAG GTATAGGAAA TATGGGATGG 
CATCATGCTC GAGTACTAAG TTTACTCAAA GATGCAAATC TCATTGGAGT CGCAGATCCA
AATGAAGAGA GAGGCAAATT AGCTATTGAA CAATTTCAAT GTGAATGGTT CAAAGATTAT
AAGGACCTAA TTCCAAAAGT TGATGCTATC TGTATCGCCG TCCCAACACT ACTTCATCAA
AAAGTAGGAC TAGATTGTCT TAAGAGAGGA GCTAACGTTC TCATTGAAAA ACCAATTGCA
GCTAACGAGT TGGAAGCAAA ATCTTTGATA GAGGCCGCTA ATGAGAGTAA CTGTCTATTA
CAAGTTGGGC ATATTGAAAG ATTTAATCCT GCTTTCAGAG AATTAAATAA AATAGTAAAT
AATGAAGAAA TTGTTGTTTT AGAAGCAAGG AGGCACAGTC CTCATGCAGA CAGAGCAAAT
GATGTATCTG TCGTAATGGA TTTAATGATT CATGACATTG ATCTTATTTT GGAGCTTGTA
AACTCAAAAA TACAAAAATT AGCAGCAGTT GGAGGAAGAA ATAGCGAAGG ATTAATAGAT
TATGTCAATG CTACTTTAGT TTTTAAAAAT AATGTTATTG CAAGCCTAAC TGCAAGCAAA
ATGAGTCACA AAAAAATTAG AAATTTAAGT GCTCACTGCC AAAATAGCCT AGTAGAAACT
GATTTTTTAA ATCACTCTTT ACAAATCCAT CGAAAGTCTC ATGAATCATA CACAGCTGAG
CATGGAGAAT TAGTTTATAG AAATGATGGA TATGTCGAAG AAGTTAGCAC AACCTCCATT
GAACCTCTTT ATGCAGAACT GGAGCATTTT CTTAAGTGCG TTCAAGGTAA AGAGACACCT
GAGGTAGATG GTGAGCAAGC CTCAAGAGCT TTGAAAATTG CTGATTTTAT AGAGCGTGCT
GTAGAAAATT CTGGAGATGC AATTTTACTT GAAAATCCTT TCTAA
 
Protein sequence
MQPTSSPVKV GVIGIGNMGW HHARVLSLLK DANLIGVADP NEERGKLAIE QFQCEWFKDY 
KDLIPKVDAI CIAVPTLLHQ KVGLDCLKRG ANVLIEKPIA ANELEAKSLI EAANESNCLL
QVGHIERFNP AFRELNKIVN NEEIVVLEAR RHSPHADRAN DVSVVMDLMI HDIDLILELV
NSKIQKLAAV GGRNSEGLID YVNATLVFKN NVIASLTASK MSHKKIRNLS AHCQNSLVET
DFLNHSLQIH RKSHESYTAE HGELVYRNDG YVEEVSTTSI EPLYAELEHF LKCVQGKETP
EVDGEQASRA LKIADFIERA VENSGDAILL ENPF
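
The gene and protein sequences above can be cross-checked against each other. The sketch below translates a DNA string codon by codon and computes its GC fraction. It assumes the sequence is given in the coding orientation, as displayed, and it builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in its set of permitted start codons; since this gene begins with ATG, that difference does not matter here. The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "ATGCAACCAA CCTCATCACC CGTAAAGGTT GGAGTCATAG GTATAGGAAA TATGGGATGG"
print(translate(first_line))  # MQPTSSPVKVGVIGIGNMGW
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.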