Gene A9601_11671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_11671 
SymbolguaB 
ID4717880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp978590 
End bp979753 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content39% 
IMG OID640078882 
Productinosine 5-monophosphate dehydrogenase 
Protein accessionYP_001009558 
Protein GI123968700 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID[TIGR01304] IMP dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.83783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATATTG AACTTGGTCT AAATAAAAAA GTCAGAAGGG CTTATGGCAT TGATGAAATA 
GCTTTAGTCC CTGGTAATAG AACTCTTGAT TATGATTTAA CTGATCCTTC TTGGTCAATA
GGTGATATCA AAAGAGAAGT TCCGATCGTA GCTAGTGCTA TGGACAGTGT TGTGGATGTT
GATACGGCTG TAGAACTCAC GAAATTAGGT TGTCTAGGGG TTATTAATAT GGAGGGCATA
CAGACAAGAT ATGAAAACCC TGATGAAATA TTAAACCAAA TAGCATCAGT CGGGAAGAAT
GATTTTGTTC CGTTAATGCA GAAGATTTAC AGTGAACCAG TCAAGGAGGA ATTGATTTTA
CAAAGAATAA ATGAGGTCAA AGAAGGAGGA GGCATCGCTG CTTTTAGTGG GACTCCACAA
GCCGCTATTA GGTTTAAAGA AATACTTAAT AATTCCAAAA TAGATTTATT TTTTCTTCAA
GGAACAGTTG TTTCAACTGA ACATCTTGGT ATGGAAGGTA AGGAAACCTT AAATATTAAA
GATCTCTGCC AATCTATGAA TGCCCCAGTT GTAGCCGGAA ATTGTGTTAC TTATGAAGTT
GCAAAACTTC TCATGGACTC TGGAGTTGCA GGATTAATGG TTGGGATAGG ACCTGGAGCG
GCATGTACAT CAAGAGGAGT ATTAGGAATT GGAATCCCTC AAGCAACTGC AATTGCTGAT
TGTAGTGCGG CAAGAAATGA TTACTTTAAA GAAACTGGTC GTTATGTTCC TATTATTGGT
GATGGAGGAA TTGTGACGGG CGGAGATATC TGTAAATGTT TAGCATGTGG ATCAGATGCT
GTAATGATTG GATCCCCAGT AGCTAAATCC TCAAACGCTC CAGGTAAAGG ATTTCACTGG
GGTATGGCTA CTCCAAGTCC AGTATTGCCA AGGGGTACAA GAATTGAAGT TGGTTCTACA
GGATCCTTAG AAAGGATAAT TAAAGGCCCT GCCTTACTTG ATGATGGGAC ACATAACTTA
TTAGGAGCCA TTAGAACATC AATGAGTACT CTTGGAGCAA AAAATATTAA AGAAATGCAA
AAAGTTGAAA TAGTTATCGC ACCATCTCTT CTTACAGAGG GTAAGGTTTA TCAAAAAGCT
CAGCAGCTTG GGATGGGTAA GTAA
 
Protein sequence
MNIELGLNKK VRRAYGIDEI ALVPGNRTLD YDLTDPSWSI GDIKREVPIV ASAMDSVVDV 
DTAVELTKLG CLGVINMEGI QTRYENPDEI LNQIASVGKN DFVPLMQKIY SEPVKEELIL
QRINEVKEGG GIAAFSGTPQ AAIRFKEILN NSKIDLFFLQ GTVVSTEHLG MEGKETLNIK
DLCQSMNAPV VAGNCVTYEV AKLLMDSGVA GLMVGIGPGA ACTSRGVLGI GIPQATAIAD
CSAARNDYFK ETGRYVPIIG DGGIVTGGDI CKCLACGSDA VMIGSPVAKS SNAPGKGFHW
GMATPSPVLP RGTRIEVGST GSLERIIKGP ALLDDGTHNL LGAIRTSMST LGAKNIKEMQ
KVEIVIAPSL LTEGKVYQKA QQLGMGK