Gene P9211_09751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_09751 
SymbolleuB 
ID5730573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp867003 
End bp868085 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content38% 
IMG OID641285342 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001550860 
Protein GI159903516 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.945321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00220334 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACAT ACAATGTTGT TTTATTACCA GGCGACGGAA TAGGTCCAGA AATAATGGAT 
GTGGCAACAA AAACAATAGA TTATGTAGCT CAGAAATATA ACTTTGGAAT TGAATATGAA
CAAAAACTAA TAGGTGGTTC AGCTATAGAT AAATATAATG ATCCGTTGCC AGAAGAAACT
TTAAAAGCCT GTAAGAGCAG TGACGCTGTC TTACTTTCAG CAATAGGCAG CCCAAAATAT
GATGCTCTAC CCAGAGAAAA AAGGCCTGAG TCTGGTTTAC TTAATTTAAG ATCAGGTCTA
GGATTATTTG CAAATATTAG GCCAGTAAAA GTCTGGCCAG CACTAATTAG TGAAAGTTCT
CTAAAGCAGG AGATAGTCCA AAATGTTGAT CTTGTTGTGG TAAGAGAACT AACAGGCGGC
ATCTACTTTG GGCAACCAAA AGGAAGATTA CAAAGTGAGA ATGGTGAGAG AGCTTTTAAT
ACTATGACCT ACTCAACAAT GGAAATAGAT AGGATTGCCA GAGTTGCATT TGAACTTGCT
ACCGATAGAA AAAAGAAACT ATGTTCTATA GATAAAGCCA ATGTTTTAGA TGTAAGCCAA
TTATGGAGAG AAAGAGTAAT AGAACTAAGC TACAACTTTC CAAAAGTTGA ATTAAATCAT
TTATATGTAG ACAATGCAGC AATGCAACTT ATAAGACAAC CAGACCAATT TGATGTAATT
CTAACTGGCA ATCTGTTTGG AGATATTATT AGTGATGAAG CAGCTATGTT GACTGGTTCA
ATTGGAATGC TTCCTTCCGC ATCATTACGA CTTGAAGGTC CTGGACTTTT TGAACCAGTT
CATGGATCAG CTCCAGATAT AGCAAATAAG GATATCGCCA ACCCAATGGC CATGGTCCTT
TCAGCAGCAA TGATGTTAAG GGTTGGTCTA AAAGAGAACA ACGCAGCTGA TGACCTAGAG
AATGCGATAG ATAAGGTCCT TAAAGATGGT TATAGAACAT CAGATCTTAT GACTAATGGC
AAAAAGGTGC TTGGATGCAG GGAAATGGGG GAGCAAATTC TTATGAGTCT TGCAGCTGCA
TAA
 
Protein sequence
MKTYNVVLLP GDGIGPEIMD VATKTIDYVA QKYNFGIEYE QKLIGGSAID KYNDPLPEET 
LKACKSSDAV LLSAIGSPKY DALPREKRPE SGLLNLRSGL GLFANIRPVK VWPALISESS
LKQEIVQNVD LVVVRELTGG IYFGQPKGRL QSENGERAFN TMTYSTMEID RIARVAFELA
TDRKKKLCSI DKANVLDVSQ LWRERVIELS YNFPKVELNH LYVDNAAMQL IRQPDQFDVI
LTGNLFGDII SDEAAMLTGS IGMLPSASLR LEGPGLFEPV HGSAPDIANK DIANPMAMVL
SAAMMLRVGL KENNAADDLE NAIDKVLKDG YRTSDLMTNG KKVLGCREMG EQILMSLAAA