Gene Pnec_1054 details


Gene Information

Locus tag: Pnec_1054
Symbol:
ID: 6182952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 917359
End bp: 918429
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 48%
IMG OID: 641671666
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_001797843
Protein GI: 171463730
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
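
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Neither is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the gene record.
START_BP = 917359
END_BP = 918429
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 918429 - 917359 + 1 gives the listed 1071 bp, and 1071 / 3 = 357 codons, one of which is the stop codon, gives the listed 356 aa.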


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.347892
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTG CAGTCCTGCC GGGCGATGGT ATCGGCCCGG AAATCGTTGC TCAAGCCGTT 
CGAGTGCTCC AAGCGCTTGG CCCAGAGTTT GATTTAGAAG AAGCTCCAGT TGGTGGCGCT
GCCTATGATG CGGCAGGCCA TCCTTCGCCG CCGGCTACTT TAGAGTTGGC TAAAAAAGCA
GATGCCATTT TGTTTGGTGC AGTTGGCGAC TGGAAATACG ATACGCTTGC ACGCGAGCTG
CGTCCAGAGC AAGCAATTCT AGGTTTGCGT AAACACCTTG AATTGTTTGC CAACTTCAGA
CCAGCGATTT GCCATCCAGA ACTCACGGCC GCATCGAGCC TCAAGCCAGA AATTATCGGC
GGCTTAGATA TTCTGATTGT GCGCGAGCTC AATGGCGATA TTTACTTTGG TCAACCGCGC
GGTATTCGTA CTTCAGAGTT GCCCTTATTT AAAGGTGCTC GCGAAGGTTT TGACACCATG
CACTATAGCG AGCCAGAAGT AGAGCGTATT GGTCGGGTTG CTTTCGAAGC AGCGCGTAAG
CGCAGTAAAA AAGTATGTAG CGTTGATAAG GCCAACGTAC TAGAGACTTC ACAGCTTTGG
CGTGAGGTGA TGATTCGTAT TGCCAAAGAA TATCCGGATG TTGAGTTATC TCATATGTAT
GTGGATAACG CTGCAATGCA ATTGGTCAAA GCACCTAAAG CATTTGATGT TGTAGTAACC
GGAAATTTAT TCGGTGACAT TCTGTCCGAC GAAGCGGCGA TGTTGACTGG CTCCATTGGT
ATGTTGCCAT CTGCCTCTTT GGATAAAAAT AATAAAGGCT TGTATGAGCC AAGTCACGGC
TCCGCGCCTG ATATTGCTGG TAAAGGTATT GCTAATCCAT TGGCAACGAT TTTGTCTGCT
GCGATGATGT TGCGTTACTC CTTGGGTATG CCTGCTGAAG CAGATCGCAT TGAAAAGGCC
GTGCAAAAAG TATTGGCGCA AGGATTGCGA ACTGCCGATA TTTATACCGA AGGTACGAAA
AAGGTGTCTA CGGTTGAAAT GGGCGATGCT GTAGTTGCGG CGCTGGCTTA A
 
Protein sequence
MKIAVLPGDG IGPEIVAQAV RVLQALGPEF DLEEAPVGGA AYDAAGHPSP PATLELAKKA 
DAILFGAVGD WKYDTLAREL RPEQAILGLR KHLELFANFR PAICHPELTA ASSLKPEIIG
GLDILIVREL NGDIYFGQPR GIRTSELPLF KGAREGFDTM HYSEPEVERI GRVAFEAARK
RSKKVCSVDK ANVLETSQLW REVMIRIAKE YPDVELSHMY VDNAAMQLVK APKAFDVVVT
GNLFGDILSD EAAMLTGSIG MLPSASLDKN NKGLYEPSHG SAPDIAGKGI ANPLATILSA
AMMLRYSLGM PAEADRIEKA VQKVLAQGLR TADIYTEGTK KVSTVEMGDA VVAALA
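
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch of that relationship (not the database's own pipeline), the code below builds the codon table from the usual compact 64-letter amino-acid string, which for table 11 is the same as the standard code; table 11 differs only in its permitted start codons, which this sketch does not model. The function name `translate` and the two test fragments, copied from the first and last rows of the gene sequence, are chosen for illustration.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G enumeration.
BASES = "TCAG"
# Amino acids for table 11 (identical to the standard code); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks can be pasted as shown.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGAAAATTG CAGTCCTGCC GGGCGATGGT"))
    # Final 51 bases of the gene sequence, ending in the TAA stop codon.
    print(translate("AAGGTGTCTA CGGTTGAAAT GGGCGATGCT GTAGTTGCGG CGCTGGCTTA A"))
```

The two fragments translate to MKIAVLPGDG and KVSTVEMGDAVVAALA, matching the first and last residues of the protein sequence above.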