Gene Mlg_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1121 
Symbol 
ID4269845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1312090 
End bp1314324 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content64% 
IMG OID638125872 
Productisocitrate dehydrogenase, NADP-dependent 
Protein accessionYP_741962 
Protein GI114320279 
COG category[C] Energy production and conversion 
COG ID[COG2838] Monomeric isocitrate dehydrogenase 
TIGRFAM ID[TIGR00178] isocitrate dehydrogenase, NADP-dependent, monomeric type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.833434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CCGCGACGAA CACCATCTAC TACACCCTCA CGGACGAGGC CCCTGCGTTG 
GCCACCCGCT CTTTACTCCC CGTGGTCCAG ACCTTCACTG GCGCCGCCGG CATCGATGTC
AAACTCTCTG ATATTTCCCT GGCCGGACGC ATCCTCTCCG CCTTCCCGGA ACGGTTGAGC
GACGATCAGA AGGTGGAGGA CGGCCTGGCC TTCCTCGGTG ACCTGACGCA GGACCCTAAC
GCCAACATCA TCAAACTCCC GAACATCAGC GCCTCCATCC CCCAGCTCAA CGCCTGCATC
CGCGAACTCC AGTCCCAGGG CTACGACGTT CCCAGCTACC CCGCCGAGCC CAAGAACGAC
GAGGAACAGG CGATCCACGA CCGCTACGCC AAGGTGCTGG GCAGCGCGGT CAACCCGGTG
CTGCGCGAGG GCAACTCCGA CCGACGCGCC CCCCGGGCGG TGAAGAACTT CGTGCGCAAG
CACCCGCACT CCATGGGCAA GTGGAGCAAG GCCTCCCGCA CCCACGCCGA CTACATGCGC
GGCGGCGACT TCTACTCCGC GGAGCAGTCC GTCACCATGC CCCAGGCCGG CACGGTGCGC
ATCGAGTTCG TCGACAAGGA CGGCAACGTC ACGCTGAAGA AGAAGCTGGA GCTCGAGACC
GGCGAGATCA TCGACAGCAT GCGCATGAGC GTGAACGCCC TGCGCGACTT CCTGGAAGAG
ACCATGGAAG ACGCCAAGGA CTCCCAGGTC ATGTGGTCGC TGCACGTGAA GGCCACCATG
ATGAAGGTCT CGCACCCCAT CGTGTTCGGT CACGCGGTGA AGGTCTACTA CAAAGAGGTC
TTCGAGAAGT GGGGCGACCT GTTCAAAGAA CTGGGCGTGA ACCCCAACGA CGGCCTCAGC
AGCGTCTACG AGAAGATCGA AACCCTGCCG CGCTCCCAGC AGGAGGAGAT CCACCGCGAC
ATCCTGGCCG TCTACGAGCA CCGCCCGGAA ATGGCCATGG TGGATTCCTA CAAGGGTATC
ACCAACCTGC ACATGCCCAG TGACGTGATC GTGGACGCCT CCATGCCGGC CATGATCCGC
AACGGCGGCA AGATGTGGGG GCCCGACGGC AAGCCGAAGG ACTGCAAGGC CGTCATGCCG
GAGAGCACCT ACTCCAAGAT CTACCAGGAG ATGATCAACT TCTGTAAGAC CAACGGCGCC
TTCGACCCCA CCACCATGGG CACGGTGCCC AACGTGGGCC TGATGGCGAA GAAGGCCGAG
GAGTACGGCT CCCACGACAA GACCTTCGAG CTCGAGGCCG ACGGCATCAT GCGCATCGTC
GACCACAAGG GCCATGTGCT GATGCAGCAC GAGGTGGAGA AGGGCGATAT CTGGCGCGCC
TGCCAGACCA AGGACATCGC CGTGCGCGAC TGGGTCAAGC TGGCCGTCCG CCGCGCCCGC
GAGGCCGACA CCCCGGCCAT CTTCTGGCTG GACCGCAACC GCCCCCACGA CATCGAGCTG
ATCAAGAAGG TCAACTGCTA CCTGCAAGAG CACGACCTGA GCGGCCTGGA TATCCGCATC
CAGACCTACG AGGAGGCCAT CCGCCGCTCC ATGGAGCGTG CGCTGCGCGG CCGTGACACC
ATCTCGGTCA CCGGCAACGT CCTGCGCGAC TACCTGACCG ACCTGTTCCC CATCCTGGAA
CTGGGCACCT CGGCCAAGAT GCTCTCCATT GTGCCCCTGC TCAAGGGCGG CGGCATGTAC
GAGACCGGCG CCGGCGGCTC CGCGCCCAAG CACGTCCAGC AGCTGCAGGA GGAAAACCAC
CTGCGCTGGG ATTCGCTGGG CGAGTTCCTG GCCATTGCGG TGTCGCTGGA CGAACTGGGC
ATGAAGCAGG ACAACGCCCG TGCCCGCGTG CTGGCGCAGT GCCTGGACCA GGCCACCGAG
GAAGCGCTGG AGAACGAGAA GTCGCCGTCG CGCAAGACCG GCGAGCTGGA CAACCGGGGC
AGTCACTACT GGCTGGCCCT CTACTGGGCC GAAGCGGTTG CCGCTCAGAC CGAGGACAAG
GAGCTGGCCG ATCACTTCGG GCCGGTGGCC AAGCAGCTGA AAGAGAAGAA GGAGCAGATC
CTCGAAGAGC TGAGCGTGGT CCAGGGCAGC CCGGCGGACC TGGACGGCTA CTACCACCCG
TCCCCCGAGG TGGCCGACAA GGTCATGCGG CCCAGCCCGA CGCTGAACGG CATCCTGGCG
GACGCCATGA AATAA
 
Protein sequence
MTDTATNTIY YTLTDEAPAL ATRSLLPVVQ TFTGAAGIDV KLSDISLAGR ILSAFPERLS 
DDQKVEDGLA FLGDLTQDPN ANIIKLPNIS ASIPQLNACI RELQSQGYDV PSYPAEPKND
EEQAIHDRYA KVLGSAVNPV LREGNSDRRA PRAVKNFVRK HPHSMGKWSK ASRTHADYMR
GGDFYSAEQS VTMPQAGTVR IEFVDKDGNV TLKKKLELET GEIIDSMRMS VNALRDFLEE
TMEDAKDSQV MWSLHVKATM MKVSHPIVFG HAVKVYYKEV FEKWGDLFKE LGVNPNDGLS
SVYEKIETLP RSQQEEIHRD ILAVYEHRPE MAMVDSYKGI TNLHMPSDVI VDASMPAMIR
NGGKMWGPDG KPKDCKAVMP ESTYSKIYQE MINFCKTNGA FDPTTMGTVP NVGLMAKKAE
EYGSHDKTFE LEADGIMRIV DHKGHVLMQH EVEKGDIWRA CQTKDIAVRD WVKLAVRRAR
EADTPAIFWL DRNRPHDIEL IKKVNCYLQE HDLSGLDIRI QTYEEAIRRS MERALRGRDT
ISVTGNVLRD YLTDLFPILE LGTSAKMLSI VPLLKGGGMY ETGAGGSAPK HVQQLQEENH
LRWDSLGEFL AIAVSLDELG MKQDNARARV LAQCLDQATE EALENEKSPS RKTGELDNRG
SHYWLALYWA EAVAAQTEDK ELADHFGPVA KQLKEKKEQI LEELSVVQGS PADLDGYYHP
SPEVADKVMR PSPTLNGILA DAMK