Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1918 |
Symbol | |
ID | 3784156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2207479 |
End bp | 2208540 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637812004 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_412605 |
Protein GI | 82703039 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAG CAATTCTGGC CGGAGATGGT ATCGGCCCGG AAATTGTCGC GCAAGCGGTG CGCGTGCTGG AAACGCTGAG AAGCGATGGA TTGAAGCTGG AACTGGAACA GGGATTGCTG GGCGGATGTG CTGTAGATGC AGCGGGCGAG CCTTTTCCCG CGGCAACGCG CACATTGGTG GCTCAGGCGG ACGCCGTGAT CCTGGGGGCA GTGGGCGGCC CGCGATATGA CGGGCTGCCC CGGCAGCTCA GGCCGGAGCA AGGCCTTCTT GGCATACGGA AGGCCTTGAA CCTGTTTGCC AATCTCCGGC CTGCGGTACT TTATCCTGAA CTTGCCGATG CCTCCACGCT GAAGCCCGAG GTGGTGTCCG GACTCGATAT CCTGATCGTG CGCGAGTTGA CCGGGGATAT TTACTTTGGT GAGCCACGCG GGATTGAATT ACGGAATGGT CAGCGCATCG GCTACAATAC CATGATTTAC AGCGAAGCCG AGATCCGGAG AATAGCGCGG GTGGCTTTCC AGGCAGCGCG CAAGCGCAGT CGCAGGCTGT GCTCTGTCGA CAAGATGAAC GTACTGGAAT CAACCCAGCT GTGGCGCGAC GTGGTGACCG AAACGGCAGG TGAATATCCG GACGTGGAGC TTTCGCACAT GCTGGTGGAC AATGCGGCCA TGCAGCTTGT ACGCAATCCC CGGCAGTTCG ATGTGGTTGT GACAGGCAAT ATGTTCGGGG ACATCCTGTC GGATGAAGCA TCCATGTTGA CCGGTTCGAT CGGCATGCTG CCTTCGGCAT CGCTCGATGA GCGGAACAAG GGGCTTTATG AACCCATACA CGGTTCTGCT CCCGATATCG CCGGCAAGGA CGTGGCGAAT CCTCTGGCCA CCGTCCTTTC AGTTGCGATG ATGCTGCGCT ATACCTTCGA TCGGGAGGAG GAGGCATCCC GAATCGAACG GGCAGTGAAA AAGGTGCTGG CTGATGGATA CCGGACGGCG GATATTTACG AGCCAGGAAA GATGAAAATC GGAACCGCAG CAATGGGTGA TGCGGTTCTG GCAAGTTTGT AG
|
Protein sequence | MKIAILAGDG IGPEIVAQAV RVLETLRSDG LKLELEQGLL GGCAVDAAGE PFPAATRTLV AQADAVILGA VGGPRYDGLP RQLRPEQGLL GIRKALNLFA NLRPAVLYPE LADASTLKPE VVSGLDILIV RELTGDIYFG EPRGIELRNG QRIGYNTMIY SEAEIRRIAR VAFQAARKRS RRLCSVDKMN VLESTQLWRD VVTETAGEYP DVELSHMLVD NAAMQLVRNP RQFDVVVTGN MFGDILSDEA SMLTGSIGML PSASLDERNK GLYEPIHGSA PDIAGKDVAN PLATVLSVAM MLRYTFDREE EASRIERAVK KVLADGYRTA DIYEPGKMKI GTAAMGDAVL ASL
|
| |