Gene Nmul_A1918 details


Gene Information

Locus tag: Nmul_A1918
Symbol:
ID: 3784156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2207479
End bp: 2208540
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 58%
IMG OID: 637812004
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_412605
Protein GI: 82703039
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
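
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions about how this record was built, not statements taken from it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2207479
end_bp = 2208540

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1062
print(protein_length)  # 353
```

Under these assumptions the computed values agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields.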


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAG CAATTCTGGC CGGAGATGGT ATCGGCCCGG AAATTGTCGC GCAAGCGGTG 
CGCGTGCTGG AAACGCTGAG AAGCGATGGA TTGAAGCTGG AACTGGAACA GGGATTGCTG
GGCGGATGTG CTGTAGATGC AGCGGGCGAG CCTTTTCCCG CGGCAACGCG CACATTGGTG
GCTCAGGCGG ACGCCGTGAT CCTGGGGGCA GTGGGCGGCC CGCGATATGA CGGGCTGCCC
CGGCAGCTCA GGCCGGAGCA AGGCCTTCTT GGCATACGGA AGGCCTTGAA CCTGTTTGCC
AATCTCCGGC CTGCGGTACT TTATCCTGAA CTTGCCGATG CCTCCACGCT GAAGCCCGAG
GTGGTGTCCG GACTCGATAT CCTGATCGTG CGCGAGTTGA CCGGGGATAT TTACTTTGGT
GAGCCACGCG GGATTGAATT ACGGAATGGT CAGCGCATCG GCTACAATAC CATGATTTAC
AGCGAAGCCG AGATCCGGAG AATAGCGCGG GTGGCTTTCC AGGCAGCGCG CAAGCGCAGT
CGCAGGCTGT GCTCTGTCGA CAAGATGAAC GTACTGGAAT CAACCCAGCT GTGGCGCGAC
GTGGTGACCG AAACGGCAGG TGAATATCCG GACGTGGAGC TTTCGCACAT GCTGGTGGAC
AATGCGGCCA TGCAGCTTGT ACGCAATCCC CGGCAGTTCG ATGTGGTTGT GACAGGCAAT
ATGTTCGGGG ACATCCTGTC GGATGAAGCA TCCATGTTGA CCGGTTCGAT CGGCATGCTG
CCTTCGGCAT CGCTCGATGA GCGGAACAAG GGGCTTTATG AACCCATACA CGGTTCTGCT
CCCGATATCG CCGGCAAGGA CGTGGCGAAT CCTCTGGCCA CCGTCCTTTC AGTTGCGATG
ATGCTGCGCT ATACCTTCGA TCGGGAGGAG GAGGCATCCC GAATCGAACG GGCAGTGAAA
AAGGTGCTGG CTGATGGATA CCGGACGGCG GATATTTACG AGCCAGGAAA GATGAAAATC
GGAACCGCAG CAATGGGTGA TGCGGTTCTG GCAAGTTTGT AG
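
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The following is a minimal sketch of one way to strip the whitespace and compute a GC percentage. The function name `gc_percent` is hypothetical, and the record does not say how its own GC content figure was derived or rounded, so this may not reproduce that figure exactly.

```python
def gc_percent(sequence):
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (block separators, line breaks) is removed first.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# Usage on the first two blocks of the gene sequence; pass the full
# sequence text to get a whole-gene value.
print(gc_percent("ATGAAAATAG CAATTCTGGC"))
```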
 
Protein sequence
MKIAILAGDG IGPEIVAQAV RVLETLRSDG LKLELEQGLL GGCAVDAAGE PFPAATRTLV 
AQADAVILGA VGGPRYDGLP RQLRPEQGLL GIRKALNLFA NLRPAVLYPE LADASTLKPE
VVSGLDILIV RELTGDIYFG EPRGIELRNG QRIGYNTMIY SEAEIRRIAR VAFQAARKRS
RRLCSVDKMN VLESTQLWRD VVTETAGEYP DVELSHMLVD NAAMQLVRNP RQFDVVVTGN
MFGDILSDEA SMLTGSIGML PSASLDERNK GLYEPIHGSA PDIAGKDVAN PLATVLSVAM
MLRYTFDREE EASRIERAVK KVLADGYRTA DIYEPGKMKI GTAAMGDAVL ASL
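
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is a hand-rolled illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Codon table in NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame from its first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop block separators and newlines
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence and its last 12 bases.
print(translate("ATGAAAATAG CAATTCTGGC C"))  # MKIAILA
print(translate("GCAAGTTTGT AG"))            # ASL
```

These two fragments match the beginning (MKIAILA...) and end (...ASL) of the protein sequence listed above.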