Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1921 |
Symbol | |
ID | 3784159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2209401 |
End bp | 2210810 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637812007 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_412608 |
Protein GI | 82703042 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.724454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACGT TATACGACAA ACTCTGGCAA AGCCATGTGG TGCATGAAGA ATCCGATGAT GCGGAAGGCA TGGCGCTGCT ATATATCGAT CGTCACCTCG TGCACGAGGT GACCAGCCCC CAAGCGTTCG AAGGGTTGAA GCTGGCCGGA CGCAAGCCGT GGCGCCTCAA TTCCATTCTG GCGGTAGCAG ACCACAACGT GCCGACCACC GGGCGAGATC ATGGCATCAG TGATCCGGTA TCTCGCCTGC AAGTGGAAAC GCTCGATCAG AATTGTGAGG AATTGGGCAT CACCGAATTC AGAATGAACG ATCAGCGCCA AGGCATCGTG CATGTCATCG GACCGGAACA GGGTGCCACC CTGCCAGGAA TGACAGTGGT TTGCGGCGAC TCGCATACAA GCACGCATGG CGCTTTTGGC TGCCTTGCCT TTGGCATAGG TACTTCCGAG GTGGAGCATG TGCTGGCGAC GCAGTGCTTG CTGGCGAAGA AATCCAGAAC GATGCAGATC GTGGTAGATG GCAACCTGGG CAATGGCATT ACCGCCAAGG ATGTGGCGCT TGCCGTGGTC GGCAGGATCG GTACTGCCGG AGGTACAGGC TATGCGATCG AGTTTGCCGG CAGCGCGATT CGTGGGCTGT CGATGGAAGG GCGCATGACG CTCTGCAATA TGGCGATCGA GGCGGGTGCG CGTGCGGGAA TGGTGGCAGT GGATGATGTC ACCATTGAAT ATCTCCGGGG CCGTCCCTTT GCGCCTAAAG GCGATCTCTG GGAAAAAGCG GTGGCCTACT GGCGTACCCT GAAGAGTGAC GAGGGTGCGA GCTTTGACAA GGGGGTTCAA CTGGACGCCG CCTCGATCAA GCCGCAGGTA ACCTGGGGAA CTTCCCCCGA AATGGTTGCG ACAGTGGATG GAAAGGTGCC TGATCCGACA GAAATCGCGG ATGCGGTGAA GCGTCACGAC ATGGAGCGGG CGCTCAAATA CATGGCGCTG GCTCCCAACA CCCCGATCAG TGAAATTCGC CCCGATAAAA TATTCATCGG TTCCTGCACG AATGCGCGCA TAGAGGATTT GCGCGCCGCC GCCGAGGTGG TGAGGGGCCG TCGCATTGCG AAGAGCATCA AGCTTGCAAT GGTCGTACCA GGATCGGGCC TGGTCAAGCA TCAGGCAGAG CAGGAGGGGC TGGACAGGAT ATTCCGCGAT GCGGGCTTTG AATGGCGCGA ACCGGGTTGC TCGATGTGCC TGGCGATGAA TGACGACAGG CTGGAGCCCG GCGAACGGTG CGCTTCCACT TCCAATCGCA ACTTTGAGGG CAGGCAGGGT CCCGCGGGGC GTACTCACCT GGTGAGCCCT GCGATGGCCG CCGCTGCCGC AGTCGCGGGA CATTTTGTGG ATGTAAGGGA AATATATTAG
|
Protein sequence | MQTLYDKLWQ SHVVHEESDD AEGMALLYID RHLVHEVTSP QAFEGLKLAG RKPWRLNSIL AVADHNVPTT GRDHGISDPV SRLQVETLDQ NCEELGITEF RMNDQRQGIV HVIGPEQGAT LPGMTVVCGD SHTSTHGAFG CLAFGIGTSE VEHVLATQCL LAKKSRTMQI VVDGNLGNGI TAKDVALAVV GRIGTAGGTG YAIEFAGSAI RGLSMEGRMT LCNMAIEAGA RAGMVAVDDV TIEYLRGRPF APKGDLWEKA VAYWRTLKSD EGASFDKGVQ LDAASIKPQV TWGTSPEMVA TVDGKVPDPT EIADAVKRHD MERALKYMAL APNTPISEIR PDKIFIGSCT NARIEDLRAA AEVVRGRRIA KSIKLAMVVP GSGLVKHQAE QEGLDRIFRD AGFEWREPGC SMCLAMNDDR LEPGERCAST SNRNFEGRQG PAGRTHLVSP AMAAAAAVAG HFVDVREIY
|
| |