Gene Nmul_A1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1921 
Symbol 
ID3784159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2209401 
End bp2210810 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content59% 
IMG OID637812007 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_412608 
Protein GI82703042 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.724454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACGT TATACGACAA ACTCTGGCAA AGCCATGTGG TGCATGAAGA ATCCGATGAT 
GCGGAAGGCA TGGCGCTGCT ATATATCGAT CGTCACCTCG TGCACGAGGT GACCAGCCCC
CAAGCGTTCG AAGGGTTGAA GCTGGCCGGA CGCAAGCCGT GGCGCCTCAA TTCCATTCTG
GCGGTAGCAG ACCACAACGT GCCGACCACC GGGCGAGATC ATGGCATCAG TGATCCGGTA
TCTCGCCTGC AAGTGGAAAC GCTCGATCAG AATTGTGAGG AATTGGGCAT CACCGAATTC
AGAATGAACG ATCAGCGCCA AGGCATCGTG CATGTCATCG GACCGGAACA GGGTGCCACC
CTGCCAGGAA TGACAGTGGT TTGCGGCGAC TCGCATACAA GCACGCATGG CGCTTTTGGC
TGCCTTGCCT TTGGCATAGG TACTTCCGAG GTGGAGCATG TGCTGGCGAC GCAGTGCTTG
CTGGCGAAGA AATCCAGAAC GATGCAGATC GTGGTAGATG GCAACCTGGG CAATGGCATT
ACCGCCAAGG ATGTGGCGCT TGCCGTGGTC GGCAGGATCG GTACTGCCGG AGGTACAGGC
TATGCGATCG AGTTTGCCGG CAGCGCGATT CGTGGGCTGT CGATGGAAGG GCGCATGACG
CTCTGCAATA TGGCGATCGA GGCGGGTGCG CGTGCGGGAA TGGTGGCAGT GGATGATGTC
ACCATTGAAT ATCTCCGGGG CCGTCCCTTT GCGCCTAAAG GCGATCTCTG GGAAAAAGCG
GTGGCCTACT GGCGTACCCT GAAGAGTGAC GAGGGTGCGA GCTTTGACAA GGGGGTTCAA
CTGGACGCCG CCTCGATCAA GCCGCAGGTA ACCTGGGGAA CTTCCCCCGA AATGGTTGCG
ACAGTGGATG GAAAGGTGCC TGATCCGACA GAAATCGCGG ATGCGGTGAA GCGTCACGAC
ATGGAGCGGG CGCTCAAATA CATGGCGCTG GCTCCCAACA CCCCGATCAG TGAAATTCGC
CCCGATAAAA TATTCATCGG TTCCTGCACG AATGCGCGCA TAGAGGATTT GCGCGCCGCC
GCCGAGGTGG TGAGGGGCCG TCGCATTGCG AAGAGCATCA AGCTTGCAAT GGTCGTACCA
GGATCGGGCC TGGTCAAGCA TCAGGCAGAG CAGGAGGGGC TGGACAGGAT ATTCCGCGAT
GCGGGCTTTG AATGGCGCGA ACCGGGTTGC TCGATGTGCC TGGCGATGAA TGACGACAGG
CTGGAGCCCG GCGAACGGTG CGCTTCCACT TCCAATCGCA ACTTTGAGGG CAGGCAGGGT
CCCGCGGGGC GTACTCACCT GGTGAGCCCT GCGATGGCCG CCGCTGCCGC AGTCGCGGGA
CATTTTGTGG ATGTAAGGGA AATATATTAG
 
Protein sequence
MQTLYDKLWQ SHVVHEESDD AEGMALLYID RHLVHEVTSP QAFEGLKLAG RKPWRLNSIL 
AVADHNVPTT GRDHGISDPV SRLQVETLDQ NCEELGITEF RMNDQRQGIV HVIGPEQGAT
LPGMTVVCGD SHTSTHGAFG CLAFGIGTSE VEHVLATQCL LAKKSRTMQI VVDGNLGNGI
TAKDVALAVV GRIGTAGGTG YAIEFAGSAI RGLSMEGRMT LCNMAIEAGA RAGMVAVDDV
TIEYLRGRPF APKGDLWEKA VAYWRTLKSD EGASFDKGVQ LDAASIKPQV TWGTSPEMVA
TVDGKVPDPT EIADAVKRHD MERALKYMAL APNTPISEIR PDKIFIGSCT NARIEDLRAA
AEVVRGRRIA KSIKLAMVVP GSGLVKHQAE QEGLDRIFRD AGFEWREPGC SMCLAMNDDR
LEPGERCAST SNRNFEGRQG PAGRTHLVSP AMAAAAAVAG HFVDVREIY