Gene Nmul_A0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0474 
Symbol 
ID3784891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp529376 
End bp531133 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content56% 
IMG OID637810550 
Productacetolactate synthase, large subunit, biosynthetic type 
Protein accessionYP_411174 
Protein GI82701608 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAATC CAACATATAT TGCCGATATT CAACATCTTG AAACGCAGGA AAGCATGAGC 
GCGGATTTGA CGGGCGCCGA AATTACGGTG CGTTGCTTAC AAGAGGAAGG GGTAGAACAT
ATTTTCGGCT ATCCCGGGGG CGCTGTGCTG TTCCTGTATG ACGAATTGTT CAAGCAGGAT
AAGGTCAGGC ATATCCTGGT GCGGCACGAA CAGGCCGCGG TACATGCCGC GGACGGCTAT
GCCCGCTCCA GCAACAAGGT AGGGGTGGCA CTGGTCACTT CCGGGCCGGG TGTGACCAAC
GCCGTGACCG GCATCGCCAC TGCCTATATG GATTCGATTC CCCTCGTCAT CATCAGCGGG
CAGGTCCCCA CCCATGCCAT CGGCTTGGAC GCGTTCCAGG AAGTCGATAC CGTAGGCATC
ACGCGTCCCT GCGTGAAGCA TAATTTTCTG GTGAAGGATA TCGCAGAGCT TGCCGTCACC
ATCAAGAAAG CGTTCTATAT CGCTTCGACA GGGCGTCCCG GTCCGGTGCT GGTCGACATC
CCGAAGGACG TGAGTCAGCA GAAAACAAAA TTCGTTTATC CCGAGCGTGT CACAATGCGC
TCCTATAATC CGAACATCCG GGGGCACGCC GGCCAGATCA AAAAAGCCGT TCAACTGATC
CTGGAAGCCA GGCGTCCGAT GATCTACACC GGCGGGGGAG TCATCCTCAG CGACGCCGCT
ACCCGGTTGA CGGAACTCGT CCGCCTGCTG CGCTTTCCCT GCACCAACAC GTTGATGGGC
CTGGGGGGTT ATCCGGCAAC GGACCCCCAG TTTGTGGGCA TGCTGGGCAT GCATGGCACT
TACGAGGCCA ACATGGCGAT GCAGCATTGC GACGTGCTGG TGGCTGTAGG TGCCCGTTTC
GATGATCGTG TCATCGGCAA TCCGAAGCAT TTCTATAATC CGGACCGGAA GATCATTCAC
ATCGATATCG ATCCTTCCTC CATTTCGAAG CGTGTCAAGG TGGACGTTCC TATCGTCGGC
AATGTCCCCG ATGTGCTGGA TGAACTCATA AAACTGCTTG AACTGCGCAA GGAAAAACCC
GATCAGACCG CTCTTGACGC CTGGTGGAGC CAGATCGATT CATGGCGGGA GCGCGACTGT
CTCAGGTATG ACCGCACAAG CGCGATCATC AAGCCGCAGA GGGTGGTGGA AACGCTCTAC
AAGGTAACGA ATGGCGATGC CTTCATTACC TCGGATGTCG GTCAGCACCA GATGTGGGCA
GCGCAATTTT ACAAATTCGA CCTGCCGCGG CGATGGATCA ATTCCGGAGG GCTTGGAACG
ATGGGTTTCG GCCTGCCTTC GGCGATGGGT GTCCAGATGG CTAATCCAGG TGCGAACGTG
GCCTGCATTA CCGGCGAAGC CAGCATCCAG ATGTGCATCC AGGAGCTGTC GACCTGCAAG
CAATATCACC TTCCGCTCAA GATCATCAAC CTGAATAACC GCTATATGGG AATGGTGCGG
CAGTGGCAGG AATTCTTCCA TGGCAACCGC TATGCGGAGT CCTACATGGA CGCGCTGCCC
GATTTCGTGA AGCTGGCGGA GAGTTATGGC CATGTCGGCA TGCGGATCGA ACAGCCCGGC
GATGTCGAAG CCGCCTTGCA GGAAGCATTC AAGCTCAAGG ACAGGCTCGT GTTCATGGAT
TTCGTCACCG ACCAGACCGA AAACGTATTC CCGATGGTGC CGGGCGGCAA GGGTCTGTCT
GAAATGATTC TGGTATAA
 
Protein sequence
MFNPTYIADI QHLETQESMS ADLTGAEITV RCLQEEGVEH IFGYPGGAVL FLYDELFKQD 
KVRHILVRHE QAAVHAADGY ARSSNKVGVA LVTSGPGVTN AVTGIATAYM DSIPLVIISG
QVPTHAIGLD AFQEVDTVGI TRPCVKHNFL VKDIAELAVT IKKAFYIAST GRPGPVLVDI
PKDVSQQKTK FVYPERVTMR SYNPNIRGHA GQIKKAVQLI LEARRPMIYT GGGVILSDAA
TRLTELVRLL RFPCTNTLMG LGGYPATDPQ FVGMLGMHGT YEANMAMQHC DVLVAVGARF
DDRVIGNPKH FYNPDRKIIH IDIDPSSISK RVKVDVPIVG NVPDVLDELI KLLELRKEKP
DQTALDAWWS QIDSWRERDC LRYDRTSAII KPQRVVETLY KVTNGDAFIT SDVGQHQMWA
AQFYKFDLPR RWINSGGLGT MGFGLPSAMG VQMANPGANV ACITGEASIQ MCIQELSTCK
QYHLPLKIIN LNNRYMGMVR QWQEFFHGNR YAESYMDALP DFVKLAESYG HVGMRIEQPG
DVEAALQEAF KLKDRLVFMD FVTDQTENVF PMVPGGKGLS EMILV