Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0474 |
Symbol | |
ID | 3784891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 529376 |
End bp | 531133 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810550 |
Product | acetolactate synthase, large subunit, biosynthetic type |
Protein accession | YP_411174 |
Protein GI | 82701608 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAATC CAACATATAT TGCCGATATT CAACATCTTG AAACGCAGGA AAGCATGAGC GCGGATTTGA CGGGCGCCGA AATTACGGTG CGTTGCTTAC AAGAGGAAGG GGTAGAACAT ATTTTCGGCT ATCCCGGGGG CGCTGTGCTG TTCCTGTATG ACGAATTGTT CAAGCAGGAT AAGGTCAGGC ATATCCTGGT GCGGCACGAA CAGGCCGCGG TACATGCCGC GGACGGCTAT GCCCGCTCCA GCAACAAGGT AGGGGTGGCA CTGGTCACTT CCGGGCCGGG TGTGACCAAC GCCGTGACCG GCATCGCCAC TGCCTATATG GATTCGATTC CCCTCGTCAT CATCAGCGGG CAGGTCCCCA CCCATGCCAT CGGCTTGGAC GCGTTCCAGG AAGTCGATAC CGTAGGCATC ACGCGTCCCT GCGTGAAGCA TAATTTTCTG GTGAAGGATA TCGCAGAGCT TGCCGTCACC ATCAAGAAAG CGTTCTATAT CGCTTCGACA GGGCGTCCCG GTCCGGTGCT GGTCGACATC CCGAAGGACG TGAGTCAGCA GAAAACAAAA TTCGTTTATC CCGAGCGTGT CACAATGCGC TCCTATAATC CGAACATCCG GGGGCACGCC GGCCAGATCA AAAAAGCCGT TCAACTGATC CTGGAAGCCA GGCGTCCGAT GATCTACACC GGCGGGGGAG TCATCCTCAG CGACGCCGCT ACCCGGTTGA CGGAACTCGT CCGCCTGCTG CGCTTTCCCT GCACCAACAC GTTGATGGGC CTGGGGGGTT ATCCGGCAAC GGACCCCCAG TTTGTGGGCA TGCTGGGCAT GCATGGCACT TACGAGGCCA ACATGGCGAT GCAGCATTGC GACGTGCTGG TGGCTGTAGG TGCCCGTTTC GATGATCGTG TCATCGGCAA TCCGAAGCAT TTCTATAATC CGGACCGGAA GATCATTCAC ATCGATATCG ATCCTTCCTC CATTTCGAAG CGTGTCAAGG TGGACGTTCC TATCGTCGGC AATGTCCCCG ATGTGCTGGA TGAACTCATA AAACTGCTTG AACTGCGCAA GGAAAAACCC GATCAGACCG CTCTTGACGC CTGGTGGAGC CAGATCGATT CATGGCGGGA GCGCGACTGT CTCAGGTATG ACCGCACAAG CGCGATCATC AAGCCGCAGA GGGTGGTGGA AACGCTCTAC AAGGTAACGA ATGGCGATGC CTTCATTACC TCGGATGTCG GTCAGCACCA GATGTGGGCA GCGCAATTTT ACAAATTCGA CCTGCCGCGG CGATGGATCA ATTCCGGAGG GCTTGGAACG ATGGGTTTCG GCCTGCCTTC GGCGATGGGT GTCCAGATGG CTAATCCAGG TGCGAACGTG GCCTGCATTA CCGGCGAAGC CAGCATCCAG ATGTGCATCC AGGAGCTGTC GACCTGCAAG CAATATCACC TTCCGCTCAA GATCATCAAC CTGAATAACC GCTATATGGG AATGGTGCGG CAGTGGCAGG AATTCTTCCA TGGCAACCGC TATGCGGAGT CCTACATGGA CGCGCTGCCC GATTTCGTGA AGCTGGCGGA GAGTTATGGC CATGTCGGCA TGCGGATCGA ACAGCCCGGC GATGTCGAAG CCGCCTTGCA GGAAGCATTC AAGCTCAAGG ACAGGCTCGT GTTCATGGAT TTCGTCACCG ACCAGACCGA AAACGTATTC CCGATGGTGC CGGGCGGCAA GGGTCTGTCT GAAATGATTC TGGTATAA
|
Protein sequence | MFNPTYIADI QHLETQESMS ADLTGAEITV RCLQEEGVEH IFGYPGGAVL FLYDELFKQD KVRHILVRHE QAAVHAADGY ARSSNKVGVA LVTSGPGVTN AVTGIATAYM DSIPLVIISG QVPTHAIGLD AFQEVDTVGI TRPCVKHNFL VKDIAELAVT IKKAFYIAST GRPGPVLVDI PKDVSQQKTK FVYPERVTMR SYNPNIRGHA GQIKKAVQLI LEARRPMIYT GGGVILSDAA TRLTELVRLL RFPCTNTLMG LGGYPATDPQ FVGMLGMHGT YEANMAMQHC DVLVAVGARF DDRVIGNPKH FYNPDRKIIH IDIDPSSISK RVKVDVPIVG NVPDVLDELI KLLELRKEKP DQTALDAWWS QIDSWRERDC LRYDRTSAII KPQRVVETLY KVTNGDAFIT SDVGQHQMWA AQFYKFDLPR RWINSGGLGT MGFGLPSAMG VQMANPGANV ACITGEASIQ MCIQELSTCK QYHLPLKIIN LNNRYMGMVR QWQEFFHGNR YAESYMDALP DFVKLAESYG HVGMRIEQPG DVEAALQEAF KLKDRLVFMD FVTDQTENVF PMVPGGKGLS EMILV
|
| |