Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1467 |
Symbol | |
ID | 3785558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1676221 |
End bp | 1677987 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811555 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_412162 |
Protein GI | 82702596 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA CGGTCAGCGA TTTCCTGGTT CGGCGTTTGT CCGAATGGGG GGTAAAACGT ATCTTCGGCC TTCCTGGCGA CGGCATCAAC GGCATTATGG GGGCGATCAA TCGAGTTTCA GATAAGCTCG AATTTGTCCA GATCAGGCAT GAAGAGATGG CGGCGTTCAT GGCGTGCGCC CACGCCAAGT TCACGGGGGA AGTTGGTATT TGCCTGGCCA CCTCAGGCCC GGGAGCTATC CACCTGCTCA ATGGTTTATA TGATGCCAAG CTCGATCATC AGCCGGTGGT GGCGATAGTA GGTCAGCAGA AACGTACCGC TCTGGGGGGC AGCTACCAGC AGGAAGTCGA TCTCGTTTCG CTCTTCAAGG ATGTGGCGCA TGAGTATGTT CATATTGTGA CAACGCCAGG GCAAGTGCGC CACGTGCTCG ATCGCGCTAT GCGTATTGCA AAGGCCGAGC ATAACGTATG CTGCGTTATT GTTCCCAATG ACATCCAGGA CATGGAGTTT GTGAAACCCC CTCACGAACA TGGCACGATT TATTCGGGAG CAGGATATCG ATCTCCCCGT GTGGTTCCAG AGATTGAGGA CCTGCAACGG GCCGCGGATG TGCTGAACGA TGGGTCGAAG GTGGCGATTC TGGTTGGCGC GGGGGCACTG AACGCAACAA GTGAAATTCT CCAGGTGGCT GATCTGCTCG GCGCGGGAAT TGCCAAGGCG CTACTGGGAA AAACTGTGGT CCCTGACGAT CTGCCTTATG TGACAGGGGC AATCGGCATG CTGGGCACAA AGCCGAGCTA CAGCATGATG ACCGAGTGCG ACACACTCCT GATGATCGGC TCCAGCTTTC CCTATTCCGA ATTTTTGCCT GAGGAGGGGC AGGCACGCGG CGTCCAGATC GATATCGATG GACGAATGAT GAGCATGCGG TACCCGATGG AGGTAAACCT CGTCGGGAAT AGTGAGGATA CGCTAAAGCT GTTGATACCG TTACTCAAAA GGAAGGAAGA CCGTACGTGG CGAAACCGTA TCGAGAGCAG TGTCGACGAG TGGTGGAAGA AAATCGAGGC AAGGGCGATG GAGCCGGCAA ATCCCATCAA TCCCCAGCGC GTATTCTACG AATTATCGCC ACGGCTTCCG GATAATTGCA TTCTCGCAGG CGATTCTGGT TCTTCAACAT TCTGGTATGC GCGGGATATT CGAATCCGTA AAGGCATGAT GGCTTCGCTT TCCGGCGGTC TTGCCACGAT GGGATCAGCC GTGCCCTATG CAATCGCCGC TAAATTTGCG CATCCTGACA GGGTGGTAAT AGCCGTGACA GGAGATGGCG CGATGCAAAT GAACGGCATG AATGAGCTCA TTACCATCGT CAAATACTGG CGACATTGGA GCGATCCCCG GCTCGTGGTA CTGGTTTTGA ATAATCGCGA TCTGAACCTG GTAACCTGGG AGCAGAGGGC CACTGAGGGT AATCCGAAAT TCGATGCCGC TCAGGATCTT CCCGATGTCC CATACGCAGA TTATGCAAAA TTGATTGGTC TGCACGGTAT ACGCGTCGAC CGTCCAGAAA ATATCGCCAG CGCATGGGAT TGTGCCTTGA CCGCAGATCG ACCGGTGGTG CTCGAGGCAT GTACCGACCC GAACGTACCA CCGTTGCCAC CCCATATCAC TTTCAAACAG GCGAGAGCCT ATGCCTCAGC AATCGTGCAA GGTGATTCAG ACTCGAGAGA AATATTCAGG GAGACAGTAA AGCAGATTTT CGCCTGA
|
Protein sequence | MKETVSDFLV RRLSEWGVKR IFGLPGDGIN GIMGAINRVS DKLEFVQIRH EEMAAFMACA HAKFTGEVGI CLATSGPGAI HLLNGLYDAK LDHQPVVAIV GQQKRTALGG SYQQEVDLVS LFKDVAHEYV HIVTTPGQVR HVLDRAMRIA KAEHNVCCVI VPNDIQDMEF VKPPHEHGTI YSGAGYRSPR VVPEIEDLQR AADVLNDGSK VAILVGAGAL NATSEILQVA DLLGAGIAKA LLGKTVVPDD LPYVTGAIGM LGTKPSYSMM TECDTLLMIG SSFPYSEFLP EEGQARGVQI DIDGRMMSMR YPMEVNLVGN SEDTLKLLIP LLKRKEDRTW RNRIESSVDE WWKKIEARAM EPANPINPQR VFYELSPRLP DNCILAGDSG SSTFWYARDI RIRKGMMASL SGGLATMGSA VPYAIAAKFA HPDRVVIAVT GDGAMQMNGM NELITIVKYW RHWSDPRLVV LVLNNRDLNL VTWEQRATEG NPKFDAAQDL PDVPYADYAK LIGLHGIRVD RPENIASAWD CALTADRPVV LEACTDPNVP PLPPHITFKQ ARAYASAIVQ GDSDSREIFR ETVKQIFA
|
| |