Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1007 |
Symbol | |
ID | 3785838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1167307 |
End bp | 1168284 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637811091 |
Product | hypothetical protein |
Protein accession | YP_411702 |
Protein GI | 82702136 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase [COG1051] ADP-ribose pyrophosphatase |
TIGRFAM ID | [TIGR00586] mutator mutT protein [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCCCG CTCCCTCCAT CGTTGAAGTT GTTGCCGCCA TCATCATCGG GAGCGACGGT TCCTTTCTTT TGGCCCGTCG CCCGGAAGGC AAACCCTATG CCGGCTACTG GGAATTTCCG GGTGGCAAGG TCAATCCGGA AGAATCCCTG TTGCGCGCAC TCAAGCGCGA ACTGCTGGAA GAATTGGGCA TACATGTCAA GCATGCGTAC CCCTGGATAA CCCGCACATT CACCTATCCT CATGCCAGGG TACGGCTTCA CTTTTATCGG GTCGTGGAGT GGCATGGCGA GCCTCATCCG CATGAAGACC AGGAGCTGTC GTGGCAGTTT GCGGACAATG TCTCTGTAGA GCCGTTGCTC CCTGCCAACG CACCCGTATT GCGGGCACTT GCCTTGCCCC CCGTGTATGG GATTACGAAT GCCGCGGAGT GGGGGCCACA AATTGCAGCA GCCCGAATCG GGCACGCTCT CCAGAAAGGA CTACGGCTGG TGCAGCTACG CGAAAAAGGA ATGCGGAGTA AAGCGCTGGA TGCCTTTGCC CGTGAAGTAA CCGCACTGGC CCATCATTAC GGCGCCCGCA TTCTCGTAAA CAGCGGCACG GGTAACGAGA GCCTCTGCCA GGAATTGGAT ATGGATGGAA TCCACTTTAC ATCGGCTGAT CTGATGAATC TTTCGAAACG ACCCGATGTA GAATGGTGCG CGGCCTCCTG TCACAACGCC GAAGAGCTTT TCCGGGCCGA GCAACTGGAA ATGGATTTCG CCGTTCTGGC TCCCGTACTT CCGACATTGA GCCATCCCGA TTCCCCCGTG CTGGGCTGGC GAAAGCTTGC CAGAATCATC CACGGCAGCG CCATCCCTGT CTATGCGCTG GGAGGGCTGC AAAGCGAAGA TCTTGCCATT GCCTGGGAAC ACGGTGCCCA CGGCATCGCC CTGATGCGCC GTATTGAGCA GGTGCGCGGC ACCGGGCAGA AGGCCTGA
|
Protein sequence | MRPAPSIVEV VAAIIIGSDG SFLLARRPEG KPYAGYWEFP GGKVNPEESL LRALKRELLE ELGIHVKHAY PWITRTFTYP HARVRLHFYR VVEWHGEPHP HEDQELSWQF ADNVSVEPLL PANAPVLRAL ALPPVYGITN AAEWGPQIAA ARIGHALQKG LRLVQLREKG MRSKALDAFA REVTALAHHY GARILVNSGT GNESLCQELD MDGIHFTSAD LMNLSKRPDV EWCAASCHNA EELFRAEQLE MDFAVLAPVL PTLSHPDSPV LGWRKLARII HGSAIPVYAL GGLQSEDLAI AWEHGAHGIA LMRRIEQVRG TGQKA
|
| |