Gene Information |
Locus tag | Nmul_A1795 |
Symbol | |
ID | 3786346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2046334 |
End bp | 2048286 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811881 |
Product | phosphoheptose isomerase |
Protein accession | YP_412484 |
Protein GI | 82702918 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0279] Phosphoheptose isomerase; [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR00441] phosphoheptose isomerase |
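
The start/end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(2046334, 2048286)
print(length_bp, protein_length(length_bp))  # prints: 1953 650
```

Under these assumptions the result agrees with the Gene Length (1953 bp) and Protein Length (650 aa) fields.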
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA GAATTGCCCT CATCAGTGAA CACGCTTCAC CCATCGCGGC AATAGGAGGT ACGGACACCG GCGGACAAAA TATAGCGGTG GCCGAGCTGG CCCGGCATCT TGCCGCCCTC GGCTACGAAA TTGATGTCTT TACCCGCTGG GATGACCGCC GTGTTCCAAA AATCCTCAAC TGGCGGGATG GCATACGCAT CGTCCATGTG GAAGCCGGGC CCGTCACGTT CATTCCCAAG GAAAAGCTGC TGCCTTATAT GCCCGCCTTC ACGCGCGACA TCCTGCGGTT TATCAAGTCG GAAAACAATC GCTACAAGCT CGTTCACGCC CATTTCTTCA TGTCCGGGCT GGTGGCGGCG GATATCAAGC GAAAACTGGG TATTCCTTTC ATCGTTACCT TTCACGCTCT CGCAAAAGTG CGGAGGCTTC ACCAGGGAGG GAATGACTGG TTCCCGGACG AGGGCTTTGC CATCGAAGAA AGGGTGATAA CAGAAGCGGA CCAGATTGTC GCCTTGTGCC CGCAGGACCG CGATGATCTG ATCAATCTTT ATGAAGCTGA TCCCGGAAAA ATCACGGTTA TTCCAAACGG ATTCAGACCG GATGAGATCT ATCCTCTCGA CAAGCTGTTC GCGCGCATGG CGCTGAAACT CGATCCCAAG GAAAAGATTA TCCTGCAACT GGGGCGCATG GTGCGGCGAA AGGGTGTCGA TAACGTCATA AAAGCGCTGG GCTACATGCG GCGCGAGCAT AACTTCGAAG CACGTCTTCT GATAGTGGGC GGGGAGTCGG ATGAGCCCGA TCCAAAAACA ACGCCTGAAA TCGGTCGCCT GCAAAAACTG GCTGAAGCAG AGGGTGCGGG CGATCTTGTG ACGTTTGTCG GACGCCGCCC GCGCGACATG CTGCATTATT ACTATAGCGC GTGCGACGTA TTCACGACTA CGCCCTGGTA TGAACCGTTC GGGATCACCC CGCTTGAAGC AATGGCCTGC GGGACGCCCG TGATCGGGTC AAATGTTGGG GGCATTAAAT CCACCGTCAT GGATGGCAGG ACGGGCTTTC TCGTGCCGCC CAACGATCCC GCGTCACTCG GGCGCCGCAT CATAGAGCTT TTGAGCAGCA ACAAGCTCAT GACGTATTTC AAGGAAAACG CCATCCGCCA TGTCAATCAG AATTACACCT GGATGAAGGC AACGCATCTC ACGGCCAACA TGTACGAGCG GATTGCAACC CAGAGCCCCC TGCGAGCGGA CGAAGAAGAA GATTCCTTGT CCTACATCGA CGACTCCTTC GGGTCATTAA TAGAGACTAT TGAAAAGTCC AGGCGGAAAA TCCGCCTTGC CATCCTCGAT TCGGCCCAGG CTGTATACCG CTCGTTGGCG CGCGGCGGAA AGGTATTGGT TTGCGGCAAT GGCGGGAGTG CCGCCGAAGC GCAGCACTTT GCGGCCGAAC TCATGGGGCG GTTTGAGGCA AGTGGTCGTC GCGGCTTGCC CGCAATGGCA CTCACTGCCG ATACCGCCTT TGTGACCGCC TGGTCGAATG ACTATACATT TGACGATGTG TTTGCCAGAC AGGTCGAAGC GCATGGGCAG CCGGGGGACG TTCTGGTTGT CATCAGCTCG AGCGGGCAGT CAGTCAACTT GGTCAAGGCG CTCCGGACGG CGCGCCGGCA CGAGATGTTC TGCATTGGCC TGCTTGGCAA GGAGGGTGGT CCTGCCAGCG AACTGACTGA TATCAACATC ATTGTTCCGT CAAACGAAAC TTCACGCATC CAGGAAGTAC AACTGCTTGT TCTCCACGTG 
CTCAGTCATC TGATCGAGCA GCAAATCGTG GTGGATGACC TGAATACCGT TCAGATAACG GAAGAGTGGT CAATAAAGCA CTTTCAGGTC CAGGAAATGG CTAAAAACGT TAATAAGAGG AAAATCAAGC ATGAATCAAC AAAGTGTGAC TGA
|
Protein sequence | MKKRIALISE HASPIAAIGG TDTGGQNIAV AELARHLAAL GYEIDVFTRW DDRRVPKILN WRDGIRIVHV EAGPVTFIPK EKLLPYMPAF TRDILRFIKS ENNRYKLVHA HFFMSGLVAA DIKRKLGIPF IVTFHALAKV RRLHQGGNDW FPDEGFAIEE RVITEADQIV ALCPQDRDDL INLYEADPGK ITVIPNGFRP DEIYPLDKLF ARMALKLDPK EKIILQLGRM VRRKGVDNVI KALGYMRREH NFEARLLIVG GESDEPDPKT TPEIGRLQKL AEAEGAGDLV TFVGRRPRDM LHYYYSACDV FTTTPWYEPF GITPLEAMAC GTPVIGSNVG GIKSTVMDGR TGFLVPPNDP ASLGRRIIEL LSSNKLMTYF KENAIRHVNQ NYTWMKATHL TANMYERIAT QSPLRADEEE DSLSYIDDSF GSLIETIEKS RRKIRLAILD SAQAVYRSLA RGGKVLVCGN GGSAAEAQHF AAELMGRFEA SGRRGLPAMA LTADTAFVTA WSNDYTFDDV FARQVEAHGQ PGDVLVVISS SGQSVNLVKA LRTARRHEMF CIGLLGKEGG PASELTDINI IVPSNETSRI QEVQLLVLHV LSHLIEQQIV VDDLNTVQIT EEWSIKHFQV QEMAKNVNKR KIKHESTKCD
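
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, assuming the gene sequence is shown in coding orientation (it begins with ATG even though the gene is on the minus strand) and using the standard codon assignments, which translation table 11 shares for elongation codons. The `translate` helper is hypothetical, and only short fragments copied from the record are used here.

```python
from itertools import product

# NCBI-style codon ordering: first base varies slowest, order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record
print(translate("ATGAAGAAGA GAATTGCCCT CATCAGTGAA"))  # prints: MKKRIALISE
print(translate("AAGTGTGACTGA"))                      # prints: KCD
```

The two fragments reproduce the first ten residues (MKKRIALISE) and the last three residues (KCD) of the protein sequence, with the terminal TGA read as the stop codon.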