Gene Nmul_A1795 details

Gene Information

Locus tag: Nmul_A1795
Symbol:
ID: 3786346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2046334
End bp: 2048286
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 55%
IMG OID: 637811881
Product: phosphoheptose isomerase
Protein accession: YP_412484
Protein GI: 82702918
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0279] Phosphoheptose isomerase
        [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR00441] phosphoheptose isomerase
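
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2046334
end_bp = 2048286
gene_length_bp = 1953
protein_length_aa = 650

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the span, the gene length, and the protein length agree with one another.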


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGA GAATTGCCCT CATCAGTGAA CACGCTTCAC CCATCGCGGC AATAGGAGGT 
ACGGACACCG GCGGACAAAA TATAGCGGTG GCCGAGCTGG CCCGGCATCT TGCCGCCCTC
GGCTACGAAA TTGATGTCTT TACCCGCTGG GATGACCGCC GTGTTCCAAA AATCCTCAAC
TGGCGGGATG GCATACGCAT CGTCCATGTG GAAGCCGGGC CCGTCACGTT CATTCCCAAG
GAAAAGCTGC TGCCTTATAT GCCCGCCTTC ACGCGCGACA TCCTGCGGTT TATCAAGTCG
GAAAACAATC GCTACAAGCT CGTTCACGCC CATTTCTTCA TGTCCGGGCT GGTGGCGGCG
GATATCAAGC GAAAACTGGG TATTCCTTTC ATCGTTACCT TTCACGCTCT CGCAAAAGTG
CGGAGGCTTC ACCAGGGAGG GAATGACTGG TTCCCGGACG AGGGCTTTGC CATCGAAGAA
AGGGTGATAA CAGAAGCGGA CCAGATTGTC GCCTTGTGCC CGCAGGACCG CGATGATCTG
ATCAATCTTT ATGAAGCTGA TCCCGGAAAA ATCACGGTTA TTCCAAACGG ATTCAGACCG
GATGAGATCT ATCCTCTCGA CAAGCTGTTC GCGCGCATGG CGCTGAAACT CGATCCCAAG
GAAAAGATTA TCCTGCAACT GGGGCGCATG GTGCGGCGAA AGGGTGTCGA TAACGTCATA
AAAGCGCTGG GCTACATGCG GCGCGAGCAT AACTTCGAAG CACGTCTTCT GATAGTGGGC
GGGGAGTCGG ATGAGCCCGA TCCAAAAACA ACGCCTGAAA TCGGTCGCCT GCAAAAACTG
GCTGAAGCAG AGGGTGCGGG CGATCTTGTG ACGTTTGTCG GACGCCGCCC GCGCGACATG
CTGCATTATT ACTATAGCGC GTGCGACGTA TTCACGACTA CGCCCTGGTA TGAACCGTTC
GGGATCACCC CGCTTGAAGC AATGGCCTGC GGGACGCCCG TGATCGGGTC AAATGTTGGG
GGCATTAAAT CCACCGTCAT GGATGGCAGG ACGGGCTTTC TCGTGCCGCC CAACGATCCC
GCGTCACTCG GGCGCCGCAT CATAGAGCTT TTGAGCAGCA ACAAGCTCAT GACGTATTTC
AAGGAAAACG CCATCCGCCA TGTCAATCAG AATTACACCT GGATGAAGGC AACGCATCTC
ACGGCCAACA TGTACGAGCG GATTGCAACC CAGAGCCCCC TGCGAGCGGA CGAAGAAGAA
GATTCCTTGT CCTACATCGA CGACTCCTTC GGGTCATTAA TAGAGACTAT TGAAAAGTCC
AGGCGGAAAA TCCGCCTTGC CATCCTCGAT TCGGCCCAGG CTGTATACCG CTCGTTGGCG
CGCGGCGGAA AGGTATTGGT TTGCGGCAAT GGCGGGAGTG CCGCCGAAGC GCAGCACTTT
GCGGCCGAAC TCATGGGGCG GTTTGAGGCA AGTGGTCGTC GCGGCTTGCC CGCAATGGCA
CTCACTGCCG ATACCGCCTT TGTGACCGCC TGGTCGAATG ACTATACATT TGACGATGTG
TTTGCCAGAC AGGTCGAAGC GCATGGGCAG CCGGGGGACG TTCTGGTTGT CATCAGCTCG
AGCGGGCAGT CAGTCAACTT GGTCAAGGCG CTCCGGACGG CGCGCCGGCA CGAGATGTTC
TGCATTGGCC TGCTTGGCAA GGAGGGTGGT CCTGCCAGCG AACTGACTGA TATCAACATC
ATTGTTCCGT CAAACGAAAC TTCACGCATC CAGGAAGTAC AACTGCTTGT TCTCCACGTG
CTCAGTCATC TGATCGAGCA GCAAATCGTG GTGGATGACC TGAATACCGT TCAGATAACG
GAAGAGTGGT CAATAAAGCA CTTTCAGGTC CAGGAAATGG CTAAAAACGT TAATAAGAGG
AAAATCAAGC ATGAATCAAC AAAGTGTGAC TGA
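
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup and of a GC-content calculation. The function names `clean_sequence` and `gc_content` are hypothetical. The demonstration uses only the first printed line of the sequence, so its result is not the whole-gene figure of 55% reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one upper-case string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed line of the gene sequence only (not the full gene).
first_line = "ATGAAGAAGA GAATTGCCCT CATCAGTGAA CACGCTTCAC CCATCGCGGC AATAGGAGGT"
seq = clean_sequence(first_line)

print(len(seq))                  # 60
print(f"{gc_content(seq):.1%}")  # GC fraction of this first line only
```

Passing all printed lines of the gene sequence to `clean_sequence` should, if the record is consistent, yield a 1953-base string whose GC content can be compared with the reported value.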
 
Protein sequence
MKKRIALISE HASPIAAIGG TDTGGQNIAV AELARHLAAL GYEIDVFTRW DDRRVPKILN 
WRDGIRIVHV EAGPVTFIPK EKLLPYMPAF TRDILRFIKS ENNRYKLVHA HFFMSGLVAA
DIKRKLGIPF IVTFHALAKV RRLHQGGNDW FPDEGFAIEE RVITEADQIV ALCPQDRDDL
INLYEADPGK ITVIPNGFRP DEIYPLDKLF ARMALKLDPK EKIILQLGRM VRRKGVDNVI
KALGYMRREH NFEARLLIVG GESDEPDPKT TPEIGRLQKL AEAEGAGDLV TFVGRRPRDM
LHYYYSACDV FTTTPWYEPF GITPLEAMAC GTPVIGSNVG GIKSTVMDGR TGFLVPPNDP
ASLGRRIIEL LSSNKLMTYF KENAIRHVNQ NYTWMKATHL TANMYERIAT QSPLRADEEE
DSLSYIDDSF GSLIETIEKS RRKIRLAILD SAQAVYRSLA RGGKVLVCGN GGSAAEAQHF
AAELMGRFEA SGRRGLPAMA LTADTAFVTA WSNDYTFDDV FARQVEAHGQ PGDVLVVISS
SGQSVNLVKA LRTARRHEMF CIGLLGKEGG PASELTDINI IVPSNETSRI QEVQLLVLHV
LSHLIEQQIV VDDLNTVQIT EEWSIKHFQV QEMAKNVNKR KIKHESTKCD
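
The protein sequence can be related back to the gene sequence by translation. The sketch below is one way to do this in plain Python and is not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons), and it assumes the gene sequence is shown 5' to 3' in the coding orientation. The `translate` function is a hypothetical helper. The last printed line of the gene sequence appears to be in frame because every preceding line holds 60 bases.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGAAGAAGA GAATTGCCCT CATCAGTGAA"    # start of the gene
last_line = "AAAATCAAGC ATGAATCAAC AAAGTGTGAC TGA"   # final printed line

print(translate(first_blocks))  # MKKRIALISE
print(translate(last_line))     # KIKHESTKCD
```

Both results match the first and last 10 residues of the protein sequence in this record.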