Gene Nmul_A0466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0466 
Symbol 
ID3786013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp518270 
End bp519664 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content56% 
IMG OID637810542 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_411166 
Protein GI82701600 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0364] Glucose-6-phosphate 1-dehydrogenase 
TIGRFAM ID[TIGR00871] glucose-6-phosphate 1-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAA CCAATGACTA CACCAAGCCA TCCGATGCAC TGGTGCTGTT TGGGATAACG 
GGTGACCTCG CTTATAAAAA GATCTTTCCG GCGCTGTACG CGATGATCAA AAAGGGCATG
CTCGATGTGC CGCTGGTCGG TGTCGCCTCG ACACCCTGGA GTCTCGATCA ACTCAAGGAA
CGGGCGACCC AGTCCATCAG CGACTCCGGA AAGATTGATG ACAAACGGGC GCTGGATCAC
CTCCTTTCCC TGCTGCGATA TGTGCGTGGA GACTACAACA ACCTCGATAC GTTCAAGGCA
CTCAAACAGG CTCTGGGAGA TGCACGTCAT CCCGTGCACT ATCTCGCCAT CCCCCCTCTT
CTTTTCGAGA ATGTCATAAG AGGTCTTGGT GCGCTCGATC TGGCCGCAGG CGCGCGCGTC
ATTGTGGAAA AACCTTTTGG ACGGGATCTT GAGTCAGCAC GTGAACTGAA CCGTATTGCA
CGCTCCGTGT TCCCCGAGGA AGCGATATTC CGTATCGACC ATTTTCTCGG AAAAGAGGCG
ATCATGAATA TTCTTTATTT CAGATTCGCC AATTCATTTC TGGAACCGAT ATGGAATCGT
AATTATGTAG CCAGCGTGCA GATCACATTG GCTGAGGAAT TCGGGGTTGA AGAACGAGGC
GCATTTTACG AATCCGCCGG CTGCCTGCGC GACGTGATCC AGAATCATCT TTTCCAGATT
GTTGCGCTGC TGGCCATGGA ACCTCCCGCT TATCGCGGTC TCGGAACGGT CCAAAGCGAA
AAAATCAACA TATTTCACGC CATGCGCCCC CTTGTACCCG AGGACCTGGT GCGGGGACAA
TATGTGGGCT ACCGTCAGGA ACCGGGTGTG GCGGAGGACT CCGATGTCGA GACATTCTGC
GCCCTGCGGC TTTTCATCGA CTCCTGGCGC TGGGAAGGGG TCCCCTGGTA TCTGCGTTCC
GGCAAGTGTC TGGCCAAGAC TGCTGCGGAG GTCCTTGTCC AGCTGAAGCC GCCGCCGCAA
AAGCTTTTTG CCGATTCGGC AAGCGCCGCA TGTGATGCCA ATTATCTCAG GTTCCGCCTT
TCTCCCGTCT CAGCGGTTGC CATTGCAGCG AGAGTCAAGC ATCCCGGAAA AGAGTTCAAA
GGCGATCAGC AGGAGTTGTG CCTGGTCGAG GAGCACTTCG GGCGCGAATC GCCTTATGAA
CGTCTCCTGC ATGATGCAAT GATTGGTGAC GACACCCTGT TTACCAAGAG GGAAGCGGTG
GAGGCATCCT GGACAGCACT CGATCCTGTG CTCAAGACAT ATCCTCACGT TCTGCCCTAC
GAGCGCGGCA GTTGGGGCCC CGCCGCAGCC GACGCGCTGA TCGAGGCGGA TGGCTGCTGG
CACAATCCGG GATAG
 
Protein sequence
MRKTNDYTKP SDALVLFGIT GDLAYKKIFP ALYAMIKKGM LDVPLVGVAS TPWSLDQLKE 
RATQSISDSG KIDDKRALDH LLSLLRYVRG DYNNLDTFKA LKQALGDARH PVHYLAIPPL
LFENVIRGLG ALDLAAGARV IVEKPFGRDL ESARELNRIA RSVFPEEAIF RIDHFLGKEA
IMNILYFRFA NSFLEPIWNR NYVASVQITL AEEFGVEERG AFYESAGCLR DVIQNHLFQI
VALLAMEPPA YRGLGTVQSE KINIFHAMRP LVPEDLVRGQ YVGYRQEPGV AEDSDVETFC
ALRLFIDSWR WEGVPWYLRS GKCLAKTAAE VLVQLKPPPQ KLFADSASAA CDANYLRFRL
SPVSAVAIAA RVKHPGKEFK GDQQELCLVE EHFGRESPYE RLLHDAMIGD DTLFTKREAV
EASWTALDPV LKTYPHVLPY ERGSWGPAAA DALIEADGCW HNPG