Gene Nmul_A0469 details


Gene Information

Locus tag: Nmul_A0469
Symbol: pgi
ID: 3786016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 523946
End bp: 525586
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 55%
IMG OID: 637810545
Product: glucose-6-phosphate isomerase
Protein accession: YP_411169
Protein GI: 82701603
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0166] Glucose-6-phosphate isomerase
TIGRFAM ID:
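
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 523946
end_bp = 525586
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be an untranslated stop.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp, expected_protein_length)  # 1641 546
```

Under those assumptions the coordinate span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.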


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAAC CCATGATCAC GCCCCTGACG CAAAGGCCTG CCTGGAAGGC ACTGGAAGCG 
CACTACCAGA CGATCAAAGG CATGCATTTG CGTCAGCTCT TCGCTGACGA TCCGAAGCGG
GGTGAGCGAT TTACGGCCGA GGCCGTCGGC CTGTACCTGG ATTATTCGAA AAATCGCATC
ACTGATGAAA CGCTTCATCT ACTGGTGCAG CTTGCCGAAG AATGCGGCCT GCGCGAGCGC
ATTGAAGCCA TGTTCAGGGG TGACGCCATC AATGTGACAG AACAGCGTGC TGTACTCCAC
ATCGCCTTGC GCGCGCCTCG TAATGAAAAA ATCCTCGTCG ACGGAAATGA CGTTGTGCCC
GGGGTGCATG CCGTGCTCGA CCGTATGGCG GATTTCTCCG ACAAGATACG CAGCGGGGAC
TGGCAGGGGC ATACTGGCAA GCGAATTCGC AATATCATCA ATATCGGTAT CGGCGGCTCC
GATCTGGGCC CGGTGATGGC GTATGAAGCG CTGCGCCACT ACAGTCTGCA CAATCTCAGC
TTTCGTTTCA TCTCCAATGT TGATGGCACG GATTTCGTGG AGGCTACACG AGGTCTTGAT
CCCGAAGAAA CCCTGTTCAT TATCTGCTCC AAGACATTCA CGACAACGGA AACGCTGGCC
AATGCCCACA CCGCCCGGCG GTGGATGCTG CGACAGATAA AGGACCTGGA GGGGGTGCGC
AAGCACTTCG TCGCTGTTTC CACCAATGCG GAGGAAGTAG CCAGATTCGG CATCGATACC
GCCAACATGT TCGAATTCTG GGACTGGGTA GGTGGACGCT ATTCCATGGA CTCCGCGATC
GGACTCTCAA CCATGATTGC CGTCGGCCCA GAGAATTTCC GTGAGATGCT TGCCGGCTTC
CATGCAATGG ACCAGCACTT CTATTCCGCT CCGTTCGACA GGAATCTTCC TGTCCTGATG
GGATTGCTGT CGCTCTGGTA TAACAATTTC TTTGGCGCGC AGACACTCGC CGTACTGCCC
TACGAGCAAT ATTTGAAGCG CTTTCCGGCT TACCTCCAGC AACTGACAAT GGAGAGCAAT
GGAAAGCACA TTACACTGAA TGGCTCCCAG GTTGACTACC AGACCTCACC TATCGTGTGG
GGAGAACCCG GCACCAACGG ACAACATTCG TTTTACCAGC TCATCCATCA GGGAACCCGA
TTGATTCCCT GTGATTTTAT CGGCTTCTGC CAAACCCTGA ACCCCTTGGG CGATCACCAT
GACCTCCTCA TGGCGAATCT GTTTGCCCAG ACCGAGGCGC TTGCTTTCGG AAAAACGGAA
GATGAAGTCA AAGCTGAAGG TGTCCCGGAC TGGCTTTGCC CGCATCGCAG TTTTGAGGGG
AATCGCCCCA CCAATACGAT ACTTGCCGAG CGCCTCACAC CCCACACCCT CGGTGCCCTT
GTCGCTCTTT ATGAGCAGAG TGTTTTTACA CAGGGGACAA TCTGGCAGAT CGATTCGTTC
GATCAATGGG GCGTCGAACT CGGCAAAGTG CTGGCACACC GCATCGGGCA GGAACTGGAG
GATGAAAACG GCAAGTCCCT GAAACATGAT AGCTCCACCA ACGCCCTGAT ACAGCGGTAC
AACAGGCTGA AACAAAAATA G
 
Protein sequence
MTKPMITPLT QRPAWKALEA HYQTIKGMHL RQLFADDPKR GERFTAEAVG LYLDYSKNRI 
TDETLHLLVQ LAEECGLRER IEAMFRGDAI NVTEQRAVLH IALRAPRNEK ILVDGNDVVP
GVHAVLDRMA DFSDKIRSGD WQGHTGKRIR NIINIGIGGS DLGPVMAYEA LRHYSLHNLS
FRFISNVDGT DFVEATRGLD PEETLFIICS KTFTTTETLA NAHTARRWML RQIKDLEGVR
KHFVAVSTNA EEVARFGIDT ANMFEFWDWV GGRYSMDSAI GLSTMIAVGP ENFREMLAGF
HAMDQHFYSA PFDRNLPVLM GLLSLWYNNF FGAQTLAVLP YEQYLKRFPA YLQQLTMESN
GKHITLNGSQ VDYQTSPIVW GEPGTNGQHS FYQLIHQGTR LIPCDFIGFC QTLNPLGDHH
DLLMANLFAQ TEALAFGKTE DEVKAEGVPD WLCPHRSFEG NRPTNTILAE RLTPHTLGAL
VALYEQSVFT QGTIWQIDSF DQWGVELGKV LAHRIGQELE DENGKSLKHD SSTNALIQRY
NRLKQK
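
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one possible way to strip the layout, estimate GC content, and translate the gene sequence. It is not the database's own method. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI amino-acid string, whose codon-to-residue assignments are shared by translation table 11; table 11's alternative start codons are not handled, which is assumed not to matter here because the gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# below are the standard ones, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGACGAAAC CCATGATCAC GCCCCTGACG")
print(translate(first_blocks))  # MTKPMITPLT
```

Passing the full gene sequence through `clean_sequence` and then `translate` should, under the assumptions above, reproduce the listed protein sequence, and `gc_fraction` gives a value to compare against the GC content field.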