Gene Nwi_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1947 
Symbol 
ID3674791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2135469 
End bp2137856 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content61% 
IMG OID637713512 
Productputative phosphoketolase 
Protein accessionYP_318559 
Protein GI75676138 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAC TGGACGATAC GCAACTGGGC TTGATGAACC TCTACTGGCG CGCGGCTAAC 
TACGTGTCGG TCGGGCAAAT CTACCTGATG GATAATCCGC TGCTGCGAAA GCCGTTGAAG
GCGGAAGACG TGAAGCCGCG GCTGCTCGGT CACTGGGGTA CGACGCCCGG GCTGAATTTC
ATCTACGTGC ATCTGAATCG GGTCATCAGG GAGCGTGGCA CAGACGTCCT CTACATCTGC
GGTCCCGGTC ACGGCGGGCC TGCAATGGTC GCCAACACCT ACCTGGAGGG CTCGTACAGC
GAGATCTATC CGGGGATCAC GCAGGATGAA GCAGGCTTGC AGAAACTTTT CCGGCAGTTT
TCATTCCCGG GTGGTGTGCC CAGCCACGTC GCCCCAGAGA CGCCCGGCTC CATCCACGAG
GGAGGAGAAC TCGGCTACGC CCTTGTCCAT GCCTTCGGCG CCGCTTTCGA CAATCCCGAC
CTCACGGTCG CCTGTGTCGT TGGGGACGGG GAAGCCGAGA CGGGTCCGCT GGCCGCTGCG
TGGCATTCCA ACAGGTTTCT GAATCCGCAG GTCGATGGCG CGGTCCTGCC GATCCTGCAT
CTTAACGGCT ACAAGATCGC CAATCCCGCC CTGTTGGCGC GCATGTCTGA AGCCGAACTC
AGCGATCTCT TCAGGGGCTA CGGCTATGAG CCGCTCTTTG TCGAGGGGCA CGAGCCGATC
CCAATGCACC GCACCATGGC TGAGTCCCTG GACCTGGCCA TGGATCGTAT TCGTGACATC
CAGCGAGAGG CGCGGAAGGA GGGCTGGTCG GGCGAGCGCC CGCGTTGGCC GATGATCATC
TTGCGCAGCC CGAAAGGATG GACAGGACCG AAGGAGGTCG ACGGGAAGAA GGTGGAGGAC
TTCTGGCGCT CGCATCAGGT CCCGGTTTCC AACGCCCGCG GTGATGCCAC ACACCGCAAG
ATCCTCGAGG AGTGGCTGCG TAGCTATCGG CCGCAGGAGC TGTTCGACGG GAAGGGTCGC
TTCCTGCCGG AGATCGCCGC CCTAGCGCCC GAAGGTGCGA AACGCATGGG CGCGCTGCCG
CATGCCAATG GCGGTCTGCT GAAGAAGGAC CTTATCCTGC CGGACTGGAA GAGCCTCGCG
CTCGACATCG GGCAACCGGG AGAAACGATC GCCGAGGCTA CCCGCTTGAT GGGAGCTTAT
CTGCGTGAGG TCATCCGCCT TAATGCCGAG GCAAGGAATT TCCGCCTCAT GGGTCCGGAC
GAGACCGCCT CGAACAGGCT CGACGCCGTC TTCGAGGTGA CGAACCGCGT CTGGATGGAG
AAGATTGAAT CCTATGATGT GCACCTCGCG TGCGAGGGCC GGGTAATGGA GGTTCTGTCA
GAGCATCTTT GCCAAGGGTG GCTGGAGGGT TACCTGCTGA CTGGACGACA TGGCGTGTTC
TCCTGCTACG AGGCCTTCAT CCACATCGTG GATTCTATGG TCAATCAGCA TGCCAAGTGG
CTGAAGGTCT CGGCCGAACT GCCATGGCGC AAGCCAATCG CCTCACTCAA CTATCTCCTG
ACCTCCCACG TCTGGAGGCA GGACCACAAC GGCTTCAGCC ACCAGGATCC GGGCTTCGCC
GACTTCGTCG CCAACAAGAA GGCCGACATC GTCCGCCTTT ATTTTCCGCC GGATGCCAAC
ACGCTGCTCT GGGTCACGGA CCACTGCCTG AGAACATGGA ATCGCATCAA CGTGATCACG
GCCGGCAAGC AGCCTCAGCC GCAGTGGCTC ACAGCCGAGC AGGCGGAACG CCATTGCGAG
GCGGGCGCCG GGATTTGGGA ATGGGCGTGC ACATGCCCGG CTGACGAGGA GCCTGACGTG
GTGATGGCCT GCTGCGGCGA CGTGCCGACA CTGGAAATTC TCGCCGCCGT CGGCCTGCTG
CGCCGCGAGT TGCCTGATCT TCGTATTAGG GTGGTCAATG TCGTCGACCT GATGACGCTG
CAATCTCACA CGACCCATCC GCACGGGTTC ACCGACGATG AATTCGACGC TCTCTTCACG
AAGGAGAAGC CCGTGATCTT TGCCTACCAC GGCTATCCCT ACCTCATTCA CCGCCTGACC
TATAAGCGAA CCAACCACGC CAATTTCCAT GTCCATGGTT TCCAGGAAGA AGGTACGACC
ACCACGCCAT TCGATATGGC GGTCATGAAC GAACTCGACC GCTTCCATCT GGTCATAGCC
GCCGTCAGGC GGCTGCCGAA TCTTGGCATG GGCGGCGAAC GAGTGATCGG ACGGTGCGAG
CAAGCGCTGG CCGAGCATGC CTCGTATGCC CGGCAATATG GTGAAGACAT GCCTGAAATC
CGGGAATGGA GCTGGCCCTA CAAGACTGCC GCCGAGGCGG GAGACTGA
 
Protein sequence
MAELDDTQLG LMNLYWRAAN YVSVGQIYLM DNPLLRKPLK AEDVKPRLLG HWGTTPGLNF 
IYVHLNRVIR ERGTDVLYIC GPGHGGPAMV ANTYLEGSYS EIYPGITQDE AGLQKLFRQF
SFPGGVPSHV APETPGSIHE GGELGYALVH AFGAAFDNPD LTVACVVGDG EAETGPLAAA
WHSNRFLNPQ VDGAVLPILH LNGYKIANPA LLARMSEAEL SDLFRGYGYE PLFVEGHEPI
PMHRTMAESL DLAMDRIRDI QREARKEGWS GERPRWPMII LRSPKGWTGP KEVDGKKVED
FWRSHQVPVS NARGDATHRK ILEEWLRSYR PQELFDGKGR FLPEIAALAP EGAKRMGALP
HANGGLLKKD LILPDWKSLA LDIGQPGETI AEATRLMGAY LREVIRLNAE ARNFRLMGPD
ETASNRLDAV FEVTNRVWME KIESYDVHLA CEGRVMEVLS EHLCQGWLEG YLLTGRHGVF
SCYEAFIHIV DSMVNQHAKW LKVSAELPWR KPIASLNYLL TSHVWRQDHN GFSHQDPGFA
DFVANKKADI VRLYFPPDAN TLLWVTDHCL RTWNRINVIT AGKQPQPQWL TAEQAERHCE
AGAGIWEWAC TCPADEEPDV VMACCGDVPT LEILAAVGLL RRELPDLRIR VVNVVDLMTL
QSHTTHPHGF TDDEFDALFT KEKPVIFAYH GYPYLIHRLT YKRTNHANFH VHGFQEEGTT
TTPFDMAVMN ELDRFHLVIA AVRRLPNLGM GGERVIGRCE QALAEHASYA RQYGEDMPEI
REWSWPYKTA AEAGD