Gene Nwi_2067 details


Gene Information

Locus tag: Nwi_2067
Symbol:
ID: 3675482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2256915
End bp: 2258285
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 62%
IMG OID: 637713633
Product: putative L-sorbosone dehydrogenase (SNDH)
Protein accession: YP_318677
Protein GI: 75676256
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
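
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2256915, 2258285)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both results agree with the Gene Length and Protein Length fields of this record.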


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAAAT TGCTTATCTC TTTGCTTGCC ATCGCATTCG TTGCCGCCAT CGCGGGAGTG 
GGCGTGATCC TGTTCTCGAA GGAACAGGCG ACCGTTGCGC TCGAGGAGCA GTACGGTTCT
GATCCGAAAC TTCCTCCGCC GGACCCGCGC ATTCTGCCGA CCGTGAACCA GGCCAGGGCC
GAGCCGTGGA AGCGAGGCGA AACGCCGACC GCCGCTCAAG GCTATGCGGT GTCCCAGTTC
GCCGATGGAC TTGATCATCC GCGCTGGCTG CATGTGCTGC CGAACGGTGA CGTGCTTGTC
GCGGAAAGCA ACAAGCCGCC CCAGGATGAA AGCAGCATGG GCATCCGCCG CTGGGTGGCG
AAGCTCGTGA TGAGCAGCGC AGGCGCCAGC ACGCCCAGCG CGAATCGCAT CTCCATCCTG
CGCGACGCGG ACGGCGACGG CGTAGCGGAG TTAAGGCAGC CCTTCATCAC CGGACTGTTC
TCACCTTTCG GCATGGCGCT GCTCGATGGC AAGCTCTATG TCGCCAACGC CAACTGCATC
GTGGCCTTTC CTTATCAGGA AGGCGCAACG GAAATCACCG CGAAGCCGGA AACAATTACC
GAGCTGCCCG CCGGGCTGAA TCACCATTGG ACAAAGGACG TCATCGCATC CCCAGACGGA
ACGAAGCTCT ATGTCACCGT CGGCTCCAAC AGCAATGTCG GCGAGAACGG CATGGAGGCG
GAGGAAGGCC GCGCGGCCGT CCATGAAATC GACCTGGCCA CAAAGCAGAA GCGCCTGTTT
GCCACGGGGC TGCGCAATCC TAACGGGCTC TCCTGGCAAC CCGACAGCGG CGAACTGTGG
GTGGTGGTCA ACGAGCGCGA CGAGATCGGC AGCGATCTCG TGCCGGACTA CATGACCTCA
GTGAAGGAGG GTGCATTCTA TGGCTGGCCA TACAGCTATT TCGGACAGCA CGTCGACGAT
CGCATAGAAC CGCGTCGACC TGACCTGGTC GCGAAGGCCA TCAAGCCGGA CTATGCACTC
GGCTCCCATA CGGCCGCGCT TGGCCTCACC TTCAATAGCG GAAGCCTGTT CGGGCCTGAA
ATGAAGAACG GCGCGTTCAT CGGCCAGCAT GGATCATGGA ATCGTATGCC GCGCAGCGGT
TACAAGGTGA TCTTCGTGCC GTTCAGCGAC GGCAAGCCCG CCGGACCTCC GCGGGATATC
CTGACCGGAT TCCTGCGTCT TGATGAGAGC GCGGCAGGCC GCCCCGTTGG CGTTGCTGTC
GCCCGCGATG GGGCGCTGCT GGTGGCGGAC GATGTCGGCA ACAGTATCTG GCGGGTGACG
CCGAGCCAAT CCGGCCAGAC AAACCCCGGA AGCGTCGGGT CGGAACCATA G
 
Protein sequence
MKKLLISLLA IAFVAAIAGV GVILFSKEQA TVALEEQYGS DPKLPPPDPR ILPTVNQARA 
EPWKRGETPT AAQGYAVSQF ADGLDHPRWL HVLPNGDVLV AESNKPPQDE SSMGIRRWVA
KLVMSSAGAS TPSANRISIL RDADGDGVAE LRQPFITGLF SPFGMALLDG KLYVANANCI
VAFPYQEGAT EITAKPETIT ELPAGLNHHW TKDVIASPDG TKLYVTVGSN SNVGENGMEA
EEGRAAVHEI DLATKQKRLF ATGLRNPNGL SWQPDSGELW VVVNERDEIG SDLVPDYMTS
VKEGAFYGWP YSYFGQHVDD RIEPRRPDLV AKAIKPDYAL GSHTAALGLT FNSGSLFGPE
MKNGAFIGQH GSWNRMPRSG YKVIFVPFSD GKPAGPPRDI LTGFLRLDES AAGRPVGVAV
ARDGALLVAD DVGNSIWRVT PSQSGQTNPG SVGSEP
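
The relationship between the gene sequence and the protein sequence can be illustrated in code. This is a minimal sketch, assuming the elongation codons of translation table 11 match the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position. The helper names `translate_cds` and `gc_percent` are hypothetical, and the start-codon set used here is only a subset of those table 11 allows.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M, stopping at a stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "GTGAAGAAAT TGCTTATCTC TTTGCTTGCC ATCGCATTCG TTGCCGCCAT CGCGGGAGTG"
print(translate_cds(first_line))  # MKKLLISLLAIAFVAAIAGV
```

The first 60 bases of the gene sequence yield the first 20 residues listed under the protein sequence, with the leading GTG read as M. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.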