Gene Nwi_1902 details

Gene Information

Locus tag: Nwi_1902
Symbol:
ID: 3675885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2087241
End bp: 2088740
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 68%
IMG OID: 637713466
Product: hypothetical protein
Protein accession: YP_318514
Protein GI: 75676093
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
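
The coordinate and length fields above are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2087241, 2088740)
print(length, protein_length_aa(length))  # prints: 1500 499
```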


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.92371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.382809
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATTC TGACGACGGG CGAAATGGAG CAGGCCGACC GGCTCTCGAT CGCGGGAGGC 
GTGTCGGGCT TCGCCTTGAT GCTGCGCGCC GGGCAGGCGG TCGCGAACGC GGCCGCCGAC
CTCGCGGAGA AAGGACCGAT CCTGGTCGTC GTCGGATGCG GCAACAACGG TGGCGACGGG
TTCGTTGCGG CGACGGAACT GGCGGCGCGC GGCCACGAGG TTTCGGTCAT CCTGCTGGGC
GAGCGAGACA GTCTCAAGGG TGACGCCGCG CTCGCGGCCA AGGGCTGGAA AGCGCCTGTG
CTGCCTTGCG ACCTCTCCGC GCTCGGTTCG CCCGCGCTGA TCATCGACGC GCTGTTCGGC
GCGGGCCTCA ACCGGCCGGT CAAGGGCGAC GCTCTCGCGA CGATCGAGGC CATCATGGCC
AACGGCGCGC CGGTGCTCAG CATCGATCTG CCGAGCGGCA TCAACGGCAC CACCGGCGCG
GTCATGGGCG CTGCGATCCG CGCGACCGAG ACGGTCACGT TCTTTCGCAA GAAGCCCGCG
CATCTGCTGC TGCCGGGCCG CATTCATTGC GGCCGGGTGC GCGTGGCCGA TATCGGTATC
GCCGATGCCG TGCTGGACGA GATTCGTCCC GTAACCTTCG AGAATACCCC GGATTATTGG
AACGGTGCGT TTCCCGCGCC GCATATCGAC GGACACAAGT ATAAGCGCGG CCATGCCGTT
GTCGTGTCCG GTCATCTGAC ATCAACGGGC GCGGCACGAC TGTCCGCGCG GGGCGCGTTG
CGGGCTGGAG CCGGTCTCGT CACCGTGCTC TCACCCGACG ATGCGCTGGG CGCGAACGCC
GCGGCGCTGA CCGCCGTGAT GGTGCGCCGC ATGAACGGCG CGGCCGGCTT CGCGGAAGTT
CTGGCGGATC GGCGTCTCAA TGCCTGCGTC ATCGGACCCG GCGCCGGGAT CGGCGAGGAA
ACACGCGCCC TTGTCTCCAC GGCCCTCGCA GCGGGGCGGG CGCTGGTGCT CGACGCCGAC
GCGTTGACAA GCTTCGCCGA TACCCCGGAT CGTCTGTTCG AGACCGTCAG GCAGTCCGGA
GGCTCCGACA TTATCCTGAC GCCGCACGAA GGCGAGTTCC ATCGTCTTTT CAGTGAGATG
AGTAACAAAC ATCCATTTCG ATCAAAACTT GAACGGGTGC GGGACGCGGC GCGGCGTTCG
GGATGTGTTG TCCTGCTCAA GGGGGCTGAC ACGGTGGTGG CCTCGCCGGA CGGGCGCGCG
ACCATCGCGG CCAACGCCCC GCCATGGCTC GCGACGGCAG GCTCGGGCGA CGTGCTGTCC
GGCATCATCG GCGGCCTGCT GGCGCAAAGC GTTCCGGCCT TCGAGGCGGC TTGCATGGGG
GTCTGGATGC ACGGCGAGAC CGGACGCGAG GCGGGGCCGG GACTGATCGC CGAGGATTTG
CCGGAGGTTC TGCCCGCGGT CCTGCGGCGG CTCTATGATC GGTTCGGGAT TGACTATTAA
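
The GC content reported in the gene information can be recomputed from a sequence block in the layout shown above (groups of ten bases separated by spaces). The sketch below is one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs on the first line of the gene sequence only, so its result will differ from the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of bases in the block that are G or C."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence; paste the full block to check the whole gene.
first_line = "ATGGAGATTC TGACGACGGG CGAAATGGAG CAGGCCGACC GGCTCTCGAT CGCGGGAGGC"
print(f"{gc_fraction(first_line):.0%}")
```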
 
Protein sequence
MEILTTGEME QADRLSIAGG VSGFALMLRA GQAVANAAAD LAEKGPILVV VGCGNNGGDG 
FVAATELAAR GHEVSVILLG ERDSLKGDAA LAAKGWKAPV LPCDLSALGS PALIIDALFG
AGLNRPVKGD ALATIEAIMA NGAPVLSIDL PSGINGTTGA VMGAAIRATE TVTFFRKKPA
HLLLPGRIHC GRVRVADIGI ADAVLDEIRP VTFENTPDYW NGAFPAPHID GHKYKRGHAV
VVSGHLTSTG AARLSARGAL RAGAGLVTVL SPDDALGANA AALTAVMVRR MNGAAGFAEV
LADRRLNACV IGPGAGIGEE TRALVSTALA AGRALVLDAD ALTSFADTPD RLFETVRQSG
GSDIILTPHE GEFHRLFSEM SNKHPFRSKL ERVRDAARRS GCVVLLKGAD TVVASPDGRA
TIAANAPPWL ATAGSGDVLS GIIGGLLAQS VPAFEAACMG VWMHGETGRE AGPGLIAEDL
PEVLPAVLRR LYDRFGIDY
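
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes nothing beyond the standard library: translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are allowed), so a standard codon table is used here. `translate` is an illustrative helper, not a library function; it reads from the first base in frame and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a formatted DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGAGATTC TGACGACGGG CGAAATGGAG"))  # prints: MEILTTGEME
```

Passing the full gene sequence block to `translate` should give the protein sequence listed above, without the spacing.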