Gene Nwi_1443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1443 
Symbol 
ID3677185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1571189 
End bp1572238 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content66% 
IMG OID637712995 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_318056 
Protein GI75675635 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.915684 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA GCCTGACAGG TCCTCCTCCG TCATCAAGTC TCGGACCGTT CGCGGTCGGA 
GATATTTCGA TCCGCAACCG CGTTCTTCTT GCGCCGATGT CGGGGATCAC GGACAGGCCG
TTTCGACGTC TTACGGAGTC GCTGGGGGCG GCGCTCGTTG TTTCGGAAAT GACCGCGAGC
GACGATCTCG TCCGCGGGCG TCCAATGTCC GTCCTTCGCT GTGAAGCCAC CGGTCATGGG
CCGCACGTCG TTCAGCTTGC CGGCTGCGAG ACGCGCTGGA TGGCGGAAGC CGCGCGGATC
GCCGAGGCGG CCGGCGCCGA CATCATCGAT ATCAACATGG GTTGTCCCGC CCGGCACGTG
ACAGGCGGCC AGTCGGGTTC GGCGCTGATG CGCGATCCCG ACCATGCCCT CGACCTGATC
GAGGCGACGG TCGGCGCGGT GAACGTGCCG GTCACACTCA AAATGCGCCT CGGATGGGAC
GGCCACTCGT TCAATGCGCC TTCGCTGGCG CGGCGCGCCG AATCCGCCGG CGTGCGGATG
ATCACGGTTC ATGGCCGGAC GCGCTGCCAG TTCTACAAAG GCCGCGCCGA TTGGCGGGCC
GTGCGGGCCG TGAAGGAAGC GGTGCGCGTT CCCGTCGTCG TCAACGGCGA CATCACATCG
TTCGACGCAG CCGTTGCTGC GCTGGAGGCG TCGGGGGCCG ATGCGGTCAT GGTGGGCCGC
GGCGCGCAGG GCCGCCCCTG GCTGCCGGGT CAGATCGGGC GGCGGCTCGA AACCGGCATC
GAAGAATCCC ATCCTTCGCT CACGGATCAG TTGGCTTACA TCCGCGCGCT TTATGACGAC
CTGCTTCTGC ATTACGGCCT GCGCATCGGG CTTCGTCATG CGCGAAAGCA TCTTGGCTGG
GCGCTGGATA CGGCGGCAGC GCTTCGTGCC GTGCCGACAC CGGTCCAGAA ATCGTGGCGG
ACGAAAATTC TGACGGCCGA TGATCCCTCC GGCGTACAGC GGTTACTGGT GGACGCGTTC
GACGATTTCG CGTGGAGGGC CGTGGCATGA
 
Protein sequence
MSKSLTGPPP SSSLGPFAVG DISIRNRVLL APMSGITDRP FRRLTESLGA ALVVSEMTAS 
DDLVRGRPMS VLRCEATGHG PHVVQLAGCE TRWMAEAARI AEAAGADIID INMGCPARHV
TGGQSGSALM RDPDHALDLI EATVGAVNVP VTLKMRLGWD GHSFNAPSLA RRAESAGVRM
ITVHGRTRCQ FYKGRADWRA VRAVKEAVRV PVVVNGDITS FDAAVAALEA SGADAVMVGR
GAQGRPWLPG QIGRRLETGI EESHPSLTDQ LAYIRALYDD LLLHYGLRIG LRHARKHLGW
ALDTAAALRA VPTPVQKSWR TKILTADDPS GVQRLLVDAF DDFAWRAVA