Gene Nwi_0473 details


Gene Information

Locus tag: Nwi_0473
Symbol:
ID: 3676505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 529921
End bp: 531411
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 67%
IMG OID: 637712015
Product: hypothetical protein
Protein accession: YP_317092
Protein GI: 75674671
COG category: [S] Function unknown
COG ID: [COG4223] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
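
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(529921, 531411)
print(length, protein_length(length))  # 1491 496
```

Both values match the Gene Length and Protein Length fields of the record.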


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.926396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATG ACAAGCATGA CGACGCCTCC CAGGAGACAA ACTCACGTCC GCTTTCCACA 
GGCGACGAAG AGACTGGAAC CTCCGAGGCG GCGGAGCGCG ATGTTCAGGC AGACGCGGGT
GAAACGGCCG GCGATCACCA AAGCGATGAT TGGCCGTTTA CGACGCCTCC TGCTTCGGCC
GCGGATGACC GGCAGGACGG TTTCTCTCAG GACGATTTCT CTCAGGACAG TGGCTGGCAG
GATCAGTCCT CGGCCGCGAA CCGCGAGACC GACCGGAATT CAGAGGCGGC GGGCGATGCC
GGCTCTGGCG GCGGGTCCGG CGATCAACCG TCCGGCGAGC AGCCGGTTGC TCCGCGTTCT
CCGTCCCGCG CGTCGTCGGC CCTGATCGCC GCCGTGAGCG GTGCCGCTGC CGCAGGGATT
GTGGCTGGCG GCGCATGGCT GGCGGGTTGG CCGCCGCCAT CCTCGGCTCC GCCCCCGGCG
TCGCAAGTGA GCCAGGCGGA GCAGGTGAGC CAGGCGGAAT TCAGCGCTCT CGCGAACCGG
TTGGCCCGGA TCGAATCGAC GGCGAGCCAG CCGGCCGCTT CGCCGCTCGA TCAGGCCACG
GCGGAACGCA TCGAGACGCT GGACAAATCG GCCGCATCCT TGCGTGAGGA CCTCACGACG
TTGCGCGAGG AGTTGGCCTC GGTCCGGGGC CAGTCCGACA GGCTCGCCGC GGCGCTGAAG
GACGGCGCGG CCGCGCCGAG GAACGATGGC GAAACGCCGT CTGATGTAGC TTCATCCGAC
CTTGCGCCTT CCGATCTCGC GGCTATCAGC GCGCGCCTCG CGCAGACCGA ACAACAGATC
GAGCAGATGA CGCAGAGCCT CACGGCTGAG ATAGCCAAGC GCGATGCGGA AACAGCCAGG
CGTGACACGG AGTCAGCCAG GAACAGCGAG GAGACGGCCA AGTCCAAGCA GGCGGCCCCG
GCCGACGACG GACCGCTTCG GCAGGCCGTT GCTGCGACGC TGCTGAACGT CGCCGTGCGT
CAGGGGCAGC CTTATGGCGA TCTGTTGACG GCATTCAGGA CGCTGGTCTC CGAGCCGGAT
ACCTTGAAAC CGCTGGAGAC GTTCGCGGAG TCCGGAGTGC CGAGCGCGAA CGCGTTGTGT
CGCGAACTTC TTGCGATAGT GCCGAAACTC GAGCCGCCGC CGGTTAGGCT CTCCTCCGCC
AACGACATCG TCAACCGGCT GCAGGAAAGC GCCGTGCGTC TGGTGCGAAT CGAGCGCCTG
GACACCGCGC CGGCCGGAGA ATCCGCCGGC GCCGTCGTCG CGCGCATCAA GGTGGCGGCC
CGGCACAACG ATGTCGCCAC CGCGAGAAAA GAACTGAATA CGCTTCCGCC GACCGATCGT
ACCGCCGCGG AGTCCTGGAT CGCCAAGGTC GACGCCCGCG ATGCTGCTCT TGAAACTGCG
CGTCAATTCG CTTCCGAAGC GATGGCCGCG CTGGCCAAAC CGGCGCCATA G
 
Protein sequence
MADDKHDDAS QETNSRPLST GDEETGTSEA AERDVQADAG ETAGDHQSDD WPFTTPPASA 
ADDRQDGFSQ DDFSQDSGWQ DQSSAANRET DRNSEAAGDA GSGGGSGDQP SGEQPVAPRS
PSRASSALIA AVSGAAAAGI VAGGAWLAGW PPPSSAPPPA SQVSQAEQVS QAEFSALANR
LARIESTASQ PAASPLDQAT AERIETLDKS AASLREDLTT LREELASVRG QSDRLAAALK
DGAAAPRNDG ETPSDVASSD LAPSDLAAIS ARLAQTEQQI EQMTQSLTAE IAKRDAETAR
RDTESARNSE ETAKSKQAAP ADDGPLRQAV AATLLNVAVR QGQPYGDLLT AFRTLVSEPD
TLKPLETFAE SGVPSANALC RELLAIVPKL EPPPVRLSSA NDIVNRLQES AVRLVRIERL
DTAPAGESAG AVVARIKVAA RHNDVATARK ELNTLPPTDR TAAESWIAKV DARDAALETA
RQFASEAMAA LAKPAP
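
The protein sequence can be related to the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 (which are the same as those of the standard code) and ignoring the table's alternative start codons, which do not matter here because the gene begins with ATG. The helper name `translate` and the table construction are illustrative. Only the first 30 and last 21 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGCAGATG ACAAGCATGA CGACGCCTCC"))  # MADDKHDDAS
print(translate("CTGGCCAAAC CGGCGCCATA G"))           # LAKPAP
```

The two outputs agree with the first ten and last six residues of the protein sequence above.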