Gene Nwi_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0114 
Symbol 
ID3675881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp133009 
End bp135828 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content64% 
IMG OID637711650 
ProductSignal transduction histidine kinase 
Protein accessionYP_316734 
Protein GI75674313 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.153209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAGA AGGAAAGTGT CAGTTCCGGA GCGTTCCTTG CCGGAGGCGG GGAAATGGGC 
GAGCTGACGC GCCGTTTCGA TTGGGACTCC ACATCCGTCG GCACTCCCGA TGCGTGGCCG
CAGAGCCTGC GCACGGCGGT GCGGATCGTG CTGAACACCA ACCACCCGAT GTTCATCTGG
TGGGGGTCGG ACTTGATCCA GTTCTATAAC GATGCCTACC GGCAAACCCT GGGACCGGAG
CGGCATCCCG GCGCGCTCGG TCAAAAGGGC AGGGAGTGCT GGGCCGAGAT ATGGGACATC
ATCGGCCCCC AGATCGAGTA CGTCATGTCC GGCCGCGGGG CGACCTGGCA CGAGGAGCAG
CTTGTTCCGG TCACGCGGAA CGGACGACTG GAGCAGGTCT ATTGGACCTA TGGATTCAGC
CCCATCGATG ACGAGGAAGG TGTCGGGGGC GTGCTCGTGG TATGCCGCGA TGTCACGCGC
GAACATCTTG CCAGGATGGC CTTGCAGGAG CGGGAGACCG AGCTTGCGCG CGTGCAGGCG
ATCGGGCGGA TCGGCGGCCT CGAGGTCGAT CTGCGCACCG GCCACCGCAG TCGCAAGTCG
CCGGAGTACT TGAAGATACA CGGCCTTCCG TCCGACGCCG TCCACGAAAG CCGGGAGGAC
TGGCTTCGCC GCGTTCATCC CGATGATCGC GCGGTCGCCG AGGGGCAGTT CATCGATGCG
ATCCGGGGCG GCGATCGCGA ATACGCGGTA CGCTACCGGA TCATCCGGCC GAACGACGGC
GAGACGCGCT GGATTTCGGC CCGATCGGTG ATCGAGCGCG ATGCGGACGG GAAAGCGATC
CGCCTGTTCG GCGCCCACAT CGACATCACC GACCAGGTCG AAGCGGAACG CTCGATCCGC
CAGCGCGAAC AGGAATTTCG CACGCTTGCA GAAGCGCTGC CGCATCACGT CTGGACCGCC
ACATCCGACG GCTCGTTCAA CTGGTTCAAT CGGCGCTTCT ACGATTACGT CGGCGCACGG
CCGGGCGAAC CCGGCGGCAA CGACTGGAAT AGGATCGTTC ATCCCGATGA TGCGCCAGCC
GCGGCCGCGG CGAGGACTCG CGCGGTAACC AGCGGCGAAC CCTATGAGAT CGAATTCAGG
CTGCGGAGAT CCGACGGCGT CTATCACTGG TTTCTGTCCC GCGCCTTGCC GGCGCGCGAT
GAAGATGGTC GCATCATCCG CTGGATCGGC ACCGACACCG ATGTCCACGA TCAGAAGCGG
ATCGCGGCAA AGCTTGCCGA ACTGAACGCG ACGCTCGCCG AGCGCGTCGA GGAGAAAACC
CGCGAGCGCG ACAGGATCTG GAATGTGTCC CAGGACTTGC TTCTGGTCAT GGACCAGCAT
GGCATCTGGC GCTCCGTCAA TCCCGCCTGG ACGAGGACGC TCGGCTGGAG CGAGGCTGAG
CTGACGGAGC GGACCGCCGA ATGGATCGAG CATCCCGATG ACATCGTCAA GACCCGGGTT
GAGGTCCGCC GTCTCGCTCG CGGTAAAGCG ACGGTTCGGT TCGAAAACCG GCTTCGTCAC
AAGGACGGCT CCTACAGGTG GCTGTCGTGG ACCGCGGTTC CGGACCAGAA CCTTATTTAC
GCCGTGGCGC GCGATGTCAC TGCCGAGAAG GCGGCCGCCG AGCGGCTGAG GGCGGCCGAA
GAAGCGCTGC GCCAGTCGCA AAAGATGGAA GCCGTCGGCC AGTTGACCGG CGGCATCGCC
CACGACTTCA ACAACCTGCT CACCGGTATC GTCGGCTCGC TGGATCTGTT GCAGACGCAC
ATCAAGCAGG GGCGCACCGA TCGCGCGGGC CGTTACATCG AAGCCGCGAT GGCCTCGGCC
AATCGGGCCG CGGCGCTGAC GCACCGCCTG TTGGCGTTCG CCCGCAGACA GCCCCTGGTG
CCGAAGCCGG TCGACGCCAA CCAGCTCATC CTCTCGCTCG AGGACCTGCT GCGCAGGACG
ATCGGCGAGG CCATCGAGCT GAAAATCGCT CCATCGAAAC TTCTCTGGCC GACGCTGTGC
GATCCCAATC AGCTCGAAAG CGCGCTGCTC AATCTCGCCA TCAACGCACG GGACGCCATG
CCCAACGGCG GCAGGCTCAC GATCGCCACG GCGAATGTGA GCCTCGACAC CGCCGGGTCC
GGCTCCTCCA CCAGGCAGCC GGGCGACTAT ATCTGCATCA CCGTCGCCGA TACCGGAACA
GGCATGACCG CCGATGTCGC CGCGCGCGCC TTCGATCCCT TTTTCACCAC CAAACCGATC
GGCCACGGCA CCGGACTAGG CCTGTCGATG ATCTACGGGT TCGCGCGGCA ATCGAACGGC
CACATCGCGC TCGACACCGC GCCGGGACGA GGCACGTCGA TCAGCCTGTC TCTGCCGCGT
CATGACAGCG TCGCGGAGGC TGTTCAAGCC TCTCCCGCAA GCAGGATCAC CGCCACAGGG
GGAACGGTTC TGGTGGTCGA GGACGAGCCT GTCGTGCGCG GCATCATCGT CGAGATGCTG
CATGATCAGG GTTATGTCAC GCGCGAGGCC GCGGACGGAG CGGAAGGCTT GCGGATTCTG
CAATTGGACA AGCCGGTCGA TCTCCTGCTC ACCGACATCG GACTTCCCGG CATGAACGGA
CGCCAACTCG CGGATCAGGC GCGCGAACTC CGCCCGGATC TGAAAATCCT CTTCATGACC
GGCTATGCGG ACAATGCCGC AAACGCCAAG GGCTTCCTCC AACCCGGCAT GGACATGATC
ACCAAGCCGT TCGATCTCGG CCATCTCTCA CGGCGTGTGC GTGACATCAT TTCCCACTGA
 
Protein sequence
MPEKESVSSG AFLAGGGEMG ELTRRFDWDS TSVGTPDAWP QSLRTAVRIV LNTNHPMFIW 
WGSDLIQFYN DAYRQTLGPE RHPGALGQKG RECWAEIWDI IGPQIEYVMS GRGATWHEEQ
LVPVTRNGRL EQVYWTYGFS PIDDEEGVGG VLVVCRDVTR EHLARMALQE RETELARVQA
IGRIGGLEVD LRTGHRSRKS PEYLKIHGLP SDAVHESRED WLRRVHPDDR AVAEGQFIDA
IRGGDREYAV RYRIIRPNDG ETRWISARSV IERDADGKAI RLFGAHIDIT DQVEAERSIR
QREQEFRTLA EALPHHVWTA TSDGSFNWFN RRFYDYVGAR PGEPGGNDWN RIVHPDDAPA
AAAARTRAVT SGEPYEIEFR LRRSDGVYHW FLSRALPARD EDGRIIRWIG TDTDVHDQKR
IAAKLAELNA TLAERVEEKT RERDRIWNVS QDLLLVMDQH GIWRSVNPAW TRTLGWSEAE
LTERTAEWIE HPDDIVKTRV EVRRLARGKA TVRFENRLRH KDGSYRWLSW TAVPDQNLIY
AVARDVTAEK AAAERLRAAE EALRQSQKME AVGQLTGGIA HDFNNLLTGI VGSLDLLQTH
IKQGRTDRAG RYIEAAMASA NRAAALTHRL LAFARRQPLV PKPVDANQLI LSLEDLLRRT
IGEAIELKIA PSKLLWPTLC DPNQLESALL NLAINARDAM PNGGRLTIAT ANVSLDTAGS
GSSTRQPGDY ICITVADTGT GMTADVAARA FDPFFTTKPI GHGTGLGLSM IYGFARQSNG
HIALDTAPGR GTSISLSLPR HDSVAEAVQA SPASRITATG GTVLVVEDEP VVRGIIVEML
HDQGYVTREA ADGAEGLRIL QLDKPVDLLL TDIGLPGMNG RQLADQAREL RPDLKILFMT
GYADNAANAK GFLQPGMDMI TKPFDLGHLS RRVRDIISH