Gene Nwi_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1029 
Symbol 
ID3674880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1124334 
End bp1125869 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content58% 
IMG OID637712579 
ProductSignal transduction histidine kinase 
Protein accessionYP_317643 
Protein GI75675222 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAGCG ACGATCTTTC ATCCTTGAAC AACCTGTTCC GCGACGAGGA TTTGCAACGC 
GGCATCGACA AGTCAGGGGT CGGCATCTGG GATCTCGATC TCTCGACCCG AAAACTGTTC
TGGTCCAACC CCGCGCGAAG GCTTTTTGGC GTCCCGGAGA ACATCCCGGT CAGTTACGAT
CTATTCCTCT CCCTGCTCCG ACGGGATGAT CGAGCGCGCA CCAAACAGGC TATTCAACGG
TCGATCACGA CCGGATGCAG GTTCGATCTT CAATACCACA TTCATGGGAC ACTTGAGGGA
AGTCGCTGGA TTCGGGCACG AGGCGGTCTC GTCTTCAACG AAGACGGAAC GCCACAGCGC
CTGAGCGGCG TGATCCTCGA TATCGACCAT CAGAAGTCGA TCGAAGAAGC TCTGATCGCG
CGCGAAGACC ACTTGCGCTC GATTCTCGAT ACCATTCCCG ATGCGATGAT TGTGATTGAC
GATCGCGGCA TCATGCAGCA CTTCAGCACC GCAGCAGAAC GTCTGTTCGG CTATTCGGAA
CACGAGGCCA TTGGCCAGAA TGTCCGCATC CTCATGCCCG AGCCGGACAG CAGCCGACAC
GACGGCTATC TGGCTCGCCA TCAATCAACC GGCGAACGGC GCATCATCGG CATCGGACGG
GTGGTCACCG GAAAACGCCG CGATGGGAGT ACCTTTCCAT TGCATTTGTC CGTCGGCGAG
ATGCACTCCG GTGGAAAACG GCATTTTACC GGATTTGTTC GCGATCTCAC GGAGCATCAG
CAAACCCAGG TCAAGCTGCA GGAACTGCAA TCCGAACTGT TTCACGTATC CCGCCTCAGC
GCCATGGGTG AGATGGCTTC GGCTCTTGCT CACGAGTTGA ACCAGCCGTT GACCGCGATC
AGCAACTACA TAAAGGGATC GCGCCGCCTT CTGGCCGCGA GCACCGATCC CGACATCGCG
AAGATCGAAA TGGCGCTGGA TCGCGCCGCC GACCAGGCCA TTCGCGCCGG CCAGATCATT
AGACGACTGC GCGATTTCGT CTCACGGCGG GAATCGGAGA AGCGCGTCGA AAGCCTGTCG
AAGCTGATGG AAGAAGCCAG CGTCCTCGGT CTCGCGGGCG CCCGCGAACA AAACATCATC
CTGTTGACTG ATCTGAATGC GGACTGCGAT CGCGTCCTCG TCGACCGCGT GCAAGTTCAG
CAGGTCCTCG TCAACCTCTT TCGAAATGCC CTGGAGGCGA TGGCGAAATC GCCGAAACGG
GAACTTACCG CCTCCAGCAC GAAAGCCCCG GACAACATGG TCGAACTGTC CATATCGGAT
ACCGGCCATG GCTTCAGTGA GGAGGCGGCG GCAAACCTGT TCGATACTTT TTTTACGACC
AAGGAATCCG GAATGGGCGT GGGACTTTCC ATCAGCCGAT CGATCGTGGA AGCCCACGGC
GGCCGAATGT GGGCCGAAAC GAATGATTCA GGCGGCGCTA CGATGCGACT GACTTTGCCC
ATAGCATCGA TCGGGGATTT TCCGGATGTC ACATAA
 
Protein sequence
MTSDDLSSLN NLFRDEDLQR GIDKSGVGIW DLDLSTRKLF WSNPARRLFG VPENIPVSYD 
LFLSLLRRDD RARTKQAIQR SITTGCRFDL QYHIHGTLEG SRWIRARGGL VFNEDGTPQR
LSGVILDIDH QKSIEEALIA REDHLRSILD TIPDAMIVID DRGIMQHFST AAERLFGYSE
HEAIGQNVRI LMPEPDSSRH DGYLARHQST GERRIIGIGR VVTGKRRDGS TFPLHLSVGE
MHSGGKRHFT GFVRDLTEHQ QTQVKLQELQ SELFHVSRLS AMGEMASALA HELNQPLTAI
SNYIKGSRRL LAASTDPDIA KIEMALDRAA DQAIRAGQII RRLRDFVSRR ESEKRVESLS
KLMEEASVLG LAGAREQNII LLTDLNADCD RVLVDRVQVQ QVLVNLFRNA LEAMAKSPKR
ELTASSTKAP DNMVELSISD TGHGFSEEAA ANLFDTFFTT KESGMGVGLS ISRSIVEAHG
GRMWAETNDS GGATMRLTLP IASIGDFPDV T