Gene Saro_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0189 
Symbol 
ID3916177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp193439 
End bp195367 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content68% 
IMG OID640442915 
Producthistidine kinase internal region 
Protein accessionYP_495472 
Protein GI87198215 
COG category[T] Signal transduction mechanisms 
COG ID[COG2972] Predicted signal transduction protein with a C-terminal ATPase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTGT CGCAAATCGA TCCGGGCGTC CCGCGCGTGC CGGCACGGCA GGTAATCCTG 
TCGGCCGTGG CGCTGTGGCT CTGCTACTTC GTGCTGATTA CCGTGCGCGG GCTGGTGGTC
GAACTGGGCG ACTTTTCGGA CCTCCTGTGG CGGCGTGCGC TGGTGACCGT TGCAGGGATC
GTCGTGACCG TAGCCTGCTG GCCGTTGCTG CGCCGCTTCG ATGCGCGCCC GCTGGCAACG
CGCGTGGGGG CGGCCCTGGT CATCATGCTT CCGGCCGCGC TTGCGCTGGC GGCAGTCAAC
CAATGGGCCT TCGCCCCGGT CGAAAAGCGG ATGATCGAGC GGATCAGCAC CGATCAGGGC
GATGAAGCCC CGACCCGCAC CTCGCAGGTC ACCGTGAAGA ACGGCGATGG CAAGGGCGGC
GCGGTCAAGA TCCGCCACGA CATGGCCGGC AACGTTCTGG TCGACGTGCT GGACGAGGGC
TTCGTTCCGC CCGTACCTGC TCCACCCCCT GCCCCGCCGC CGGCACCCCG CACCCCGCAC
CCCACCCATG CCCCGGGCTC GACGGCCCAC GCCTCGCCGA CGACGGAACC GTTGCTCAGC
GAGGAGGACG TTGCGGAACT GCGCAACCTG GGCGACGAAC GCACCATAGA GGCCTTGAAG
AACGCCGGGA TCGTGCAGAA TCCCGACGGC AGCACGGTGA TCTCCCAGCC GGGGCTCTAT
GTGCGGCAAT ATGCCGATGG CCGCTCCGAA GTGCGCACTG GCGGCAAGGT CTATGTCGTG
GGCGATGACG GCGAGGTTGA ATCGATCCTG AGCGGCGCCC GGGACCCGGT CCCGCCGGAA
CCTCCTGCCC CCGCTGCCAG CGCGGTTCCT CCAGCACCCC CGGCACCACC GGCCCCGGCA
ATTTCTGCCG CCATGCAGGA GAAGGTGCGC GAAGCCGCGC GGCGCGAGAC CGAGAAGGCG
CTGGGCAAGG CCGACGCGCT GCGCAAGGCC GCGCTGGCCC GCGCGCGCGA AGCCCGTGCC
CGACATGCCG ACGCGCATCC CGTGGCCCAG CCTTCCGTCT CGGCCCGGCC TGCCGTTTCG
GCCGAGCCCG CCGCTTCCGC CACGCCCGAA CCATCGCCCT CGGAGATCGT CACCGAGGAT
TATCGCGGCT CCGACCATGT GACGATCATC CGCCAGACGG TAGAGAACGA GGGGCTGTGG
CGGCAACTCA CCGACGTTGC CCTTGGTCGC TACTTCCTGC TCATCGCCTG GGCCGCGCTC
TATCTTGCGC TGGGCAATGG CGAGCAACTC CGCGCCGCCG AATATCGCGA GGGCGAATAC
CGGCGTGCGG CCAAGGCATC GGAATTGCGC TCGCTGCGCT ACCAGGTGAA CCCGCACTTC
CTGTTCAACA CGCTCAATTC GCTGTCGGCC CTGGTCATGG TCGGTCGCAG CGAACAGGCC
GAGCGAATGA TCCAGTCGAT CTCGCGCTTC TACCGTCACA GCCTTGCCGG CGATCCCACT
GCGGACGTCC CGCTGGAGGA CGAGATCGCG CTCCAGCGCC ACTACCTCGA CATCGAGGCG
GTGCGCTTCC CTGACAGGCT GCGGTGCGAG TTCGATGTTC CGGACGATCT GATGACGGCC
TGCGTGCCGG GCATGATCCT GCAGCCGTTG GTTGAGAACT CGATCAAGTA CGCGGTCTCG
ACCACTATCC GCCCGGTGAC GATCCGCATA GCCGCGCGCG AGGCGGGCGG TTTCCTGATC
CTGACGGTGG CAGACGACGG GCCGGGCGAA AGCTTCGTCA ACGGCGGCAC CGGCATCGGC
CTTGCCAACG TGAAGAGCCG CCTCGCCGCG CGCTTTGGCG AGGACGCGGC CAAGGTGGAA
AGCGGGCCGC TGCCGGCGGG CGGCTATGCG ACCGTGCTGA CGCTGCCGAT TGTCCGCAAC
GAATGCTGA
 
Protein sequence
MTLSQIDPGV PRVPARQVIL SAVALWLCYF VLITVRGLVV ELGDFSDLLW RRALVTVAGI 
VVTVACWPLL RRFDARPLAT RVGAALVIML PAALALAAVN QWAFAPVEKR MIERISTDQG
DEAPTRTSQV TVKNGDGKGG AVKIRHDMAG NVLVDVLDEG FVPPVPAPPP APPPAPRTPH
PTHAPGSTAH ASPTTEPLLS EEDVAELRNL GDERTIEALK NAGIVQNPDG STVISQPGLY
VRQYADGRSE VRTGGKVYVV GDDGEVESIL SGARDPVPPE PPAPAASAVP PAPPAPPAPA
ISAAMQEKVR EAARRETEKA LGKADALRKA ALARAREARA RHADAHPVAQ PSVSARPAVS
AEPAASATPE PSPSEIVTED YRGSDHVTII RQTVENEGLW RQLTDVALGR YFLLIAWAAL
YLALGNGEQL RAAEYREGEY RRAAKASELR SLRYQVNPHF LFNTLNSLSA LVMVGRSEQA
ERMIQSISRF YRHSLAGDPT ADVPLEDEIA LQRHYLDIEA VRFPDRLRCE FDVPDDLMTA
CVPGMILQPL VENSIKYAVS TTIRPVTIRI AAREAGGFLI LTVADDGPGE SFVNGGTGIG
LANVKSRLAA RFGEDAAKVE SGPLPAGGYA TVLTLPIVRN EC