Gene Saro_0036 details

Gene Information

Locus tag: Saro_0036
Symbol:
ID: 3916039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 37528
End bp: 39198
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 67%
IMG OID: 640442761
Product: histidine kinase
Protein accession: YP_495319
Protein GI: 87198062
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
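
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(37528, 39198)
print(length, protein_length_aa(length))  # 1671 556
```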


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.526737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCGAAAG CGCGCCCATC CCCGCAGGCT GCCGCGCTGG ACCGCGAGAT CTACCGCATC 
CTGTTCGAGA ACATGGACGA TGGCTTCTGC GTCATCGAGT TTCTCGACGG GCCGCATGGT
CCGCTCAGCG ATTACGTCCA CGTCCTCGCC AACTCCGCCT ATGAACGCAA CACCGGCATT
CCCAACGTCG TCGGCCAGTA CCTGCGCGAG ATGGTGCCCG ACGAGGCTGA CGACTGGATC
GCGTTCTACG GCAAGGTCCT GCGCACCGGA GAACCGATCC GCTTCCGCAA CGAACTGGTC
GTCACCGGCC GCCATCTCGA AGTCTCGGCC TTCCGCCTCG GAACCTTCGA GGACCGGTTG
GTGGCAGTCC TTTTCAAGGA CGTCACGGAA CGGGTCGACG CCGAAAGCGC TCTGCACCAG
CTGAACGAAA CCCTCGAGCA GCGCGTCGCC GATGCGCTGG CGCAGCGGGA GCTGGCCGAA
AGCGCGTTGA GGCAGGCGCA GAAGATGGAA GCGGTCGGCC AGTTGACCGG CGGCCTTGCG
CACGATTTCA ACAATCTCCT CGCCGGGATC ACCGGGGCGT TCGAGATGAT CGCGCGACGA
CTGGAGCAGG GCCGCACCGC CGATGTAGAG CGTTATCTTG CGGCGGGCCT CGGAGCGGCC
CACCGCGCGG CCGCGCTCAC GCACCGCCTG CTTGCCTTCT CACGCCGCCA GACGCTGTCC
CCCCGCACGA CCGAGGCGAA CCGCCTGCTG GTCGATTTCG CCGAACTGGT GCGGCGCACG
GTCGGCCCGC AGATCGCGGT CGAGGTGCGC ACCAACCCCG CGCTATGGGC CGCGCTGGTC
GATGCAAACC AGCTCGAGAA CGCGCTGCTG AACCTGTGCA TCAATGCTCG CGACGCCATG
CCGGACGGCG GTCGACTTGC CATCGAAACC GATAACGTCA CCCTCGACGA AGCCAGCGCT
ACCGAGCGGG GCCTTCCGCC AGGCGACTAC GTGACGATAA CCGTGAGCGA CACCGGCGTC
GGCATTCCGG AAGAGGATCT CGACCGCGTG TTCGAGCCGT TCTTCACCAC CAAGCCGACC
GGGCGCGGAA CGGGCTTGGG GTTGTCGATG GTCTACGGCT TTGCCCGGCA GAGCGATGGT
ATCGTGCGCA TCCGCTCCCG CCCCGGCGAA GGCACGCAGG TCCGCATCTA CCTTCCGCGC
CACGAGGGCC CGGCCGAAGT CCCTCGCGAA ACCGGCGTTC ACGAACCGCA GCCCGAATTG
CCTGGCGCGA CGGTCCTCGT CGTCGACGAC GAGCCGACGG TGCGCATGAT GATGGTCGAT
GCGCTCGACC TGATCGGTGT CGAATGCCTC GAGGCGCACG ACGGGCCCGC CGCGCTCGCG
ATGCTCGAAC GCCACCCGGG GATCGACCTG CTCGTCACCG ACGTCGGGCT TCCCGGCGGC
CTCAACGGCC GCCAGGTGGC CGACGAGGCG CGGCGGCGGC GGTCGGACCT GGGAGTCCTC
TTCGTCACCG GCTATGCCGA CAGCGTAATC CTCCAGCGCG ACACCATGGA ACCGGGCATC
GACATCCTGA CCAAGCCGTT CACCATCGAG GACCTGCAAT CGCGCGTCGC AGTGCTGCTT
TGCGCGCGTG GGGGGCCAGG CAGGCCTGAA ACCGGACCGA ACGCCGCCTA G
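
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, a GC fraction comparable to the GC content field in the record could be computed as follows. The helper name is hypothetical, and the short input string is a toy example rather than data from this record.

```python
def gc_fraction(spaced_dna: str) -> float:
    """Fraction of G and C bases in a sequence that may contain whitespace."""
    dna = "".join(spaced_dna.split()).upper()  # drop spaces and line breaks
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Toy input in the same blocked layout; 4 of 8 bases are G or C.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Passing the full gene sequence text to the same function would give the fraction for this gene.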
 
Protein sequence
MAKARPSPQA AALDREIYRI LFENMDDGFC VIEFLDGPHG PLSDYVHVLA NSAYERNTGI 
PNVVGQYLRE MVPDEADDWI AFYGKVLRTG EPIRFRNELV VTGRHLEVSA FRLGTFEDRL
VAVLFKDVTE RVDAESALHQ LNETLEQRVA DALAQRELAE SALRQAQKME AVGQLTGGLA
HDFNNLLAGI TGAFEMIARR LEQGRTADVE RYLAAGLGAA HRAAALTHRL LAFSRRQTLS
PRTTEANRLL VDFAELVRRT VGPQIAVEVR TNPALWAALV DANQLENALL NLCINARDAM
PDGGRLAIET DNVTLDEASA TERGLPPGDY VTITVSDTGV GIPEEDLDRV FEPFFTTKPT
GRGTGLGLSM VYGFARQSDG IVRIRSRPGE GTQVRIYLPR HEGPAEVPRE TGVHEPQPEL
PGATVLVVDD EPTVRMMMVD ALDLIGVECL EAHDGPAALA MLERHPGIDL LVTDVGLPGG
LNGRQVADEA RRRRSDLGVL FVTGYADSVI LQRDTMEPGI DILTKPFTIE DLQSRVAVLL
CARGGPGRPE TGPNAA
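
The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table, on the assumption that table 11 uses the standard codon assignments for elongation. The function and constant names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in NCBI order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(spaced_dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = "".join(spaced_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_table11("TTGGCGAAAG CGCGCCCATC CCCGCAGGCT"))  # MAKARPSPQA
```

The output matches the first 10 residues of the protein sequence in the record.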