Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0036 |
Symbol | |
ID | 3916039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 37528 |
End bp | 39198 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640442761 |
Product | histidine kinase |
Protein accession | YP_495319 |
Protein GI | 87198062 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.526737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCGAAAG CGCGCCCATC CCCGCAGGCT GCCGCGCTGG ACCGCGAGAT CTACCGCATC CTGTTCGAGA ACATGGACGA TGGCTTCTGC GTCATCGAGT TTCTCGACGG GCCGCATGGT CCGCTCAGCG ATTACGTCCA CGTCCTCGCC AACTCCGCCT ATGAACGCAA CACCGGCATT CCCAACGTCG TCGGCCAGTA CCTGCGCGAG ATGGTGCCCG ACGAGGCTGA CGACTGGATC GCGTTCTACG GCAAGGTCCT GCGCACCGGA GAACCGATCC GCTTCCGCAA CGAACTGGTC GTCACCGGCC GCCATCTCGA AGTCTCGGCC TTCCGCCTCG GAACCTTCGA GGACCGGTTG GTGGCAGTCC TTTTCAAGGA CGTCACGGAA CGGGTCGACG CCGAAAGCGC TCTGCACCAG CTGAACGAAA CCCTCGAGCA GCGCGTCGCC GATGCGCTGG CGCAGCGGGA GCTGGCCGAA AGCGCGTTGA GGCAGGCGCA GAAGATGGAA GCGGTCGGCC AGTTGACCGG CGGCCTTGCG CACGATTTCA ACAATCTCCT CGCCGGGATC ACCGGGGCGT TCGAGATGAT CGCGCGACGA CTGGAGCAGG GCCGCACCGC CGATGTAGAG CGTTATCTTG CGGCGGGCCT CGGAGCGGCC CACCGCGCGG CCGCGCTCAC GCACCGCCTG CTTGCCTTCT CACGCCGCCA GACGCTGTCC CCCCGCACGA CCGAGGCGAA CCGCCTGCTG GTCGATTTCG CCGAACTGGT GCGGCGCACG GTCGGCCCGC AGATCGCGGT CGAGGTGCGC ACCAACCCCG CGCTATGGGC CGCGCTGGTC GATGCAAACC AGCTCGAGAA CGCGCTGCTG AACCTGTGCA TCAATGCTCG CGACGCCATG CCGGACGGCG GTCGACTTGC CATCGAAACC GATAACGTCA CCCTCGACGA AGCCAGCGCT ACCGAGCGGG GCCTTCCGCC AGGCGACTAC GTGACGATAA CCGTGAGCGA CACCGGCGTC GGCATTCCGG AAGAGGATCT CGACCGCGTG TTCGAGCCGT TCTTCACCAC CAAGCCGACC GGGCGCGGAA CGGGCTTGGG GTTGTCGATG GTCTACGGCT TTGCCCGGCA GAGCGATGGT ATCGTGCGCA TCCGCTCCCG CCCCGGCGAA GGCACGCAGG TCCGCATCTA CCTTCCGCGC CACGAGGGCC CGGCCGAAGT CCCTCGCGAA ACCGGCGTTC ACGAACCGCA GCCCGAATTG CCTGGCGCGA CGGTCCTCGT CGTCGACGAC GAGCCGACGG TGCGCATGAT GATGGTCGAT GCGCTCGACC TGATCGGTGT CGAATGCCTC GAGGCGCACG ACGGGCCCGC CGCGCTCGCG ATGCTCGAAC GCCACCCGGG GATCGACCTG CTCGTCACCG ACGTCGGGCT TCCCGGCGGC CTCAACGGCC GCCAGGTGGC CGACGAGGCG CGGCGGCGGC GGTCGGACCT GGGAGTCCTC TTCGTCACCG GCTATGCCGA CAGCGTAATC CTCCAGCGCG ACACCATGGA ACCGGGCATC GACATCCTGA CCAAGCCGTT CACCATCGAG GACCTGCAAT CGCGCGTCGC AGTGCTGCTT TGCGCGCGTG GGGGGCCAGG CAGGCCTGAA ACCGGACCGA ACGCCGCCTA G
|
Protein sequence | MAKARPSPQA AALDREIYRI LFENMDDGFC VIEFLDGPHG PLSDYVHVLA NSAYERNTGI PNVVGQYLRE MVPDEADDWI AFYGKVLRTG EPIRFRNELV VTGRHLEVSA FRLGTFEDRL VAVLFKDVTE RVDAESALHQ LNETLEQRVA DALAQRELAE SALRQAQKME AVGQLTGGLA HDFNNLLAGI TGAFEMIARR LEQGRTADVE RYLAAGLGAA HRAAALTHRL LAFSRRQTLS PRTTEANRLL VDFAELVRRT VGPQIAVEVR TNPALWAALV DANQLENALL NLCINARDAM PDGGRLAIET DNVTLDEASA TERGLPPGDY VTITVSDTGV GIPEEDLDRV FEPFFTTKPT GRGTGLGLSM VYGFARQSDG IVRIRSRPGE GTQVRIYLPR HEGPAEVPRE TGVHEPQPEL PGATVLVVDD EPTVRMMMVD ALDLIGVECL EAHDGPAALA MLERHPGIDL LVTDVGLPGG LNGRQVADEA RRRRSDLGVL FVTGYADSVI LQRDTMEPGI DILTKPFTIE DLQSRVAVLL CARGGPGRPE TGPNAA
|
| |