Gene Information |
Locus tag | Saro_0599 |
Symbol | |
ID | 3915611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 644032 |
End bp | 644970 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640443329 |
Product | LysR family transcriptional regulator |
Protein accession | YP_495880 |
Protein GI | 87198623 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
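The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon (the record states neither convention explicitly; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(644032, 644970)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```

Under these assumptions the results agree with the Gene Length (939 bp) and Protein Length (312 aa) fields.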
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC GCGCTCCGCT GATCGACAGG AAGTTTGCCA GCCGCGTCGA CTGGAACCTG ATGCGCACCT TCGTCGACAT CGTTCGCGCA GGCGGGATCG GTGCGGCGGC GCGCCAGCTC AATCGCCAGC AGCCGAGCAT CAGCGCGGCG CTCAAGCGGC TCGAGGATCA TGTCGGCGCG AGCCTGCTGG TCCGCACCGC GACCGGGGTC GAGATGACGC CGGCCGGCAA GGCGATGATG GCGCTTTGCG AGGACATGCT GGAGACGGCG CGAATGGTGC CACACCAGAT CGCCCAGGCG ACGAGGCGCG TCGACGGGTT CGTGCGCATC CAGATCGTCT CGGGCCTCGT CTCGGCCGAA TTCGACGAGG CAATCGCCAG TTTCCACCGG CGCAATCCCG CCATCCACAT CGAGATCAGG GTGTCGCCCT GGCGCCAGGT GCTCGATGCG CTGGAGCAGG GGGAAGTGGA AATCGGCGTG GGTTATGACG GCAGCGTGCG CGGAAGCCTG ACCTACGAGC CGCTGCTGGT GGAGCGGCAA CAGCTCTACT GCTCGCGATC CAGCCCCTAC TTCGGATATC GCGTGAGCCG GTTGCACGAG TTGAAGGACG AGGGCTTCGT CCTGACGGGC GACGACGAGA TCGAACTGAT CACCAACCTG CGCCGGCGCT ACCGGCTGGG CTCCAACGTC GGTGGCATGG CAGAGGACAT CAACGAGGCG CGGCGGCTGA TCAAGCTGGG CGTGGGCATC GGCTTCCTGC CTGTGCCCGC CGCCGAGGCG GAAGTGGCGA GAGGGACGCT CTGGCCGATG CTTCACGCCG ATTTCGAGCC TTCGTACGAC GTCTATCTGC TGGCGCGGGC GGAGCCGGCG CGCGACACCG CGACGCAGCT GTTCTGGGAC GAGGTGTTCC GGCGGGTGAG GGCTTCGAGC CGGGCATAG
|
Protein sequence | MSDRAPLIDR KFASRVDWNL MRTFVDIVRA GGIGAAARQL NRQQPSISAA LKRLEDHVGA SLLVRTATGV EMTPAGKAMM ALCEDMLETA RMVPHQIAQA TRRVDGFVRI QIVSGLVSAE FDEAIASFHR RNPAIHIEIR VSPWRQVLDA LEQGEVEIGV GYDGSVRGSL TYEPLLVERQ QLYCSRSSPY FGYRVSRLHE LKDEGFVLTG DDEIELITNL RRRYRLGSNV GGMAEDINEA RRLIKLGVGI GFLPVPAAEA EVARGTLWPM LHADFEPSYD VYLLARAEPA RDTATQLFWD EVFRRVRASS RA
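The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any length or composition check. The following is a rough sketch in Python of how the GC content field could be recomputed from the gene sequence; the helper names are hypothetical, and the example uses only the first three displayed blocks, so its result is not expected to match the 67% reported for the full gene:

```python
def clean_sequence(display_text: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(display_text.split()).upper()


def gc_fraction(nucleotides: str) -> float:
    """Fraction of bases that are G or C."""
    if not nucleotides:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in nucleotides if base in "GC")
    return gc_count / len(nucleotides)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGCGATC GCGCTCCGCT GATCGACAGG"
sequence = clean_sequence(first_blocks)
print(len(sequence))          # 30
print(gc_fraction(sequence))  # GC fraction of this 30-base prefix only
```

Passing the full gene sequence through the same two functions would give the length and GC fraction for comparison with the Gene Length and GC content fields.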