Gene Information |
Locus tag | Saro_3797 |
Symbol | |
ID | 5077945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 443505 |
End bp | 445259 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640481520 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001166182 |
Protein GI | 146276022 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
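The length fields in the record above are consistent with each other. The sketch below checks that, assuming Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Hypothetical helpers (not from the source record) for checking the
# coordinate arithmetic of a CDS annotation.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(443505, 445259)
print(length, protein_length(length))  # prints: 1755 584
```

Both results match the Gene Length (1755 bp) and Protein Length (584 aa) rows.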
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.231924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTTTAA ATCTCAGCCT GTTCGGGCAC CCCCGTCTTG GCCTGACCAA TGGGGAACGC GTCCCGTTGC GCAGCAAGAA AGGGACCGCG CTTCTCGCCC TCCTAGCCAC GGCCGAAAAG GGTGAGCGAT CGCGGACCTG GCTCCAGCAA AAGCTGTGGG GATCGCGCGA TGTCCATCAG GCCCAGGCCA GCCTGCGCCG CGAATTGGCG AACTTGCGCA AGCTGGTTCC GCTGGATCTC GATTGGCTGG TGTCCGACAA TCATTCCGTG CGCATCGACC TCGATCTCGT TGACGTCGAT GTGCGGGGGC CGGCGAAGGA CGCTCCGCAG GGCGAGTTCC TCGAAGGCCT CGATATCCCC GGCGAAGAAG ATTTCGAGGA CTGGCTGCGG GAAGTCCGAG CGCAGTTTTC GCAGACGCAG GCATCCGGGC GGTTTGACAA GGGCGGGCGC GACGACATCG CCGGGGGCAG GTTGCGTGGT CCCGTTCGGG CGGGAGCCGA TTTCTCTCCC TTTGCGCAGG CTCTTGTCCC AGTCGCCGAA CCGGCCGAAC TCCGGCCCGT GCTCGCCATC CTGCCGGTGC GCGGTCTCTT CACGTCGCAG ACCGAAGAGC CGTTTCTTCA AGCGGTGACG CGCCTGCTCG TCAGCAGCGT AACCCGTCTT CGCTGGTTGC CGGTGCTGAC CGCGAGCGTT GCTGACGACA GCATCTACGG CGCAATCGGG CCAGAGGATG CAAAGGCCCA GGCCAACGCC AGATATGTGC TGGAATCGGA ACTGGTGCGT TCGGGCAGTC AGGCGATCCT GAGCTTCACC TTGCTGGAAA TGCCCCTGCG CACGGTCCTG TGGAGCGACA GCGAACCCGT TGCACCGACT TTCGACATCC ACGAGATCGA GCAAGTCCTG TTTCGCGCGG TCAACCTCCT GTCGGCGCGC TTCGATCGCT GCGAGCAGCA TCGGGTCATG GCCCGCAAGC CCGATCCCGC CAATCTTGCG GATGCGGTCT GGCGGATCCG GTATCACCTC CAGAAGTTCA CGCGCGAGGA CATGGCAACC GCGGCGCAAC TGCTGGATCA GGCGTTCGAA CTCGATCCGC ACCACGCCGA ACTTGCCATG CTCCGCGCCA ACCACGTGAT CTGGGATCAC TGGCTGCGCC GCGTTCCCGT CGAGAAGTGC GATGGGCTCG TGCCGCTGGC CCGTGCCGCA GTGCGCGCGG ACCCGGCGGA CGCACGCGGG CCGCTGTTCA TGGGCGTGCT CGAGACCTGG CGCAGGAATA CCGAACACGC AATCCGCTAT CTGGAGCGTA GCTGCGAACT GAACCCCTGC TTCCCGGCAG CCTTTTCCCA TCTTGGCGCC GCCTATTACC TCAACGGCAA GCCCGAGCGC TCGATCGAGC CGCTCGAGCG CTCTCTCTAT CTTTCCCCGA TGGATCCGTT CCGCTTCTTC ATTCTCGGAG AACTCGGGAC AGCGCGCTTG CTGTTGGGGG AGCACGCCGA GGCGCTGCGC ATTGCGCGCG AAGTGCAGCT TACCCATCCC AACTATGTCC TCGCCCACAT CCTCGAGACG AATGCGCTCG TCGGGCTGGG AGAGATGGAA CGCGCGCGGG CGGCCTGGTC GCGATTGCTG GCCGACCGGC CCGATCTCTA CGAGGGCATG CTCGCCTGGA TTCCCTTTCG TGAAACGAGC TGGCTTCAGC GCCTTCGCCA AGGGTGCGAC CTGATTGCTG CCGACTGGCA AAAGCCCAGG CTCGCAGCCG GATAA
|
Protein sequence | MRLNLSLFGH PRLGLTNGER VPLRSKKGTA LLALLATAEK GERSRTWLQQ KLWGSRDVHQ AQASLRRELA NLRKLVPLDL DWLVSDNHSV RIDLDLVDVD VRGPAKDAPQ GEFLEGLDIP GEEDFEDWLR EVRAQFSQTQ ASGRFDKGGR DDIAGGRLRG PVRAGADFSP FAQALVPVAE PAELRPVLAI LPVRGLFTSQ TEEPFLQAVT RLLVSSVTRL RWLPVLTASV ADDSIYGAIG PEDAKAQANA RYVLESELVR SGSQAILSFT LLEMPLRTVL WSDSEPVAPT FDIHEIEQVL FRAVNLLSAR FDRCEQHRVM ARKPDPANLA DAVWRIRYHL QKFTREDMAT AAQLLDQAFE LDPHHAELAM LRANHVIWDH WLRRVPVEKC DGLVPLARAA VRADPADARG PLFMGVLETW RRNTEHAIRY LERSCELNPC FPAAFSHLGA AYYLNGKPER SIEPLERSLY LSPMDPFRFF ILGELGTARL LLGEHAEALR IAREVQLTHP NYVLAHILET NALVGLGEME RARAAWSRLL ADRPDLYEGM LAWIPFRETS WLQRLRQGCD LIAADWQKPR LAAG
|
| |
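The gene sequence starts with TTG, yet the protein starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that rule in plain Python. The names `translate_cds` and `gc_fraction` are hypothetical, and the spaces between 10-base blocks are assumed to be display formatting only.

```python
# Minimal sketch: translate a CDS under NCBI translation table 11.
# Table 11 uses the standard codon assignments; it differs in which
# codons may initiate translation.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop whitespace (the record prints bases in blocks of 10)."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCGTTTAA ATCTCAGCCT GTTCGGGCAC"))  # prints: MRLNLSLFGH
```

Passing the full Gene sequence field to `translate_cds` and `gc_fraction` is one way to compare against the Protein sequence and GC content rows of the record.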