Gene Information |
Locus tag | Saro_1867 |
Symbol | |
ID | 3917088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1968031 |
End bp | 1968969 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444611 |
Product | AraC family transcriptional regulator |
Protein accession | YP_497141 |
Protein GI | 87199884 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
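The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete, unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1968031, 1968969)
print(length, protein_length_aa(length))  # prints: 939 312
```

Under those assumptions the result agrees with the Gene Length (939 bp) and Protein Length (312 aa) fields.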
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.122856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGTGCGC TTCTCAAGTT TCATACGCAG GACGTCGCTC CGCAGGACCG GGCGCGCTAC TGGAACGAGA TTGCCGACCG GGTCTTCACG GGCACGTTCG TCAACGTTCC GGGCGAGGAT TTCAGCGGCC GGATGCTGTC GTGGCGCGTC GGCGAACTCG ACATGATCCG TACGGATTCG ACCCATTCCG GGGTCGGCCG CACCCCCATC GCGCAGGACG ACGAAAGGCT GATCCTGCAT CTGCAATGCC GCGGCACCAG CCAGCACATG CAGAAGCAGG CCGAATGCGC GCTCGAGCCG GGCGACTTCG TACTGGCGAG CCCGCACATT CCCTATTCGA TCAAGCTGAC CGGGCACGAG ATGCTGGTCG TCGAGTTCCC GCGCGCGCCC CTGGCGGAGC GGTTTCCCGG CGTGGACGAT GCCTTGTTGC AGCGCATGTG CGGTGCGTCG CCCGGCGGAC GCGTGTTCCA CGACTTCCTG CTTTCGTTGT GGCAGCAGGG TGAACGGGCC GCCGAAGACC CCGAATGGGA AGTCGGCGTG AACGCGGTGT TCTATGACCT TGCGGCGATG GCGATGCGCG GGGCGCAGCG CCCGAACGCC GAGGTTGGCG AGGCCGACTT GCGACGCAAG GTGCTGGCAA TGGTCTCCTC CAGCCTGGAG GACCCCGCGC TGCGCACGGC ATCGATCGCC GATGCCTGCA ACATCTCGGT TCGCACGGTG CAGAACGTGT TCGCGGCAAT GGGTACGACG CCGACCGCGT ACATTCTCGA GCAGCGCCTT CGCCGCGCGG CGGACCGGCT CGTTGGAAGG CCCGACGCCA GCATCACGGA GATCGCCTTC GAACTGGGCT TCAACGACAG CGCCTACTTC ACGCGGTGTT TCCGCCAGCA GTTCGGCGCG GCGCCGCGCG ACTGGCGATT GGGAAGGATG TCATCATGA
Protein sequence | MGALLKFHTQ DVAPQDRARY WNEIADRVFT GTFVNVPGED FSGRMLSWRV GELDMIRTDS THSGVGRTPI AQDDERLILH LQCRGTSQHM QKQAECALEP GDFVLASPHI PYSIKLTGHE MLVVEFPRAP LAERFPGVDD ALLQRMCGAS PGGRVFHDFL LSLWQQGERA AEDPEWEVGV NAVFYDLAAM AMRGAQRPNA EVGEADLRRK VLAMVSSSLE DPALRTASIA DACNISVRTV QNVFAAMGTT PTAYILEQRL RRAADRLVGR PDASITEIAF ELGFNDSAYF TRCFRQQFGA APRDWRLGRM SS
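The relationship between the two sequence fields can be sketched in plain Python. The sketch assumes the sequences are pasted with the 10-character spacing shown above. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The helper names (`clean_sequence`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(seq: str) -> str:
    """Remove the spacing between 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, dropping one trailing stop."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGGGTGCGC TTCTCAAGTT TCATACGCAG"))  # prints: MGALLKFHTQ
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value, after removing its spaces, would be a reasonable consistency check. `gc_content` on the same input can be compared with the GC content field.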