Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_2047 |
Symbol | |
ID | 5197398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 2298592 |
End bp | 2299644 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640581591 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001262544 |
Protein GI | 148554962 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC ACAAACATGT TTCGCGCCTG CGCTGCGAGG AGGTGCCGGT CGATCTCCTC ATCGCCCTGC TCGGCTGCGT CGCGAAGGCC GGCGGCAGTG CGGACGGCAT CCTCCGCGAC GCGGGCGCCA CCTATCGGTT TCGCGAACTG AAGCGGCGAC GCAACTGCAC GATCGCGGAG TTCACGCGGG CCAATCGCCT GTGCAACGAA TATCTGCGCG GCCACATTCT CGCCACCACC GGCTGCCCGA CGCTGAACGA ACAGCAATTC TACCTGCTCG CCAAGTGCCT CGCCGCCTGT TCCGATCTGG AGGAGGCATT GCGCACGACA GCCGCCTTCT TCGCGATGTT CGAGGGGAGG ATCGGCGAAG CCCATTTCGA GGTGCGCGGC GAGCGGGTAC ATCTTCACAT CAACCCTCCC CGTCGCGAGA AGAACGAGGC CGGCTTCCTC GTCGACATAT ATGGCTATGC GATCCTCCAG ATATTCCTCG GCTGGCTGCT GGACGAGCAG CCGATCTTCG ATGCGGTCGA TCTCATCTAT CCGCAGCCGG CTCGGGAGAG CGTCCATCTC GGGCTTTTCG ACTGCCCCAT CCGTTTCGGC CAGCCGAGCA ATCGCTTCAG CTTCAGGAGC GATCTCCTCT CGCAGCCGGT CGTTCGCGAT CAGGCGAGCC TCATGAAGCT GCTCGCGGAT TTCCCGTTCA ACCTGATGCT CGACGAGGAG CAGCGCAAGC TGTGCGACCG GGTCTATACC GCAATGATGA ATAGCTACAT GAGAAGCCAT ATGCTGCCGA CCATAGATGA TGTGGCGAAG CTGTTCAGAA CCTCGACCTG GACCTTGCGC CGCCGCCTGA CCGAGGAGGG CACCGCCTAT TCCTCGATCA AGAAGAAGTG CCAGCTCAAC CTCGCGACCG AGTTCCTCAA GCGATCCGAG ATGACGATCG ACGAGATCGC GGATATCGCC AATTTCAGCG ACGCCAACGC CTTCCGCCGC GCCTTCCACC AATGGACCGG CTGTTCGCCG ACCGCCTATC GCAAGGAACT CCTCGCCGTT TAA
|
Protein sequence | MGKHKHVSRL RCEEVPVDLL IALLGCVAKA GGSADGILRD AGATYRFREL KRRRNCTIAE FTRANRLCNE YLRGHILATT GCPTLNEQQF YLLAKCLAAC SDLEEALRTT AAFFAMFEGR IGEAHFEVRG ERVHLHINPP RREKNEAGFL VDIYGYAILQ IFLGWLLDEQ PIFDAVDLIY PQPARESVHL GLFDCPIRFG QPSNRFSFRS DLLSQPVVRD QASLMKLLAD FPFNLMLDEE QRKLCDRVYT AMMNSYMRSH MLPTIDDVAK LFRTSTWTLR RRLTEEGTAY SSIKKKCQLN LATEFLKRSE MTIDEIADIA NFSDANAFRR AFHQWTGCSP TAYRKELLAV
|
| |