Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_1540 |
Symbol | |
ID | 5198556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 1713884 |
End bp | 1714843 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640581088 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001262041 |
Protein GI | 148554459 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000047582 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00155056 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCTGC TCAACAGCGT GCTGCAATCG CTGAAGATCA TCGACAGTTC GCTTGGCATC CTCGAGCTTG CGCCGCCGTG GGGCTTCCGG CGGGACACGA TCCCGAAGGA CATGGCGACG TTCATTGCGC CGATGTCGGG AAGCTGCCTC GTTCGCCTCG ATGACGGCAC CACCGTCGAA ATGAAGCCCG GCAACGCCGT GCTGATCCTC CGCGACGCAT TCGACATAGT GTCGGCGGAC GGCAGCCCCA CCAAGTCCTT CGTCGAAAGT TGGACCGCCC AGGGGCTACC CGCCATGGGG CCGCATATCG AGCGATCAGG CCCGGATTAT TTCTGCGCGA TCGACCATGA GCGCCCGGAC GAGCATCGCG ACCGGCTGCT CGCTGTCGCC GTCCAGGCTG AGGACGTGGC GGAAAGCCCT ATCCTCGGCG TGCTTCCGCA GATGATCAGC TTCGACGAAA AGGAGCTGGA GCCTGCCCTG ATCGGCACCG TCGCCCAGTT CGTCGAGGCC GAGCATCGCC AACCCAATCC CGGCTACAAT GCCACCGCCC AGCAGCTCGC CAATTTCCTG TTCATCGCGT TGGTCCGCAA CCATGTGCTC TCCAACGGGA CCGACCGGGC AAGCTGGATC CGGGGCATGT CCGACCGCGC GATCGGACAT GCCCTGAAGC TCATCCATCA AGCCTTCGAC AAGCCCTGGA GTCTGCAGAA CCTGGCGCGC GCATCCGGCG TATCGCGGTC GGTCTTCGCC GCGCGCTTCA GGACGCTGGT CGGCCAGACG CCGATGACCT ATCTCGCAGC GGTCCGGATG CATGCGGCGG CGCAGCTGCT GATCGACGGC CAGCCGGTCT CCAATGTCTG CCAGCGGGTC GGCTATCGTT CCGAATGGGC GTTCCGAAAG GCCTTCCGCC AGCAGTTCGG GATGATGCCC GCCCGCTATG GAAAGTCGCT GCGCGGCTGA
|
Protein sequence | MDLLNSVLQS LKIIDSSLGI LELAPPWGFR RDTIPKDMAT FIAPMSGSCL VRLDDGTTVE MKPGNAVLIL RDAFDIVSAD GSPTKSFVES WTAQGLPAMG PHIERSGPDY FCAIDHERPD EHRDRLLAVA VQAEDVAESP ILGVLPQMIS FDEKELEPAL IGTVAQFVEA EHRQPNPGYN ATAQQLANFL FIALVRNHVL SNGTDRASWI RGMSDRAIGH ALKLIHQAFD KPWSLQNLAR ASGVSRSVFA ARFRTLVGQT PMTYLAAVRM HAAAQLLIDG QPVSNVCQRV GYRSEWAFRK AFRQQFGMMP ARYGKSLRG
|
| |