Gene Information |
Locus tag | Swit_2042 |
Symbol | |
ID | 5200879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 2291771 |
End bp | 2292838 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640581586 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001262539 |
Protein GI | 148554957 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.595778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGGC ATAAGCATGT CGTACCGGTG TGCAATGCAA CTGCGCCGAC CGGTATGATC GTCACCATGC TGGATTGCCT GTCCCGCATT TCGGGCGAGC CCGATCGCCT GCTCGCGGAG GTGGGTGTCT CGCACAGCTT CGCGGCGTTC AAGGCGGGCC AGGTGCGCGA GATCGGGCTG GACGCCTTCA TCGCGACAAA CCGGGCCTGC AACGCGCGCT TCCGCGACTA TCTCCACCAG TCGCAGGGGC CGCAGATGAC CGAGGAGCAG TTCTCGCTGC TGTGCCGCTG CCTGATCGCC TGCGCCGACC TGCGCGAGGT GCTGACGACC ACGTCGAGCT TCTTCGCGAT GTTCAACGGC ACGCTCGGCG CCTTTCGCCC GGAGTTCGGC GATCGGCATG TCACGCTTTT CATCGAGCCC CGCCGGCGCG GCCCTACCGA TCCTTCCTTC CTGATCGACG CCTTCGGAAT GGCGGTGCTC CAGCTGTTGT TCGGCTGGCT GATCGGGCGG TCGCTGGCGT TGGAGCGCGT CGATTTCTCC TATCCCGCCA GCGTGCGCAC GGATTTCGGC CTTGGCCTGT TCAGCTGCCC GGTGCGGTTC GACCGGCCGA GCAACATCAT GCTGTTCGAT GCCGTCTATC TCTCGGCGCC GGTGGTGCGC ACCAGCAGCG ACATGCGCAC GCTGCTGGAG ACCTTTCCCT ACGACATGAT GCTCGGGCGC GATCGCGGCC GCTCGCTCGC CGATCAGGTC TATGCGCTCA TGATGAACGC GCATTCGGCG GAGCGGCAAC TGCCCGGTGC CGAACGGGTG GCGCGGGACT TCGGCGTGTC GAGCTGGACG CTGCGGCGGC GGCTGGCCGA GGAGGGGACC GGCTTCTCGC GCATCCGGCA GCGGTGCCAG CTCAACATCA CCACCGATTT CCTGCGCCGG CCGGACCTGA CGATCGATCG CATCGCCGAG ATCGCCAATT TCAGCGACGC CAACGCCTTT CGCCGCGCCT TCCAGCAATG GACAGGCAAG TCCCCCACCG CCTTCCGCCG CGAGCTGGCC GCCGGGCGCA TCGGCTGA
Protein sequence | MSRHKHVVPV CNATAPTGMI VTMLDCLSRI SGEPDRLLAE VGVSHSFAAF KAGQVREIGL DAFIATNRAC NARFRDYLHQ SQGPQMTEEQ FSLLCRCLIA CADLREVLTT TSSFFAMFNG TLGAFRPEFG DRHVTLFIEP RRRGPTDPSF LIDAFGMAVL QLLFGWLIGR SLALERVDFS YPASVRTDFG LGLFSCPVRF DRPSNIMLFD AVYLSAPVVR TSSDMRTLLE TFPYDMMLGR DRGRSLADQV YALMMNAHSA ERQLPGAERV ARDFGVSSWT LRRRLAEEGT GFSRIRQRCQ LNITTDFLRR PDLTIDRIAE IANFSDANAF RRAFQQWTGK SPTAFRRELA AGRIG
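
The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence itself. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced and ends in a stop codon, and that GC content is the rounded percentage of G and C bases. All function names (`clean_sequence`, `gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces that split the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# Values copied from the record above.
length_bp = gene_length(2291771, 2292838)
print(length_bp)                  # 1068, matching "Gene Length"
print(protein_length(length_bp))  # 355, matching "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` and rounding the result should give a value comparable to the GC content field, provided the database uses the same definition.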