Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0074 |
Symbol | |
ID | 4027253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 92956 |
End bp | 94011 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965225 |
Product | LacI family transcription regulator |
Protein accession | YP_572137 |
Protein GI | 92112209 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.840414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGTC GACCGACCCT CAAGACTATC GCTGATCGGG TGGGCGTTAC CGTCTCCGCG GTATCGCTGG CCCTGCGCGA CGATCCACGT ATCTCCGCGA CGACGCGTTC ACGTATCCAG GAGGCCGCCG AGGCACTGGG TTACGTCTAC AATCGACATG CGGCGGGATT GCGACGTGGC GACAGCGGCA CCGTGGCGGT CTGCCTCAAC GACCTGAGCA ACCCTTTCTT CACCGAATTC CTCGGCCATA TGGAGACCCG TTTCCGCGAG GCGGGACGCA TGATGCTGTT CTGTCATGCC CATGAGACAC CCTCGATCCA GGCTCGCTTC ATCCGCCAGA TGGCCGAACA CGGTGCCGCC GGATTGATTC TGGTCCCGGT GGAGGGCACG TCGCGCGCGG ACCTGGAAGG CCCCCGCGTG CGTCATCTGC GCGATTTTCC ACTGGTGCTG ATTTCGCGCG ATGTCGCCGA TACGGCGTTC GACCGGGTGA TCAACGACGA TGAACGCGGC ATCCGGCTAT TGTTCGAGCA CCTGTATGCC CTGGGGCACC GCCGCCTGGC CTGGCTGGGC GGAGGGGGCG ATACCTCCAC GGCGCACGAT CGCGAGCGCT CCTTTCGCCG CGAGATGGAC GCCGCAGGAT TGCCGATCGA TGACGCGACG ATGCACCACG GCCCGACGTC ATTGGCCTTT GGCGATAGCA TGCTCGACGA GTTGATGGCG CTGCCGACAC CGCCTACGGC GATCGTGTGT TTTTCCGACG TGATCGCGTT CGGGGTTCTG GCTGCCTGCT ATCGGCGGGG GCTGCGCCCG GGCATCGACC TTTCGGTGAC CGGCTTTGAC GACATGGGCG CCGCTGCCTA TTCGGCGCCG GCTCTGACCA GTGTCCGCGT GCGCACCGAC TTGATCGGCG ATCGCGCCAG CGAGTTGCTG CTGGCCCGCA TCGCCGGCGA CCGTCAGCCG GCGGTGCGGG AAATGATGCC GCCCGAACTG GTGGTCCGCG AGACCACCGG TCCTCCGCCC GAGACAGGCA TGCTACCGCC ACGGCGGCGT GGGTGA
|
Protein sequence | MTRRPTLKTI ADRVGVTVSA VSLALRDDPR ISATTRSRIQ EAAEALGYVY NRHAAGLRRG DSGTVAVCLN DLSNPFFTEF LGHMETRFRE AGRMMLFCHA HETPSIQARF IRQMAEHGAA GLILVPVEGT SRADLEGPRV RHLRDFPLVL ISRDVADTAF DRVINDDERG IRLLFEHLYA LGHRRLAWLG GGGDTSTAHD RERSFRREMD AAGLPIDDAT MHHGPTSLAF GDSMLDELMA LPTPPTAIVC FSDVIAFGVL AACYRRGLRP GIDLSVTGFD DMGAAAYSAP ALTSVRVRTD LIGDRASELL LARIAGDRQP AVREMMPPEL VVRETTGPPP ETGMLPPRRR G
|
| |