Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_3259 |
Symbol | |
ID | 4029008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3633362 |
End bp | 3634360 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637968474 |
Product | AraC family transcriptional regulator |
Protein accession | YP_575302 |
Protein GI | 92115374 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.560164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCCGG CGACAGGCAT GACGCACGAT GCCACTACGC CGGCCCAGCG TCTCACCATG GCCGATTTCG GCGCTTTCGA GCGCCGTTAT CGTTTGTATC ATCACTTTCC CTCCTTGACC CGTGCCGATG CCGTGTCCAC CACGGTGGCC GAAGGCTGGG TCGATGAATG TCGCCCGGAC ACGGGCATCA GTCTCGTCGG CTCGCGACTC ACGATTCATC ACACCTACGA AACTCACGCA TTGGCGGATG CTCCGGCGCA CGTGTCGATC ATCGTCATGC TCGAGGGGCA GGCCGAGCTG ATGCATGGCG AGAAGCGCCT CACGCTGTCG CCTCGCGAGG GGGTGATGCT GGTCAACGAC GGGCGCTGTC CGCTGAGTGC GCGCCACATG GGCGGCCAGC GCCTGCGCGC GATCAATCTG ACGCTGCTCG ACGATGCACG CACCACGCAG AACCGGCTGG CCGCGCCCTT GGCAGATCTG CTGACGTCTT CACCGGATGG CGCCTGGCGG CTGGCACCCC CCGAGGGTCT TCTGGCATCG CTCGAACAGT GGTTGAACGC CGCGGGGGAG GGGATCTCGC ATACATTGCT CGGCGAGGGA CTGGGGTTGC AGCTGATGGC ACATGGGCTG GCAGCCAGAG AAAACGCCCC GGCCGAGCAG ACGCGCCGTC TGGGCGTGCG CGATCGCCAT CATCTGGCAC GCGTGCGTGC CTGTCTGCAC GATCATCCCG ATGCCTCGCA TGACCTCGAC TCCCTGGCGC AACTGGCCTG CATGAGTCCC AGCGTGTTGC GTGACAAGTT CCGCCAGGCG TACGGCCAGC CGGTCTTCGA GTACCTGCGT CAACGCCGCC TCGAAATGGC GCATGATCTG CTCCGGGAAG GCTATAGCGT GCAGTACGTC GCAACCCGTG TCGGCTATCG GCACGCCAGC AATTTCGCCA CCGCCTTCAA GCAGCGCTAT GGCGTGTCGC CCCGCGCCTT GCAGCGGCAA GGAGCATAG
|
Protein sequence | MGPATGMTHD ATTPAQRLTM ADFGAFERRY RLYHHFPSLT RADAVSTTVA EGWVDECRPD TGISLVGSRL TIHHTYETHA LADAPAHVSI IVMLEGQAEL MHGEKRLTLS PREGVMLVND GRCPLSARHM GGQRLRAINL TLLDDARTTQ NRLAAPLADL LTSSPDGAWR LAPPEGLLAS LEQWLNAAGE GISHTLLGEG LGLQLMAHGL AARENAPAEQ TRRLGVRDRH HLARVRACLH DHPDASHDLD SLAQLACMSP SVLRDKFRQA YGQPVFEYLR QRRLEMAHDL LREGYSVQYV ATRVGYRHAS NFATAFKQRY GVSPRALQRQ GA
|
| |