Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2540 |
Symbol | |
ID | 4026121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2847628 |
End bp | 2848683 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637967747 |
Product | AraC family transcriptional regulator |
Protein accession | YP_574586 |
Protein GI | 92114658 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCGG TAGTGGCGCA TCGCCTCATG CCTGAGTCGG GCAAGCACGA GATCGAAGCC AAGGAATTGC GACGGCTATC GGCATCGCTG GTCACGCGAC AGGCCCTCGA TGCACCGAAC TGGGACCGAC GCTCGGCCCT GCTCGACGGA CGCATGCAAC TCGCCGAGCT CCAGCCAGGC ATGCAGCTGC GCTTGGCCGA TGTCAGTGAT CGTTACGATC TGGTCACCCG TGCCTTACTA CCGGCGGGGG TCAAGATCGC CCTGGTCGTC GCCGGCGAGG CTCGCGTGAG CTATGGCGAT CAGGCAGTGT CGCTAGGGCT GTCGGCGCCG AGCACCGGAC TGCTGGCCAG GCTTCCCGAA GCCACTCGCT TCGCTCGACG GGGACGTATC GGCGGCCACG AGCGGACCCT GACCCTAACC CTCACACCGG ACTGGCTGTT ACGACACGGC TACTCGATTA CCTCGAGTCA CACAGCGCAG CTGGTCCGCT GGTCGCCATC GCCGGGACTG CTGAGACTCG CCGAACGGTT GTTCGACGAG CGCTTCCTGT ACAGCCGGGA TGATGCCCAT CGCCTGCAAC TGAGCGGCTG TGCTATGGCG ATGGCCGGTG AAGCATTGGC CGCTCTCGGA CACGATCGGG AGGGGCAGAA ACATGACGAG GAAAGAGAAC ACTACCCGAC AGACCGTCGA CTGCAACGCT TGATGACGCT AGTCGAGAGC GGCGAAGCCC ATCGTCTGGG GCAGGAAGAA CTGGCCCGGC GTCTGGGTAT GAGCCTGAGC AGTCTGCAAC GACGATTCCT CGCCTGTTAC GGCAAACCAC TGGGACGCTT CCTTCGACGT CGTCGCCTCG AAACAGCCTT GGCGGCACTG CGCAATGAAG CCATCAGTGT CGAAGCTGCA GCCATTCTGG CCGGCTATAC CAACGCCGCC AACTTCGCCA CGGCCTTCAA GCGCGAATTT GGAGCACGGC CCGGCGACCT GCGTCGAGGA CCTCAGACAA GCCAAGAAGA AACTCGCCTG CTCAAGGTGG CATCAGGCGG TAGCGGGCGG AGTTGA
|
Protein sequence | MTAVVAHRLM PESGKHEIEA KELRRLSASL VTRQALDAPN WDRRSALLDG RMQLAELQPG MQLRLADVSD RYDLVTRALL PAGVKIALVV AGEARVSYGD QAVSLGLSAP STGLLARLPE ATRFARRGRI GGHERTLTLT LTPDWLLRHG YSITSSHTAQ LVRWSPSPGL LRLAERLFDE RFLYSRDDAH RLQLSGCAMA MAGEALAALG HDREGQKHDE EREHYPTDRR LQRLMTLVES GEAHRLGQEE LARRLGMSLS SLQRRFLACY GKPLGRFLRR RRLETALAAL RNEAISVEAA AILAGYTNAA NFATAFKREF GARPGDLRRG PQTSQEETRL LKVASGGSGR S
|
| |