Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2607 |
Symbol | |
ID | 4028135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2918188 |
End bp | 2919099 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637967815 |
Product | AraC family transcriptional regulator |
Protein accession | YP_574653 |
Protein GI | 92114725 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.334421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCTG CGCAGCTCAA CGATACTCGT CTTGGTGCGC CTGTCTTCGA GAGCATCAGC CAGGACCCGA AGTTCAGTTT CTACTGGCAC TGCCACGATT TTCCCGCACC CATCGCCCGC TGGAATTATC ACCCCGAGTA TGAATTGCAT CTCATCCGTT ATTCGGAAGG GAATTATTTC GTGGGTGATT ATATCGGGCG ATTCGGGCCT GGTAATTTAG TGCTGGTGGG CCCGAACGTT CCGCATGCCT GGTTGAGCGA TCCGGACGAT TCAAGGTCCG TCATCAAGGG GCGCGATGTC GTTCTCCAGT TCAAGGGCGA GTGGCTCGAG CACATGATGG CGCTTTGTCC GGAACTCAAT TGCCTGTCGA CGCTCATCGA GGACTCGAGG CAAGGCGTGG CGTTTCAGGG CGAGGAAGCG ACACGCTGTG GCGAGCTGCT GATCACCATG GGGGAACAGG ACCATGCCGG AAGGCTGTTG ACCATGCTGA CCATTTTGCG CGGCCTATCG CAGTGCGATT ACGCGACCCT GTCCAGCCGC GAATACGGCC TGGAGGCCAC GGGCGTTTCC TCCGCCCAGG TCGACGCCAT CCTGCGCTAT ATCCACGACC ACTTTCACGA AGAGTTGCGC ATGTCGGCGC TGGCAGCGCG CAACGGCATG AGCCCTTCCA GCTTTTCGCG CTTCTTCAAG CATGCCACGG GCGACACCTT CGTCGCCTTT CTGCGGCGGA TCCGTATCGG TCATGCCTGC CAGCTGTTGC TCGAAGGCCG GCAGTCCATT GCCGATATCT GCTTCCAGGT CGGTTACAAC AACCTTTCCA ACTTCAACCG GCATTTTCGT GAACTGAAAG GCATGACACC TCGCCAGTAT CAGCAGAGCA CGGCCGACCT CGTGACGGGC GATTGCCGGT GA
|
Protein sequence | MIAAQLNDTR LGAPVFESIS QDPKFSFYWH CHDFPAPIAR WNYHPEYELH LIRYSEGNYF VGDYIGRFGP GNLVLVGPNV PHAWLSDPDD SRSVIKGRDV VLQFKGEWLE HMMALCPELN CLSTLIEDSR QGVAFQGEEA TRCGELLITM GEQDHAGRLL TMLTILRGLS QCDYATLSSR EYGLEATGVS SAQVDAILRY IHDHFHEELR MSALAARNGM SPSSFSRFFK HATGDTFVAF LRRIRIGHAC QLLLEGRQSI ADICFQVGYN NLSNFNRHFR ELKGMTPRQY QQSTADLVTG DCR
|
| |