Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_3237 |
Symbol | |
ID | 4028571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3606708 |
End bp | 3607676 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637968452 |
Product | AraC family transcriptional regulator |
Protein accession | YP_575280 |
Protein GI | 92115352 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTGTCG TGCAGCCTGT TTTCGAGACA CCGGATGATG AAATCGTACA ATTGGCGGAT ACTCTTTCTC GCCGGATTCA GAAATGGGCG CCGGAAGAAG GGCTGACACC GACGGCGGTG CCGGGGTTGG AACTGGTGCG CGCCAATTCC TCGTTGACCA ACTGTATCGG CTCGACGGTC TACGACCCTT CGCTGTGTCT GATCGCGCAG GGCAGCAAGC GCATATGGCT GGGAGACCGG GAAATCGATT ATGGGCCGCT GAGCTGCATG GTGTCGGCGG TGCACCTGCC GGTACTGGGC AAGATCACCG AGGCTTCGGC GGAGCGGCCC TACCTGGGGT TGAAGCTCGC CGTCGATGCC CAGGAAGTCA CCGATCTGGT ACTGGAGCTG GGTGAGGGGC TGAGCGAGAT GGAGGAGCGG GGATGCGCCG AGACCGCTTG CGGCCTCGGG CGCGTGCAGG CCGAGAAGGG GCTGGTGGAA GCCATGCTGC GATTGGTGAG CCTGCTCGAT TCCCCCCAGG ACATCCGCAT TCTCGCACCG CTGGTGCGGC GCGAGATCTT CTATCGTGCG CTGGTCGGCG AGATCGGCCT GCACATGCGC AAGTTCGCGG TGGCCGATAC CCAGACCCAT CGCATCTCGA AGGTGATCGC GGTACTCAAG GATCGCTTCA CCGAGCCGCT GCGCGTTCGC GAGCTGGCGG ACATGGTGAA CATGAGCGAG TCGTCACTGT TTCACAGCTT CAAGCAGGTC ACCCGGATGT CGCCGGTGCA GTTCCAGAAA AAGCTGCGAC TGCATGAAGC GCGCAGGCTG ATGCTGGCTG AAGGGATGGA GGCGGCCACC GCCAGTTTCC GCGTGGGGTA CGGAAGTCCA TCCCATTTCA GCCGCGAGTA CAGCCGCCTT TTCGGCGTGC CGCCCCGCAC GGACGTGAGC AAGCTACGCG GCGAGCTGCC GCAGACCGCC CGCGCCTGA
|
Protein sequence | MSVVQPVFET PDDEIVQLAD TLSRRIQKWA PEEGLTPTAV PGLELVRANS SLTNCIGSTV YDPSLCLIAQ GSKRIWLGDR EIDYGPLSCM VSAVHLPVLG KITEASAERP YLGLKLAVDA QEVTDLVLEL GEGLSEMEER GCAETACGLG RVQAEKGLVE AMLRLVSLLD SPQDIRILAP LVRREIFYRA LVGEIGLHMR KFAVADTQTH RISKVIAVLK DRFTEPLRVR ELADMVNMSE SSLFHSFKQV TRMSPVQFQK KLRLHEARRL MLAEGMEAAT ASFRVGYGSP SHFSREYSRL FGVPPRTDVS KLRGELPQTA RA
|
| |