Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0085 |
Symbol | |
ID | 4026007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 104738 |
End bp | 106663 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637965236 |
Product | putative transcriptional regulator |
Protein accession | YP_572148 |
Protein GI | 92112220 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACGG CCGCCGCCCT GCTGGAGCAG TTGCGTTTGC TGGATGAGTC CGAGCGCGTT GAAGCCAAGC GGGCTCAGAA GTTCGGCAAG TCGATGCTGG AAACCATCTG CGCTTTCGCT AACGAGCCGG GGCTGGACGG TGGTCATCTA TTGCTGGGCG TGGTCGAAGA TCAGGGCCAT TACCGGGTCG AAGGCGTGCC CGACCCCGAT ACCCTGCTGA ATGACCTGCA TTCTGCCTGT GCCTCGACGT TCAATGTGCC GCTACGGGTC CAGGCGGCTA CCGAGCTGGT GGAAGGCGAG CGGCTGGTGG TCATCCAGGT GCCCGAGGCC CGGCCGGCCG ACAAGCCGGT GTATTTCGTC AAGTCGGCCC TACCTCGGGC CGCCTGGCGG CGCGGCCCCA ACGGCGACTA CCGCTGCAAC GAGCATGATC TCGAGGTGCT TTACCAACAG CGTTCCCAGG TTGAGTTCGA TCGCTCGGTG CCACACGGCG CCACCCGCGA CGATATCGAC CCGGATGCCC TCGACGACTA TCGCCGCGAC CGCCGGGCCA TGAACGCTCA GGCCGAGGAA CTCGGCTATA GCGATGACGA GCTGCTCGAA GCTTTGGGGG CCGCCATCTG GCAGCACGGC GAGCTGAAGC CGACCCTGGC CGGCATCCTG CTCTTCGGGC GGCGCATGGC CATTCGTCGG CTGGTGCCTG CCCATCGGGT CGACTACATC CGCGTGACCG GCAAGGAGTG GATCGAGGAC CCCGACGAGC GCTTCACCAC CATCGACATG CGCGATACCC TGCCGCGGCT GATCAACCGC GCTGTGGCGG CGGTGTTGGA TGACCTGCCC ATGGCTTTTC ATCTGCCACA GGGCAGCCAG CAACGCGCGG ACCGACCGCT GATCCCCGCC AAGGTCGTGC GCGAGGCCAT CGTCAACGCC CTGATGCACC GCAACTATCG TGCCCACCAA CCGTTGCAGA TCATCCGCTT CAGCAACCGC ATCGAGGTGC GCAATCCCGG CTATTCGCTC AAGCCCGAGG AACAGCTAGG CCTGCCTGGC AGCGCCTGGC GCAACCCGAC CCTGGCGACG GTGCTGCACG AAGTCGGCTA TGCCGAAACC AAGGGCAGCG GCATCCGCGT CATGCGCCGC CAGATGGAAC AGGCGGGGCT GACGCCGCCG GTCTTCGAGT CGGTGCGTCA TGAGGATCGC TTCATGGCGA CCCTGCTGTT CGTGCATTTC CTCGATGATG AGGCGGTGGA ATGGCTCAAG CATTTCCGCC ACTGGCAACT CTCCGATGAG GAGTGCCGCG CTCTGCTGTT CGTGCGCGAG ACGGGGCGCA TCACCAATGC CGACTATCGC GACCAGAATC GCGTCGACAC CCTGGCCGCC AGTCAGCAGT TGAGCCGACT GCGCGACCTC GGGCTACTCC ACCAGGTTCC CAAGGGCGCC GAGACCTATT ATCTGCCCGG TGAACATTTT CCCATGCAGG GCAGCCCGGC CATGGAACTG CTCTCTGTAT TGGACCGCGA CCTGGCTGCC GAACAGGAGA ACTTACCTCA GGAGTCAGGC GGCTTACCTC AGGAGTCAGG CGGCTTACCT CAGGAGTCAG GCGGCTTACC TCAGGAGCCG GAAGGCCAGT CACGGGAGTC GCTTATCGCC GAGCTTCCCG GCTGGCTGGC GCTACGGCTC GAGGCCATCG GCCAGCGTTC CCGAGACAAG CGTCGGGTAC GAGAGCTGCT GCGGGCGCTG TGTGCCGAGC GCCCCTATCG GGCGGCCGAA CTCTCCCGGC TGTTGAAGCG CAACCAGGAA TACCTTCAAA AAGAATATAT TACGCCGATG CGCCAGGCAG GAGAACTGGC GTATCAGTAT CAGGACGACC CCAACCGTCC TGATCAAGCC TACGTATCGC CTACCGCCAA TAGAGAAGCC GAATGA
|
Protein sequence | MTTAAALLEQ LRLLDESERV EAKRAQKFGK SMLETICAFA NEPGLDGGHL LLGVVEDQGH YRVEGVPDPD TLLNDLHSAC ASTFNVPLRV QAATELVEGE RLVVIQVPEA RPADKPVYFV KSALPRAAWR RGPNGDYRCN EHDLEVLYQQ RSQVEFDRSV PHGATRDDID PDALDDYRRD RRAMNAQAEE LGYSDDELLE ALGAAIWQHG ELKPTLAGIL LFGRRMAIRR LVPAHRVDYI RVTGKEWIED PDERFTTIDM RDTLPRLINR AVAAVLDDLP MAFHLPQGSQ QRADRPLIPA KVVREAIVNA LMHRNYRAHQ PLQIIRFSNR IEVRNPGYSL KPEEQLGLPG SAWRNPTLAT VLHEVGYAET KGSGIRVMRR QMEQAGLTPP VFESVRHEDR FMATLLFVHF LDDEAVEWLK HFRHWQLSDE ECRALLFVRE TGRITNADYR DQNRVDTLAA SQQLSRLRDL GLLHQVPKGA ETYYLPGEHF PMQGSPAMEL LSVLDRDLAA EQENLPQESG GLPQESGGLP QESGGLPQEP EGQSRESLIA ELPGWLALRL EAIGQRSRDK RRVRELLRAL CAERPYRAAE LSRLLKRNQE YLQKEYITPM RQAGELAYQY QDDPNRPDQA YVSPTANREA E
|
| |