Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3698 |
Symbol | |
ID | 4443699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4160552 |
End bp | 4162201 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691522 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_833173 |
Protein GI | 116672240 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAA AGCTCAGTGA GATCCTAAAA CATGCCACCC TCACAGCCGC CGATCCCGTG ATCCGCGCCG GTGCAGGCGC AGTGGCGGGT ACGCAGCTGC GGTGGGTGCA CTCCAGTGAA GTCCTGGACA TCGCGCCGTT ACTTAGCGGC GGCGAGCTGC TTCTTACCGG CGGCGACGCG CTGGTTACTG CCTCGGATGC ACGCCGGGCG GAGTACGTCC GGCAGCTCTC GGAGCGCGGG GTTGGTGCGC TGGCGGTGGA AACTGGCCAG CGGCTCGCTT CACTGCCGTC GTCGATGATC CAGGCAGCGG AAACCGCCGG GCTTCCCCTT ATCGAATTCC GCAAGGTGGT GCCCTTCGTG GGCATCATGC AGGCGATTAA CTCGATGCTC GTCAGTGAAT CGGTGGCCCA TCTGCGCCGC GCGGATGAGG CCAGCCACGC CATGGCCGTT GAGCTGGCCC ACGGGAGCAG CCTGGATCAG ATCCTTGCCG TGCTGGCCGG AATCATTGGT GCGACCCTGG AGCTCTCATC CGTGTCCGGT GTAACGCTGG GCAACGCCGG CCCGGGCGGC GCCGATACGC GTCCGGCGGC CGGCGCCGGT ACGCATCCGG GGAACGGCAC GGCTTCCCAG CCGGCAGAGG GCCTGGAGTC CGGCACCGGC TCCCCGGCAG AGCCCGGAAG CGCGGCAGAG CCCGGAAGCG CGGCAGAGAC GGGCATCGGC GGCACGTTGA TAAGCATTGA TGTGCCGGTC CGTGGCGTGC CCTCCGCGAG GTTGCTCATC AATGTCCCGG CCGACGGCGA CGTCAACCTG GCGCGCGTGG CCGGCGGCCG CTGCGTGGAC ATCCTGTCGC TCGCTCTGCT GCAGCGGATG CCGCCCGGGC TGAAAGAAGT GGCCGGCACG GCGCTGCTGC GGGCCGTCAG TTCCGGCAGC CAGCCCTGGC GGCTTCAGCA GCTCTCCCCC GCCGCCGGGA TCCTTCCCTC TGCGACGGTG GTCGCCGTCG TCGTCCGGTC GTCCACCTCA CAGCAGCTGC GGGCGGCAAT GGACACTATC CTCAAGCGGT CGGCGCAGCA GAGCGCCAGC TATGTGGACA ATGCCGAGCT CCTGGCGCTT GCCGCCCTCC CTTTTGACGG GGCGGCGGCC GCGCGGGCCG GGCTCGTGGC AGCACTCAGG GAGCTCCCGG TTGAGGCGGG GACCATGACG GCGGTGGGTC CGCTGGCCGC CGGGATCGAA CACGCTCCCT GGTCGTTGTC GGAGGCGAAA AGCGCCCTGG ATCTCGCCGT GAACGGCTCA CTTCGGTCAG CGGCCCGGCC GGCGTCGGAC ACGGAAGGAG TGGTGATCGA CGTCGAAAAT CTGGCGGTGG AGCGCCTGGC GGTGCAGCAT CTGGACCAGG GCGCGCGGCA GGATTTCGTC CGCCAGCAGG TGGGGCCCCT GCTGGATCAC GACGCGCGGC GCAACTCCCA GCTGCTCGCA ACCCTGGCAA CCTGGCTGGA CTCTGGCTGC AACACCGCGC AGGCTGCCCG CGAACTGCAT GTTGAGCGGC AGTCGATGCA TCACCGCATG CAGCGGATTT TTGAGTTGTG CGGCGGCGAT CCGCGCGGAA CCGGCCGGCT CGCTGCGCTG CACCTCGCCA CCCGGCTGGC CGCGCTGTAG
|
Protein sequence | MPIKLSEILK HATLTAADPV IRAGAGAVAG TQLRWVHSSE VLDIAPLLSG GELLLTGGDA LVTASDARRA EYVRQLSERG VGALAVETGQ RLASLPSSMI QAAETAGLPL IEFRKVVPFV GIMQAINSML VSESVAHLRR ADEASHAMAV ELAHGSSLDQ ILAVLAGIIG ATLELSSVSG VTLGNAGPGG ADTRPAAGAG THPGNGTASQ PAEGLESGTG SPAEPGSAAE PGSAAETGIG GTLISIDVPV RGVPSARLLI NVPADGDVNL ARVAGGRCVD ILSLALLQRM PPGLKEVAGT ALLRAVSSGS QPWRLQQLSP AAGILPSATV VAVVVRSSTS QQLRAAMDTI LKRSAQQSAS YVDNAELLAL AALPFDGAAA ARAGLVAALR ELPVEAGTMT AVGPLAAGIE HAPWSLSEAK SALDLAVNGS LRSAARPASD TEGVVIDVEN LAVERLAVQH LDQGARQDFV RQQVGPLLDH DARRNSQLLA TLATWLDSGC NTAQAARELH VERQSMHHRM QRIFELCGGD PRGTGRLAAL HLATRLAAL
|
| |