Gene Information |
Locus tag | Noca_1836 |
Symbol | |
ID | 4597674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1959797 |
End bp | 1960951 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776435 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_923034 |
Protein GI | 119716069 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
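
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the record itself: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. Both assumptions match the values listed here. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the last
    codon is a stop codon, which does not encode an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Noca_1836.
length_bp = gene_length(1959797, 1960951)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields.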
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGT CACCGAAGAA GGCCGGGCGC GAGGCGATCC AGGAGGTCCT CTCCGACCTG CTCGCCGACA TGCCGACGTT GTCGCGCGGC ATCGTCCGGG AGATCCGGGC CCGCGTCCCG GAGTACGACC GCGTGCCGCT CGGGGACCAT GCCGACCACG TGCAGGAGCA GCAGGAGCGG ATCGTCCGAG CCCTGATCGC GGAACGCGGA CTGGACCAGG AGGACCTCCG CCGAGCCGCC GCGCTCGGCC GCTTGCGCGC CACCCAGGGC GTGAGCGTGG AGGGCGTCAT CAGCGCCTAC CACGTCGGCA ACCGAGAGCT CTGGCGGCTC ATCGACGAGC GCGCCGACAA GGGCCGGGAG TTCCTGCCCG AGCTGGCCGG GACGATGTGG GAGAGCATCC ACGAGACCGC GACCGAGATC GCCGCGGCGC ACTCCTCCGT GGCTCGTGCC CGGCACACCC AGGACCTCAC CATGCGGCAC CGGTTCGTCG AGCTGCTCGG CCGCGAGGAG GGCGAGGCGG AGACGCTGGA GATCGCCGCG CGGCTCGGCT TCGACGTGCA CGGCCCCTTC CTCGCGGCCT GCATTTCCGC AGGTGAGCGG CCCGATGGGA TCGCCCAGAC GGTCCACGAG GAGCTCGAGT ACCTGGACGG CACGTCCTTC GCCATCCGGC AGGGCGCCGT CCTGCTCGTG CTTGCGCAAG GGCCCGACGA GGCGGCGCTG ACCGAGCTCG TCGGCCTCCT CGAACCGGTC CCGCGGGCCG GTGTCGGCCT GCGGCGCACC GGCCTCGCGG GCGCACGGGA CAGCATCCGC GACGCCGCCG AGGCGCTGGC GGCCACCACC ACCGCCGATC CCGTCGCGAG CTACGCCGTC GACTGGTGGC GCGCCTGCGT CGCGGCCCAG CTGGACCGCC TGGCGCCGGT GCTCGACGAG GCCCGCGAGG TCGCTGCCGG CAACCCGCAC CTGGTGGAGG CGGTGCGCGC GTTCGCGGAC GGAGGCTTCT CGGTGGCGGC AGCCGCCAAG CGCCTCCACG TGCACCCCAA CAGCGTCGCC TACCGGCTCG ACCGGTGGGA CCAGCTCACC GGCTGGAGCC CGCGGGAGTT CGCGGGTCTG GTGCACTCAC TGGGCGCCTG CATGGGCGTT CAGGACCTGG AATGA
|
Protein sequence | MAKSPKKAGR EAIQEVLSDL LADMPTLSRG IVREIRARVP EYDRVPLGDH ADHVQEQQER IVRALIAERG LDQEDLRRAA ALGRLRATQG VSVEGVISAY HVGNRELWRL IDERADKGRE FLPELAGTMW ESIHETATEI AAAHSSVARA RHTQDLTMRH RFVELLGREE GEAETLEIAA RLGFDVHGPF LAACISAGER PDGIAQTVHE ELEYLDGTSF AIRQGAVLLV LAQGPDEAAL TELVGLLEPV PRAGVGLRRT GLAGARDSIR DAAEALAATT TADPVASYAV DWWRACVAAQ LDRLAPVLDE AREVAAGNPH LVEAVRAFAD GGFSVAAAAK RLHVHPNSVA YRLDRWDQLT GWSPREFAGL VHSLGACMGV QDLE
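
The sequences above are displayed in space-separated blocks of 10 residues. The following is a minimal sketch of how such a field might be cleaned up and its GC fraction computed before further analysis. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is just the first two blocks of the gene sequence, used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
example = "ATGGCGAAGT CACCGAAGAA"
print(clean_sequence(example))
print(gc_fraction(example))
```

Applied to the full gene sequence field, `gc_fraction` gives a value that can be compared with the GC content field of the record.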