Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4823 |
Symbol | |
ID | 4595404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 149733 |
End bp | 151424 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639772610 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_919270 |
Protein GI | 119714128 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGCG CCGAACCCGA GCACCCTTTG ACCATTGAGG ACGCCGAACT GCGGGAGCAG TTGTCGAACG TCCAGGGTCT GCTGATGCTT TCGATGTTGA TGACCCAGAG CGGTGACCCG CAGAAGATCC TCCACCTCGC TGAGACCTCG GTCCCTTCGT TCGGGCCGTG CCACATCGTC GGTGCACAGG TAGACGGGAA GTGGCTCCCG GCAGCTTCCG CGAAGTTGAG CCCGGAGATG CAGGTGAGCC TGAGTGGCCA GCTGGCAAGA CTCTCATCGA CCCGTGCTGA GCTCACTCTT CCTGACCGTG CGTGGGTCAA GGCTCTGCCG TTGCGGAGCA TGCAGAGCCA GGTCGGGTAC CTGGTGATCG GGGCGGCAGC GCAACCGACG GGTAACCAGC TTTTCCTGCT TCAAGTCTTG GCCCAGCAGA CCGGTATTGC CCTGATCAAT GCGCGACTCC ACGCCAAGGA GCGGGCAACA GCGCAGGAAC TTCAAGGCGC CGTTGAGACC TTGGCGGAGA CAGTTCGAGC ACTTGAGCGA ACTACGGAGA TTCACGTCAG GTTTACGCGG GTCGCGGCAA AGAGCGAAGG ACAGGAAGGC ATCGCCCAGG CACTTCACGA GTTGACGGGG TTTCCTGTGG CTGTCGAGGA CCGCTTCGGG AATCTACGAG CTTGGGCGGG CCCTCACCGG CCGGAGCCAT ACCCGAAGGA CCCGCAGGCT CGGCGCGAGC AGATGCTCCG TCGGGCGTTG CGTGAGGGCC AGCCCGTGCG CGAAGGGGGC CGGCTTGTCG CTATTGCGAA TCCTGCTGCT GATGTACTCG GCGTGCTCGC GCTGATTGAC CCGGCCCAAC GGGCGGGAGA GCAGGAACAA ATCGCCATCG AACATGGAGC GACTGTGCTC GCCATGGAGC TGGCTCGTCT CAGAAGCCTG ATTGAGACTG AGCTTCGGCT GCGACGGGAC CTTGTTGATG AACTGCTGAG CGGCACCGAG GAGTCAAGCG CGCTGGAACG AGCGCAGGCG CTCGGTCACG ATCTCGAGAA GCCGCACCGT GTGGTGATGA TCGAAGGCAA CGGTCGAGTC CACGACAGTG GCGCCTTCTT CCTGGCCGTT CGACGCGCCG CCCGCCAAAT GCACGCTGGA ACCCTGCTGG TGGCACGCGG AGAGTCGGTG GTCGTGCTTT CCGAGGCGGA CCTCGACTGG GAGAAATTCC GCACCCTTGT GATCAAGGAG CTTGGCGGAG GGCGATGTCG AATTGGCGTC GGCGACTACT GCCAGGGTCC CGGGGGATTT CCGCAGTCCT ACCGGGAGGC TCAACTGGCG CTGCGGATGC AGGGCGTCGC GAAGGCGGAG GATCAGGCGA CGGTGTACGC CGACCTCGGC GTCTATCGAA TTCTTGCGGA GGTTGAGAAC CCGGCCGCCG TCGAGCGGTT CGTTCGTCTC TGGCTGAGTC CTTTGTTGGA CTACGACGCT CGGAAGGGCT CCGACCTCGT CCACACGTTG AGTAGATACT TGGAGTGCGG CGGCAAGTAC GACGCAACGG CGATCGAGCT GTCCGTCCAT CGCAGCACCC TGAAATACCG ACTCCAGAGG ATCCGCGAGA TCTCTGGCCA CGACCTGTCG GATCCAGACA CCGCGTTCAA CCTCCAACTG GCCACCCGGG GCTGGCAGAC GTTGCAAGGC CTGCAGGCCT GA
|
Protein sequence | MVSAEPEHPL TIEDAELREQ LSNVQGLLML SMLMTQSGDP QKILHLAETS VPSFGPCHIV GAQVDGKWLP AASAKLSPEM QVSLSGQLAR LSSTRAELTL PDRAWVKALP LRSMQSQVGY LVIGAAAQPT GNQLFLLQVL AQQTGIALIN ARLHAKERAT AQELQGAVET LAETVRALER TTEIHVRFTR VAAKSEGQEG IAQALHELTG FPVAVEDRFG NLRAWAGPHR PEPYPKDPQA RREQMLRRAL REGQPVREGG RLVAIANPAA DVLGVLALID PAQRAGEQEQ IAIEHGATVL AMELARLRSL IETELRLRRD LVDELLSGTE ESSALERAQA LGHDLEKPHR VVMIEGNGRV HDSGAFFLAV RRAARQMHAG TLLVARGESV VVLSEADLDW EKFRTLVIKE LGGGRCRIGV GDYCQGPGGF PQSYREAQLA LRMQGVAKAE DQATVYADLG VYRILAEVEN PAAVERFVRL WLSPLLDYDA RKGSDLVHTL SRYLECGGKY DATAIELSVH RSTLKYRLQR IREISGHDLS DPDTAFNLQL ATRGWQTLQG LQA
|
| |