Gene Noca_4823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4823 
Symbol 
ID4595404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp149733 
End bp151424 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content63% 
IMG OID639772610 
ProductCdaR family transcriptional regulator 
Protein accessionYP_919270 
Protein GI119714128 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCG CCGAACCCGA GCACCCTTTG ACCATTGAGG ACGCCGAACT GCGGGAGCAG 
TTGTCGAACG TCCAGGGTCT GCTGATGCTT TCGATGTTGA TGACCCAGAG CGGTGACCCG
CAGAAGATCC TCCACCTCGC TGAGACCTCG GTCCCTTCGT TCGGGCCGTG CCACATCGTC
GGTGCACAGG TAGACGGGAA GTGGCTCCCG GCAGCTTCCG CGAAGTTGAG CCCGGAGATG
CAGGTGAGCC TGAGTGGCCA GCTGGCAAGA CTCTCATCGA CCCGTGCTGA GCTCACTCTT
CCTGACCGTG CGTGGGTCAA GGCTCTGCCG TTGCGGAGCA TGCAGAGCCA GGTCGGGTAC
CTGGTGATCG GGGCGGCAGC GCAACCGACG GGTAACCAGC TTTTCCTGCT TCAAGTCTTG
GCCCAGCAGA CCGGTATTGC CCTGATCAAT GCGCGACTCC ACGCCAAGGA GCGGGCAACA
GCGCAGGAAC TTCAAGGCGC CGTTGAGACC TTGGCGGAGA CAGTTCGAGC ACTTGAGCGA
ACTACGGAGA TTCACGTCAG GTTTACGCGG GTCGCGGCAA AGAGCGAAGG ACAGGAAGGC
ATCGCCCAGG CACTTCACGA GTTGACGGGG TTTCCTGTGG CTGTCGAGGA CCGCTTCGGG
AATCTACGAG CTTGGGCGGG CCCTCACCGG CCGGAGCCAT ACCCGAAGGA CCCGCAGGCT
CGGCGCGAGC AGATGCTCCG TCGGGCGTTG CGTGAGGGCC AGCCCGTGCG CGAAGGGGGC
CGGCTTGTCG CTATTGCGAA TCCTGCTGCT GATGTACTCG GCGTGCTCGC GCTGATTGAC
CCGGCCCAAC GGGCGGGAGA GCAGGAACAA ATCGCCATCG AACATGGAGC GACTGTGCTC
GCCATGGAGC TGGCTCGTCT CAGAAGCCTG ATTGAGACTG AGCTTCGGCT GCGACGGGAC
CTTGTTGATG AACTGCTGAG CGGCACCGAG GAGTCAAGCG CGCTGGAACG AGCGCAGGCG
CTCGGTCACG ATCTCGAGAA GCCGCACCGT GTGGTGATGA TCGAAGGCAA CGGTCGAGTC
CACGACAGTG GCGCCTTCTT CCTGGCCGTT CGACGCGCCG CCCGCCAAAT GCACGCTGGA
ACCCTGCTGG TGGCACGCGG AGAGTCGGTG GTCGTGCTTT CCGAGGCGGA CCTCGACTGG
GAGAAATTCC GCACCCTTGT GATCAAGGAG CTTGGCGGAG GGCGATGTCG AATTGGCGTC
GGCGACTACT GCCAGGGTCC CGGGGGATTT CCGCAGTCCT ACCGGGAGGC TCAACTGGCG
CTGCGGATGC AGGGCGTCGC GAAGGCGGAG GATCAGGCGA CGGTGTACGC CGACCTCGGC
GTCTATCGAA TTCTTGCGGA GGTTGAGAAC CCGGCCGCCG TCGAGCGGTT CGTTCGTCTC
TGGCTGAGTC CTTTGTTGGA CTACGACGCT CGGAAGGGCT CCGACCTCGT CCACACGTTG
AGTAGATACT TGGAGTGCGG CGGCAAGTAC GACGCAACGG CGATCGAGCT GTCCGTCCAT
CGCAGCACCC TGAAATACCG ACTCCAGAGG ATCCGCGAGA TCTCTGGCCA CGACCTGTCG
GATCCAGACA CCGCGTTCAA CCTCCAACTG GCCACCCGGG GCTGGCAGAC GTTGCAAGGC
CTGCAGGCCT GA
 
Protein sequence
MVSAEPEHPL TIEDAELREQ LSNVQGLLML SMLMTQSGDP QKILHLAETS VPSFGPCHIV 
GAQVDGKWLP AASAKLSPEM QVSLSGQLAR LSSTRAELTL PDRAWVKALP LRSMQSQVGY
LVIGAAAQPT GNQLFLLQVL AQQTGIALIN ARLHAKERAT AQELQGAVET LAETVRALER
TTEIHVRFTR VAAKSEGQEG IAQALHELTG FPVAVEDRFG NLRAWAGPHR PEPYPKDPQA
RREQMLRRAL REGQPVREGG RLVAIANPAA DVLGVLALID PAQRAGEQEQ IAIEHGATVL
AMELARLRSL IETELRLRRD LVDELLSGTE ESSALERAQA LGHDLEKPHR VVMIEGNGRV
HDSGAFFLAV RRAARQMHAG TLLVARGESV VVLSEADLDW EKFRTLVIKE LGGGRCRIGV
GDYCQGPGGF PQSYREAQLA LRMQGVAKAE DQATVYADLG VYRILAEVEN PAAVERFVRL
WLSPLLDYDA RKGSDLVHTL SRYLECGGKY DATAIELSVH RSTLKYRLQR IREISGHDLS
DPDTAFNLQL ATRGWQTLQG LQA