Gene Noca_4866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4866 
Symbol 
ID4595242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp196710 
End bp198110 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content70% 
IMG OID639772651 
ProductCdaR family transcriptional regulator 
Protein accessionYP_919311 
Protein GI119714169 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAGG CAGGGGGCCG ACCGGTCCAC GCCCCAGGCT CGTCGCCTCA GGACGTGACG 
CCACCTGCGG AACCGCGTTT GCCGGGCCCC GGCAGCGTGT CGTGTACTGG TAGGCCGAGG
CCAAGTGAGG CGATGCGCAT GAAGAACACC GGGGCCAACC GCAACCCGAG CAGTTCGGTG
CGTCGGACCG AGTGGGCCGA CCAGGAGTGG CTGGTTCGGG CGGCCGAAGG CGCGAGCCAG
GACGCCGGCG TACCGGTCGC TCTGCTGGGC GACTACCTGC CCATGCTTGC CGAGGCAGCC
ACGCACGGCG AGTTTCCTGG CCGGGACCAG ATCGACGCGG TCCGCCGCCA GGGCCGTCAC
GCTGCGGAGC AAGGAGTCCC GGTCGGTCGA GGCGTCGATC TCTACCTCTC CGCAGCCCGG
CGCGTGTGGA GCGAGCTGCC CGCGGTCGTT CGTGAGCGGG ACAGATCGGC TGTCCGCGCG
GCCGCTGAGG CGGTCCTGCA CGTCGTTGAC GACGCGGTCG CGGCATTTGC CGAGGGCCAC
GCCGAGGCCG GGCGCGAGTT GGTCCGCCGG GAGGAGACCC TGCGACGAGA GCTGATCGAG
GACCTGCTCC GCGGCGATGC TCATCTCAGT GACCTTGTGG AGCGGGCGGA ACCCTTTGGA
CTCGACCTGA CTCGTGCCCA CCAGGTCGCC CTCGCTCAAC CGGGCAAGCG GCTCCCGTCG
ATCGCCGCCG CAACGACTTC ACTGGAACGA GTCGTCTTGG ACCGTTTCGG TGACCGGGAC
GTCCTGGTCG CAACGAAGGA AGGATGGGTT GTCGTCATCG CCCTGGCCGA CGCCACCGGC
GCCCCACCGA CGTCCGGCGG CCTGGGAACA ACCGGTGACC TCGGAAAGAT CGTGTACGGC
GAGCTCTCCC GGCTGAGGCG GGGACGTCCC TGGCGGGTGG CGGTGGGTCG CGCTCACCCC
GGTGCGTACG GGATCGCCCG CTCCTACGAA GAGGCTCGTG AGGGCATGAC CATGGCCACC
CGCATGCAGT TGGACCGGCC GATCGTCGAG ACCCGAGACC TCCTGACCTA CCGAGTGCTC
GCCCGAGACC AGCCCGCCTT GGTCGACCTG GTGCACTCGG TGCTCAACCC GCTCCATCAG
GCCCGGGGAG GCGCCCAGCC GCTGGTGGAA ACCTTGGCGG CTTACTTCAA CTGCGGCTGC
GTCGCAACCA CGACGGCCAC CACGCTCCAC CTCTCGGTCC GGGCAGTGAC CTACCGCCTT
GACCGCGTCA AGGCACTCAC CGGTTTCGAT GCGCTCGACC CGGCCCATCG CTTCACGCTG
CAGGCGGCCG TCCTCGGCGC GAGGCTGCTC GGCTGGCCCG AACGGCCGCT CCCGCAGGCC
TCAGCCCCTG GGTCGACGTA G
 
Protein sequence
MPQAGGRPVH APGSSPQDVT PPAEPRLPGP GSVSCTGRPR PSEAMRMKNT GANRNPSSSV 
RRTEWADQEW LVRAAEGASQ DAGVPVALLG DYLPMLAEAA THGEFPGRDQ IDAVRRQGRH
AAEQGVPVGR GVDLYLSAAR RVWSELPAVV RERDRSAVRA AAEAVLHVVD DAVAAFAEGH
AEAGRELVRR EETLRRELIE DLLRGDAHLS DLVERAEPFG LDLTRAHQVA LAQPGKRLPS
IAAATTSLER VVLDRFGDRD VLVATKEGWV VVIALADATG APPTSGGLGT TGDLGKIVYG
ELSRLRRGRP WRVAVGRAHP GAYGIARSYE EAREGMTMAT RMQLDRPIVE TRDLLTYRVL
ARDQPALVDL VHSVLNPLHQ ARGGAQPLVE TLAAYFNCGC VATTTATTLH LSVRAVTYRL
DRVKALTGFD ALDPAHRFTL QAAVLGARLL GWPERPLPQA SAPGST