Gene Noca_3971 details


Gene Information

Locus tag: Noca_3971
Symbol:
ID: 4598106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4187894
End bp: 4189129
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 74%
IMG OID: 639778576
Product: transcriptional regulator domain-containing protein
Protein accession: YP_925155
Protein GI: 119718190
COG category: [K] Transcription; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3284] Transcriptional activator of acetoin/glycerol metabolism
TIGRFAM ID:
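
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4187894, 4189129)
print(length)                              # 1236
print(expected_protein_length_aa(length))  # 411
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.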


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTTG ACGCGGATCT CCAGGCGCGA CGCCTCGACG CCGTGCGCGC GTGGACGTCG 
TTCGTCGAGC GCGGCGACGA CGCGGCGGGA CTGGTGCGTC CCGAGATCCT GAGCAGCTGG
ACCCGCTCCG AGGCCGCCGT ACGCACCGAT GTCAGCGAGG CGCCGCTCGC CGACGAGGCC
GACACGGCGG CCCGCTGGCG CGGCTCGCCC CTGCAGGCCG CGGTGGAGCG GGTCGAGGCG
GAGCTGCGGC GTACCGCCGA GGACGGCGAC CTCGTCGTCG CCATCACCGA CGCCCAGACC
CGGATCCTGT GGACGTACGG CGGGCGGGTG ATGCGCCGCA AGGCCGAGAC CGTGAACTTC
GTGGTCGGTG GACGCTGGGA CGACCAGAGC GTGGGCACCA ACGCCCTCGA CCTCGCCAAC
CGGCTGGCCG CCCCGGCGAT GGTCTTCAGC GCCGAGCACT ACGCGCCGAT CGTGCACAAC
TGGGTCTGTT GGGCCGCGCC CGTGCACGAC CCGGTGACCG GCGCGCAGCT CGGCGTCATC
GACCTGTCCA CCACCTGGGA TCGGACCCAC CCGATCGGCC TGGCGACCGC GCGAGTGCTG
GCCCGGCTGA TCGAGACTGC GATGCCGGTC TCCGCGTACC ACCCCACCGC GACCGCCGAC
GAGGGCACCG AGCCGGGCCT CGTGATGCGG CTGCTCGGCA CCGCCGAGAC CTGGCTCGAC
GGGCAGCGGC TGCTGCTCAA TCGCCGGCAG ACCGAGGTCC TCGCGCTGCT CGCCATGCAC
CCGGAGGGGC TCTCGCTGGA GCACCTGCAC GCGCTGGTCT ACGGCGACCA GGCGGTCACC
CTGTCCACGC TCAAGGCCGA GGTGTCGCAC CTGCGCTCCG CGTTGGGCGG CCAGCTCACC
TCGCGGCCCT ACCGGTTGCC GATGCCGATC ACGACCGACG TCGACCTGGT GCTCGGGCTG
CTCCGCCGGG GCCGGGTCGC CGCGGCGGTC GACGCCTACG GAGGCGACCT CCTGCCCGGC
ACCAACTCCC CGGCGCTCAC CGAGCTGGGG GAGTACGTCG CGGTCGCGGT CCGCGAGGCC
CTGCTCACCG ACCCGCAGCC CGATGCGGTG GTCCGCTACG GCGAGCTGGC GCCGTACGAC
ACCGAGGTGG TCGAGGTCTG CCTGGCCGCC CTCGGCGGCC GCGCCCACCC CGCCGTACCC
CTGCTCAAGG CCCGCCTCGC CGCCGCCGCC CGCTGA
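
The GC content field in the record can be recomputed from the gene sequence. The sketch below is one way to do this, assuming the sequence is pasted as plain text in the 10-base blocks shown above. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is removed first.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First 10-base block of the gene sequence: 4 G and 1 C out of 10 bases.
print(gc_content("ATGACGGTTG"))  # 0.5
```

Passing the full gene sequence to the function gives the overall fraction, which can then be compared with the percentage listed under Gene Information.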
 
Protein sequence
MTVDADLQAR RLDAVRAWTS FVERGDDAAG LVRPEILSSW TRSEAAVRTD VSEAPLADEA 
DTAARWRGSP LQAAVERVEA ELRRTAEDGD LVVAITDAQT RILWTYGGRV MRRKAETVNF
VVGGRWDDQS VGTNALDLAN RLAAPAMVFS AEHYAPIVHN WVCWAAPVHD PVTGAQLGVI
DLSTTWDRTH PIGLATARVL ARLIETAMPV SAYHPTATAD EGTEPGLVMR LLGTAETWLD
GQRLLLNRRQ TEVLALLAMH PEGLSLEHLH ALVYGDQAVT LSTLKAEVSH LRSALGGQLT
SRPYRLPMPI TTDVDLVLGL LRRGRVAAAV DAYGGDLLPG TNSPALTELG EYVAVAVREA
LLTDPQPDAV VRYGELAPYD TEVVEVCLAA LGGRAHPAVP LLKARLAAAA R
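
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G with the third base varying fastest),
# paired with the standard amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACGGTTG ACGCGGATCT CCAGGCGCGA"))  # MTVDADLQAR
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows the whole protein to be compared block by block.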