Gene Noca_4962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4962 
Symbol 
ID4595344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp292952 
End bp296008 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content71% 
IMG OID639772744 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_919404 
Protein GI119714262 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0162607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGGAC CCCTTGAGGT TCGCCGCGAC GGCGTCCTGC TCACGCTGCC GTCGGGCAAG 
ACCACTGAGG TGCTGGTTCG GCTCGCCCTC GATGCCGGGC GGCCGGTCCG CACCGACCGG
ATCATCGAGG ACCTCTGGGG CGATGCCGCC ACGGGCCGGA ACACGCTCCA GTCGAAGGTG
TCCCAGCTGA GACGTGCCCT GGGCGACCCC AGTTTGGTCA CCAGCGGGAC CGGCGGCTAC
ACCCTCGATG TCGACCCCGA CCGTGTCGAT GCGTTGCAGG TCGTGGGGCT GGCCGCGTCG
GCAACCGCTG CGCGACGTGC GGGTGATCCG GCTACTGCGC TGGAGATCTC AACCGAAGGG
CTGGAGCTGT TTCGGGGCGA GGTGCTGGTC GACGCGGGAG AGGGGGACTG GCTGCTGCCG
CACCGCGCGC GCCTCGAGGA GGTGCGCCTC GGCCTGCTGG AGGACCAGCT GGCGGCACGG
GTGGACCTGG GTCTCGGCGG CGAGGTGGTC GGGGAGCTCG AGGGGCTCGT CAGCCAGCAC
CCGCTCCGCG AGGGCTTGTG GTCCTGCCTC ATCACCGCGC TCTACCGGAT GGGCCGCCAG
GCCGACGCAC TCGCGGCGTA CACCCGGGTG CGGGAAATGC TCGTCGACGA GCTCGGGGTA
GACCCAGGCC CTGGCCTGCG CGCCCTGGAG GACCAGATCC TGCAGCAGAG CCAGGCTCTC
GATCCCACCG GAGGTCGACC CGAGCTGCTT GCCGGGCCGG TGGGCAACCT GCCGGCACTG
TCCTCGTCGC TGGTGGGGCG GGCGGCCGAG GTCTCCGCTG TCGACGATCT TCTGCGCGAG
CGGCGTCTGG TGACGGTGGT CGGGCCGGCC GGCGTCGGCA AGACTCGCCT GGCTATCGAG
GTTGCCCGCG GACTCGCGCC AGCCGGGGGT GTGTGGCTGG TGCGACTCGA CGGTGTCGAT
GCCTCCGCGT CGATCCCACG GACGGTCGCG GAGACGCTGC GGTTGGCGGG CGGCGAACAG
ATGCTGGTCG AACGGTTCTC CGGCTCCGAG ACCGTCCTGG TGCTCGACAA CTGCGAACAC
GTCGTCGACG GCGTGGCCGA ACTGGCGAGC AGCCTGTTGG ATGCCACGAC CGAGCTGCGG
GTGCTGGCGA CGAGCCAGGT CCGGCTCGAC CTGGACGGCG AGACTATCTA CCAGCTGGAG
CCGCTTCCGA TCGCAGACTC CATGGCCCTC TTCACGGACC GGGCGGCCGA GATCCGCAAG
CGGTTCGTGC TCGACGACGA GACCGCGACG TCCGTCGAGG AGGTCTGCCT CTCCCTCGAC
GGACTGCCCC TGGCCATCGA GCTGGCCGCG GCCAGGGTCA GGTCCCTGTC GGTGCAGGAC
ATCGCCAGAC GACTCGACGA CCGCTTCGCG TTGCTCCAGG ACCCGACCAG CCGTCGCCCC
GAGCGGCGCC GCGCGCTCGC TGCCGCGATC GGCTGGAGCT ACGAGCTGCT CTTCCCCGAC
GAACAACGTG GACTCTGGGC GCTCTCCTGC TTCGCCGGCG GTGCACCTCT CGACGCCGCG
GAACATGTCC TCGCGGCCCT GGGCGTGCCC GCGGCGTCGG CTGTCGACGT CGTCGGCCGG
CTCGCCGATC GATCACTGGT CAGCGTCGAG GTCACCACGG AAGGCGCAGT GCGCTACCGG
CTGCTCGACA GCATCCGGGA CTTCGCGCTC GACCGGCTGC GCGAGTCCGG CCTCGACGAC
GACGCCCGCG CGGCGCACGC CGCGTGGCTC GCCGAGGCCG CCGATCGCTG CGAGGCGACC
GTGCGTGGCA AGGCACAGCC CGAGTGTCTT GCCGTGATCC GGGCCGAGCG TGCCAACATC
GACGCCGCGC TCAGTTGGTC CGCCGACCAC GGCCCGATGC TGGGGGTCCG GATCGCGACC
GGGTTCGGCT GGGCCTGGGT CGTGCACGGC GACGGCGTGG CGGGTGCAAC CCGGGTCCGA
TCCGCCCTCC AGGCAGCCGA ATCGCTCACC AAGCCGAGAG AGCGGGCAAC GGGCCTGCTG
CTCGCGGGCT GGCTCGAAAC CTCCGCCGGA AACCTCGACC AGGCCGAGAC CGATCTCGAT
GAAGCGCTCG GCTTCGCGAC GCAACGTGGC GACGACCGCC TCCGAGCCGA CGCCCACCGG
CACCTGGCCT TCCTCCGCAT CCAGCAGGGC CGCCCGCAGG ATGCGCTCGA GCTGGCCACC
GCGAGCCTCA CCGTCTACCG GCCGCTCGGC CTCGACTGGG AGGTGGCCAC GAGCCTCGTC
CTCGCGGCGT ACGCCTCGAG CATGCTCGGC GACACCACCG GTGCCACCAC AGCAGCCAAC
GAAGCCGTGG ACCTCCTCAC ACCCATCGGC GACTCCTGGG CGCTGGTCCA TGCCGACGGA
CTGCTCGGCG CCATCGCCCA GGCCGTCGGC CACCTCGACG AGGCCGCCGG CTTCCTCACC
CGGGCCGCCG AGGCTTCCGA ACGCCTTGGG TTCCTCGGGC AGGCCGCTCT CCACCTGACA
ACGCTCGGCA GGGTCGAACA TCGATCCGGC AACACAGCCA ATGCCACCGA GACCCTGAAG
CGCGCGATCG TCGCCGCCGG ACGCAGCGGC GACCTGCGCA TCGCGGCCAC GGCCCGGGTG
AACCTTGCCC GGCTGCTGCG GGGAGCGGGC CAGCCCGACG CCGCCCTCGT CCTGCTCGAA
CAGACCGACC GGTGGTACCG CACATCCGGG GGAGGCGATG GCGCCCTGCT CACCCGATGT
CTGCTCGCCG CACTCTCCTC CGCGACGGGC AGCACACGCG CCGCCGAACA GCTGAAGCCG
GTACTCGACG AAGCAGTGAG TGCTCGCGAC GCGGAAGTCC AGGTGCTCGC GATGGACGCG
CTGGCACGGA TGGCTGCCGA CCGAGGCGAC CTTGACGCGG CGCGACGGCT CCTCCGATCC
GCTGACGACC TGAGCTCTGG GATCCAGCAT GTTCTCGACG ACCTGGACCG AACCGACGCT
CACCTCGCTC GGCTACGCAT CGCCAGCGGC GCTGATCAGC CGGGCGCCCG CCGCTGA
 
Protein sequence
MLGPLEVRRD GVLLTLPSGK TTEVLVRLAL DAGRPVRTDR IIEDLWGDAA TGRNTLQSKV 
SQLRRALGDP SLVTSGTGGY TLDVDPDRVD ALQVVGLAAS ATAARRAGDP ATALEISTEG
LELFRGEVLV DAGEGDWLLP HRARLEEVRL GLLEDQLAAR VDLGLGGEVV GELEGLVSQH
PLREGLWSCL ITALYRMGRQ ADALAAYTRV REMLVDELGV DPGPGLRALE DQILQQSQAL
DPTGGRPELL AGPVGNLPAL SSSLVGRAAE VSAVDDLLRE RRLVTVVGPA GVGKTRLAIE
VARGLAPAGG VWLVRLDGVD ASASIPRTVA ETLRLAGGEQ MLVERFSGSE TVLVLDNCEH
VVDGVAELAS SLLDATTELR VLATSQVRLD LDGETIYQLE PLPIADSMAL FTDRAAEIRK
RFVLDDETAT SVEEVCLSLD GLPLAIELAA ARVRSLSVQD IARRLDDRFA LLQDPTSRRP
ERRRALAAAI GWSYELLFPD EQRGLWALSC FAGGAPLDAA EHVLAALGVP AASAVDVVGR
LADRSLVSVE VTTEGAVRYR LLDSIRDFAL DRLRESGLDD DARAAHAAWL AEAADRCEAT
VRGKAQPECL AVIRAERANI DAALSWSADH GPMLGVRIAT GFGWAWVVHG DGVAGATRVR
SALQAAESLT KPRERATGLL LAGWLETSAG NLDQAETDLD EALGFATQRG DDRLRADAHR
HLAFLRIQQG RPQDALELAT ASLTVYRPLG LDWEVATSLV LAAYASSMLG DTTGATTAAN
EAVDLLTPIG DSWALVHADG LLGAIAQAVG HLDEAAGFLT RAAEASERLG FLGQAALHLT
TLGRVEHRSG NTANATETLK RAIVAAGRSG DLRIAATARV NLARLLRGAG QPDAALVLLE
QTDRWYRTSG GGDGALLTRC LLAALSSATG STRAAEQLKP VLDEAVSARD AEVQVLAMDA
LARMAADRGD LDAARRLLRS ADDLSSGIQH VLDDLDRTDA HLARLRIASG ADQPGARR