Gene Information |
Locus tag | Noca_4962 |
Symbol | |
ID | 4595344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 292952 |
End bp | 296008 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639772744 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_919404 |
Protein GI | 119714262 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
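The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information table for Noca_4962.
start_bp, end_bp = 292952, 296008

length_bp = gene_length_bp(start_bp, end_bp)
protein_aa = expected_protein_length_aa(length_bp)

print(length_bp)   # 3057, matching "Gene Length"
print(protein_aa)  # 1018, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (3057 bp) and Protein Length (1018 aa) rows.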
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0162607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGGAC CCCTTGAGGT TCGCCGCGAC GGCGTCCTGC TCACGCTGCC GTCGGGCAAG ACCACTGAGG TGCTGGTTCG GCTCGCCCTC GATGCCGGGC GGCCGGTCCG CACCGACCGG ATCATCGAGG ACCTCTGGGG CGATGCCGCC ACGGGCCGGA ACACGCTCCA GTCGAAGGTG TCCCAGCTGA GACGTGCCCT GGGCGACCCC AGTTTGGTCA CCAGCGGGAC CGGCGGCTAC ACCCTCGATG TCGACCCCGA CCGTGTCGAT GCGTTGCAGG TCGTGGGGCT GGCCGCGTCG GCAACCGCTG CGCGACGTGC GGGTGATCCG GCTACTGCGC TGGAGATCTC AACCGAAGGG CTGGAGCTGT TTCGGGGCGA GGTGCTGGTC GACGCGGGAG AGGGGGACTG GCTGCTGCCG CACCGCGCGC GCCTCGAGGA GGTGCGCCTC GGCCTGCTGG AGGACCAGCT GGCGGCACGG GTGGACCTGG GTCTCGGCGG CGAGGTGGTC GGGGAGCTCG AGGGGCTCGT CAGCCAGCAC CCGCTCCGCG AGGGCTTGTG GTCCTGCCTC ATCACCGCGC TCTACCGGAT GGGCCGCCAG GCCGACGCAC TCGCGGCGTA CACCCGGGTG CGGGAAATGC TCGTCGACGA GCTCGGGGTA GACCCAGGCC CTGGCCTGCG CGCCCTGGAG GACCAGATCC TGCAGCAGAG CCAGGCTCTC GATCCCACCG GAGGTCGACC CGAGCTGCTT GCCGGGCCGG TGGGCAACCT GCCGGCACTG TCCTCGTCGC TGGTGGGGCG GGCGGCCGAG GTCTCCGCTG TCGACGATCT TCTGCGCGAG CGGCGTCTGG TGACGGTGGT CGGGCCGGCC GGCGTCGGCA AGACTCGCCT GGCTATCGAG GTTGCCCGCG GACTCGCGCC AGCCGGGGGT GTGTGGCTGG TGCGACTCGA CGGTGTCGAT GCCTCCGCGT CGATCCCACG GACGGTCGCG GAGACGCTGC GGTTGGCGGG CGGCGAACAG ATGCTGGTCG AACGGTTCTC CGGCTCCGAG ACCGTCCTGG TGCTCGACAA CTGCGAACAC GTCGTCGACG GCGTGGCCGA ACTGGCGAGC AGCCTGTTGG ATGCCACGAC CGAGCTGCGG GTGCTGGCGA CGAGCCAGGT CCGGCTCGAC CTGGACGGCG AGACTATCTA CCAGCTGGAG CCGCTTCCGA TCGCAGACTC CATGGCCCTC TTCACGGACC GGGCGGCCGA GATCCGCAAG CGGTTCGTGC TCGACGACGA GACCGCGACG TCCGTCGAGG AGGTCTGCCT CTCCCTCGAC GGACTGCCCC TGGCCATCGA GCTGGCCGCG GCCAGGGTCA GGTCCCTGTC GGTGCAGGAC ATCGCCAGAC GACTCGACGA CCGCTTCGCG TTGCTCCAGG ACCCGACCAG CCGTCGCCCC GAGCGGCGCC GCGCGCTCGC TGCCGCGATC GGCTGGAGCT ACGAGCTGCT CTTCCCCGAC GAACAACGTG GACTCTGGGC GCTCTCCTGC TTCGCCGGCG GTGCACCTCT CGACGCCGCG GAACATGTCC TCGCGGCCCT GGGCGTGCCC GCGGCGTCGG CTGTCGACGT CGTCGGCCGG CTCGCCGATC GATCACTGGT CAGCGTCGAG GTCACCACGG AAGGCGCAGT GCGCTACCGG CTGCTCGACA GCATCCGGGA CTTCGCGCTC GACCGGCTGC GCGAGTCCGG CCTCGACGAC GACGCCCGCG CGGCGCACGC CGCGTGGCTC GCCGAGGCCG CCGATCGCTG CGAGGCGACC 
GTGCGTGGCA AGGCACAGCC CGAGTGTCTT GCCGTGATCC GGGCCGAGCG TGCCAACATC GACGCCGCGC TCAGTTGGTC CGCCGACCAC GGCCCGATGC TGGGGGTCCG GATCGCGACC GGGTTCGGCT GGGCCTGGGT CGTGCACGGC GACGGCGTGG CGGGTGCAAC CCGGGTCCGA TCCGCCCTCC AGGCAGCCGA ATCGCTCACC AAGCCGAGAG AGCGGGCAAC GGGCCTGCTG CTCGCGGGCT GGCTCGAAAC CTCCGCCGGA AACCTCGACC AGGCCGAGAC CGATCTCGAT GAAGCGCTCG GCTTCGCGAC GCAACGTGGC GACGACCGCC TCCGAGCCGA CGCCCACCGG CACCTGGCCT TCCTCCGCAT CCAGCAGGGC CGCCCGCAGG ATGCGCTCGA GCTGGCCACC GCGAGCCTCA CCGTCTACCG GCCGCTCGGC CTCGACTGGG AGGTGGCCAC GAGCCTCGTC CTCGCGGCGT ACGCCTCGAG CATGCTCGGC GACACCACCG GTGCCACCAC AGCAGCCAAC GAAGCCGTGG ACCTCCTCAC ACCCATCGGC GACTCCTGGG CGCTGGTCCA TGCCGACGGA CTGCTCGGCG CCATCGCCCA GGCCGTCGGC CACCTCGACG AGGCCGCCGG CTTCCTCACC CGGGCCGCCG AGGCTTCCGA ACGCCTTGGG TTCCTCGGGC AGGCCGCTCT CCACCTGACA ACGCTCGGCA GGGTCGAACA TCGATCCGGC AACACAGCCA ATGCCACCGA GACCCTGAAG CGCGCGATCG TCGCCGCCGG ACGCAGCGGC GACCTGCGCA TCGCGGCCAC GGCCCGGGTG AACCTTGCCC GGCTGCTGCG GGGAGCGGGC CAGCCCGACG CCGCCCTCGT CCTGCTCGAA CAGACCGACC GGTGGTACCG CACATCCGGG GGAGGCGATG GCGCCCTGCT CACCCGATGT CTGCTCGCCG CACTCTCCTC CGCGACGGGC AGCACACGCG CCGCCGAACA GCTGAAGCCG GTACTCGACG AAGCAGTGAG TGCTCGCGAC GCGGAAGTCC AGGTGCTCGC GATGGACGCG CTGGCACGGA TGGCTGCCGA CCGAGGCGAC CTTGACGCGG CGCGACGGCT CCTCCGATCC GCTGACGACC TGAGCTCTGG GATCCAGCAT GTTCTCGACG ACCTGGACCG AACCGACGCT CACCTCGCTC GGCTACGCAT CGCCAGCGGC GCTGATCAGC CGGGCGCCCG CCGCTGA
|
Protein sequence | MLGPLEVRRD GVLLTLPSGK TTEVLVRLAL DAGRPVRTDR IIEDLWGDAA TGRNTLQSKV SQLRRALGDP SLVTSGTGGY TLDVDPDRVD ALQVVGLAAS ATAARRAGDP ATALEISTEG LELFRGEVLV DAGEGDWLLP HRARLEEVRL GLLEDQLAAR VDLGLGGEVV GELEGLVSQH PLREGLWSCL ITALYRMGRQ ADALAAYTRV REMLVDELGV DPGPGLRALE DQILQQSQAL DPTGGRPELL AGPVGNLPAL SSSLVGRAAE VSAVDDLLRE RRLVTVVGPA GVGKTRLAIE VARGLAPAGG VWLVRLDGVD ASASIPRTVA ETLRLAGGEQ MLVERFSGSE TVLVLDNCEH VVDGVAELAS SLLDATTELR VLATSQVRLD LDGETIYQLE PLPIADSMAL FTDRAAEIRK RFVLDDETAT SVEEVCLSLD GLPLAIELAA ARVRSLSVQD IARRLDDRFA LLQDPTSRRP ERRRALAAAI GWSYELLFPD EQRGLWALSC FAGGAPLDAA EHVLAALGVP AASAVDVVGR LADRSLVSVE VTTEGAVRYR LLDSIRDFAL DRLRESGLDD DARAAHAAWL AEAADRCEAT VRGKAQPECL AVIRAERANI DAALSWSADH GPMLGVRIAT GFGWAWVVHG DGVAGATRVR SALQAAESLT KPRERATGLL LAGWLETSAG NLDQAETDLD EALGFATQRG DDRLRADAHR HLAFLRIQQG RPQDALELAT ASLTVYRPLG LDWEVATSLV LAAYASSMLG DTTGATTAAN EAVDLLTPIG DSWALVHADG LLGAIAQAVG HLDEAAGFLT RAAEASERLG FLGQAALHLT TLGRVEHRSG NTANATETLK RAIVAAGRSG DLRIAATARV NLARLLRGAG QPDAALVLLE QTDRWYRTSG GGDGALLTRC LLAALSSATG STRAAEQLKP VLDEAVSARD AEVQVLAMDA LARMAADRGD LDAARRLLRS ADDLSSGIQH VLDDLDRTDA HLARLRIASG ADQPGARR
|
| |
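The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be normalized before any length or codon check. The following is a minimal sketch, not a full validator: the helper names are hypothetical, the start-codon set lists only the common initiators of translation table 11 (ATG, GTG, TTG), and the demo string is a short fragment assembled from the first block and the last eight bases of the record, not the full sequence.

```python
# Common initiators and the stop codons under translation table 11.
# The start set is deliberately reduced to the usual three (assumption).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Illustrative fragment only: first block of the record plus its final bases.
demo = "GTGCTCGGAC CCCGCTGA"

print(normalize(demo))                # GTGCTCGGACCCCGCTGA
print(looks_like_complete_cds(demo))  # True
```

Applied to the full record, the normalized gene sequence would be expected to contain 3057 bases, start with GTG (read as Met at the initiation position, consistent with the leading M of the protein sequence), and end with TGA.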