Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4928 |
Symbol | |
ID | 4595309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 260016 |
End bp | 263243 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639772711 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_919371 |
Protein GI | 119714229 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.66885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.971064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGCG TGCAGATGCT CGGCCAGCTG TCGGTCGAGC TGGACGGCGT CCCGGTGTCC CCGCCGGAGA GCCGGCGGGC GTGGTCACTG CTGGCCTGGC TCGCACTGAA CCCTGGGCCG CATCCGCGCG TCGTGCTTGC GGCCCGGTTC TGGCCGGACG TTCTCGACAC CAGTGCGCGG GCCAGCCTGC GCACCGCGAT CTGGTCGCTC CGGCAGGCGC TCGGCCCGGA CAGCGGTCGC TGTCTGACGT CGTCGCGCGA CCACGTGGGC CTGGCGCCAG GGGAACTGTG GACCGACGTG GCGGAGTTCG GCCGGCTGCT CGACGCCGGT CATGTCGAGA GCGCTGTCGA ACTCTGCCGT GGGGAGGTGC TTGCGGGCCT CGAGGACGAG TGGGCGTACG AGGCCCGCGA GGAGCACCGC GACCGACTCG CGACTGCCCT CGGCACGCTG GCGGCGGAGG AGGAGAAGCG CGGTGACCTC GCCGCCGCTG TCGCTGCGAC GCGCCGCCAG CTGCAGTTCG CCCCATTCAC CGAACACGTA CACGCCGACC TGATCCGCCG GCTCGCGGCC TCGGGTGACC GCGCCGCGGC GATGCTCGCC TACCGTCGGC TCCGCGACCG GTTCCGGGAC GAGCTGGGCC TCGAGCCGTC GGCGGGAACC CGACGGCTGG CAGCGTCCCT CCGTGCGGCG GATCCGGCCC CCGGACCGGT CAAACCGGCC GGGCGGCCGT CCGGCGAGGG AGGCTCGGGG GTCCGGGGCC TCCCGATGGT GGGCCGGAGC GCATCGATGG CCCTGCTCGA GGAGGCGTGG CGGACCGCCC GGACCGGTCA CGGAGGAGTC GTCCACCTCA GCGGCGAGGC CGGCATCGGC AAGACCCGAC TGGTCGAGGA GCTCGCCGCG CGGGCACGCG ACGAGGGTGC ACGGTCCGCG ACGTGCGCCG CCGTCGACCT GTCCGGTAGC GCGCCCTTCG GTCTCTGGGC CGAGCTGCTG CGCGAGGTGT ACCGGGACCT GCAGCCGCCC CGGCTCGAGG CGTCCCGCGC CACGATCCTC GCCCGTCTGC TCCCGGACCT GGCCCCTCGC CTGGGCGTGG CTGCCCCCTC GCTCGAGATC GCGTCCCCGG ACCTGGAACG GACCCTGCTG TTCGAAGGAA TCGTCGAGCT GGTCGAGTGG GCCTGCCGGG ACAGGCCGCT GCTGGTCGTG ATGGAGGACG TGCACCTGGC CGACGCGCCC AGCCTGCAGC TCGTCGGGTA CGTCGCCCGC CGCATTCGGA CGCTCCCGTT GCTCGTCGCC CTCACCAGAC GGGAGCTGCC TCGGCGTACC GACGCCGATG CACTGCAGGA CACCCTTCGG TCCCGCGGCG TCCTCCTGCA GGAGATCCAG CTCGGTCCCC TGCGAGACGA CGAGATCGCC TCCCTTGCCC GGACGGTCGC GGGCCTGCCG GAGGCTGAGG TGGACAAGGT CGTCGCCGTC TCGGACGGGA ACCCGTTCCT CGCCCTCGAG TCCGCGCGTG CCCGCGGCCG TGCCGAGACG ACACCGCCCG CGAGCCTGCG GGGCAACGTC CGCGCCGTCT TCGGCGGCCT CGGTCCCGAC GCCCGGCTCC TGGCCGAGTT CGCGGCCGTG GCGGGGCGGC CACTGGAGAG GGAGGAGAGG GAGGCACTCC CACTCGACCG CGGCACCGAG GCTGCCACCG CGGCGGTGGA GAGCGGTCTG CTCGTGGCCG ACGCGAGCCG GGTGGAGTAC CGGCACGCCC TGCTTCGGGA AGCGGTCTAC GCCGACATGT CGGCGCCGAG GCGCGCCTGG TTGCACGAGA CCTTCGCTGC CGCACTGCAG TCGTGCGAGG CCAGACGAAC GCGACGCAGG GCCGCCGAGG TGGCGCGACA CCTGCGACTC GCCGGGAGGG ACGAACTCGC AGTCGGGCAC CTGGTGCGGG CCGCGGCCGA CGCACGCGCG GTCGCCGCAC TGCCCGAAGC AGTCGAGTTC CTGATCGAGG CGGCCGGCAT CGCCCCTGAC GACGACCGGC TTTTGCTCGA CCTCTCCGAG ATCCAGGCAT GGTTGGGACG CCGCGCCGAG GCCGACCAGG CGTTCGACCG CGCCATCGGC CTGATCCCGA CTGGCGACTC GGACCGGCTG GCGGACGCGT GGCTCAGACG CGGCCGGTGG CTGCGAGGAG CGCTGTGCGC GCCGCGGGCC GCCCGTGACG CCTACCGCGC GGCAGAAGCG GCCCTCGGCT CGATCCCTGC CCCGGCACCG GAGGCACGTG CCGAAGCGCT GGCCGGTTTG GCGTGGGCCG AGGCGGTCGC CGGCGACCTC GACGCCGTCG AGCCGCTGCT CGACCGGCTG TCCGCACCGG CCCCTTCCAG TGCGGACACC GGTATCCACG CCTACGAGAT CGGCGCCGCC CGAGCCTTCT GCCTAATCCG CAGGGGGCGC TTCAAAGAGA GCTACGAACC GGCGATCGCC GCCGGCGAGG CCGCGCAAGC CGCCGGACGA CCGGACATGG CCTACGGCTG TTGGGCGAAC GCCGCCTCGG CCGCAGCCTG CGCCGGCGAC TTCGAGCGTG CCCTGGAGTT CACCGACCAG GGGCTCGCGG CTGTCCAACG GGTGCTGCCG ACCGGAGAGC TTCACCTGCT CGCCGCCCGC GCGCACATCC TGACCCGGCT CGGCCGCTTC GACGAGGCTG CCGCAGCGGC CGACGCTGAG CGGCAGCTGG CGGACCGACT GGACCGGCCG GAGTTGGTGG CCACAGCACA GCACGACGCC GGGATGGTCG CGTTCGCCTC CGGCGACCCG GGCCGCGCCG CAGAGCTTCT GGCGGCGGCA CTCGCGCGGG GTGCGCCGGT GAGCCGTCCC CGGGCGCGCC TGGTCCGCGC CGAGGCGCTG GTGAGCGTCG GACGTTTGGA GGAGGCCGAG CAGGAACTGC GCGAGACCGT GCTGGAGCCG GTGACCGAGA GCGACTTCCC CCACACCCTC GTGCCCCGGC TGACGCGCGT TCAGGGTCTT CTGGCCGCGG CCCGCGGTGA TCGCGTCCTC GCCCGCAAGC GCCTGGGCGA GGCGGCCGAG TCATGGCGCC GATACTCCTC GGCCGCTCAG GGACACGGTG AGGAGTACGT CGTGAACCTC GCCGACCTCG GACGCCCCCC GGTGGAGGGG TTGATCGAGC CACTGCGAGA GCTCGATCGG GTACTCGAAG AGATCAAGTC ACTGGACTCC GAAGTCGAGA CTGCGTGA
|
Protein sequence | MLRVQMLGQL SVELDGVPVS PPESRRAWSL LAWLALNPGP HPRVVLAARF WPDVLDTSAR ASLRTAIWSL RQALGPDSGR CLTSSRDHVG LAPGELWTDV AEFGRLLDAG HVESAVELCR GEVLAGLEDE WAYEAREEHR DRLATALGTL AAEEEKRGDL AAAVAATRRQ LQFAPFTEHV HADLIRRLAA SGDRAAAMLA YRRLRDRFRD ELGLEPSAGT RRLAASLRAA DPAPGPVKPA GRPSGEGGSG VRGLPMVGRS ASMALLEEAW RTARTGHGGV VHLSGEAGIG KTRLVEELAA RARDEGARSA TCAAVDLSGS APFGLWAELL REVYRDLQPP RLEASRATIL ARLLPDLAPR LGVAAPSLEI ASPDLERTLL FEGIVELVEW ACRDRPLLVV MEDVHLADAP SLQLVGYVAR RIRTLPLLVA LTRRELPRRT DADALQDTLR SRGVLLQEIQ LGPLRDDEIA SLARTVAGLP EAEVDKVVAV SDGNPFLALE SARARGRAET TPPASLRGNV RAVFGGLGPD ARLLAEFAAV AGRPLEREER EALPLDRGTE AATAAVESGL LVADASRVEY RHALLREAVY ADMSAPRRAW LHETFAAALQ SCEARRTRRR AAEVARHLRL AGRDELAVGH LVRAAADARA VAALPEAVEF LIEAAGIAPD DDRLLLDLSE IQAWLGRRAE ADQAFDRAIG LIPTGDSDRL ADAWLRRGRW LRGALCAPRA ARDAYRAAEA ALGSIPAPAP EARAEALAGL AWAEAVAGDL DAVEPLLDRL SAPAPSSADT GIHAYEIGAA RAFCLIRRGR FKESYEPAIA AGEAAQAAGR PDMAYGCWAN AASAAACAGD FERALEFTDQ GLAAVQRVLP TGELHLLAAR AHILTRLGRF DEAAAAADAE RQLADRLDRP ELVATAQHDA GMVAFASGDP GRAAELLAAA LARGAPVSRP RARLVRAEAL VSVGRLEEAE QELRETVLEP VTESDFPHTL VPRLTRVQGL LAAARGDRVL ARKRLGEAAE SWRRYSSAAQ GHGEEYVVNL ADLGRPPVEG LIEPLRELDR VLEEIKSLDS EVETA
|
| |