Gene Noca_4928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4928 
Symbol 
ID4595309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp260016 
End bp263243 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content74% 
IMG OID639772711 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_919371 
Protein GI119714229 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.66885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.971064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCG TGCAGATGCT CGGCCAGCTG TCGGTCGAGC TGGACGGCGT CCCGGTGTCC 
CCGCCGGAGA GCCGGCGGGC GTGGTCACTG CTGGCCTGGC TCGCACTGAA CCCTGGGCCG
CATCCGCGCG TCGTGCTTGC GGCCCGGTTC TGGCCGGACG TTCTCGACAC CAGTGCGCGG
GCCAGCCTGC GCACCGCGAT CTGGTCGCTC CGGCAGGCGC TCGGCCCGGA CAGCGGTCGC
TGTCTGACGT CGTCGCGCGA CCACGTGGGC CTGGCGCCAG GGGAACTGTG GACCGACGTG
GCGGAGTTCG GCCGGCTGCT CGACGCCGGT CATGTCGAGA GCGCTGTCGA ACTCTGCCGT
GGGGAGGTGC TTGCGGGCCT CGAGGACGAG TGGGCGTACG AGGCCCGCGA GGAGCACCGC
GACCGACTCG CGACTGCCCT CGGCACGCTG GCGGCGGAGG AGGAGAAGCG CGGTGACCTC
GCCGCCGCTG TCGCTGCGAC GCGCCGCCAG CTGCAGTTCG CCCCATTCAC CGAACACGTA
CACGCCGACC TGATCCGCCG GCTCGCGGCC TCGGGTGACC GCGCCGCGGC GATGCTCGCC
TACCGTCGGC TCCGCGACCG GTTCCGGGAC GAGCTGGGCC TCGAGCCGTC GGCGGGAACC
CGACGGCTGG CAGCGTCCCT CCGTGCGGCG GATCCGGCCC CCGGACCGGT CAAACCGGCC
GGGCGGCCGT CCGGCGAGGG AGGCTCGGGG GTCCGGGGCC TCCCGATGGT GGGCCGGAGC
GCATCGATGG CCCTGCTCGA GGAGGCGTGG CGGACCGCCC GGACCGGTCA CGGAGGAGTC
GTCCACCTCA GCGGCGAGGC CGGCATCGGC AAGACCCGAC TGGTCGAGGA GCTCGCCGCG
CGGGCACGCG ACGAGGGTGC ACGGTCCGCG ACGTGCGCCG CCGTCGACCT GTCCGGTAGC
GCGCCCTTCG GTCTCTGGGC CGAGCTGCTG CGCGAGGTGT ACCGGGACCT GCAGCCGCCC
CGGCTCGAGG CGTCCCGCGC CACGATCCTC GCCCGTCTGC TCCCGGACCT GGCCCCTCGC
CTGGGCGTGG CTGCCCCCTC GCTCGAGATC GCGTCCCCGG ACCTGGAACG GACCCTGCTG
TTCGAAGGAA TCGTCGAGCT GGTCGAGTGG GCCTGCCGGG ACAGGCCGCT GCTGGTCGTG
ATGGAGGACG TGCACCTGGC CGACGCGCCC AGCCTGCAGC TCGTCGGGTA CGTCGCCCGC
CGCATTCGGA CGCTCCCGTT GCTCGTCGCC CTCACCAGAC GGGAGCTGCC TCGGCGTACC
GACGCCGATG CACTGCAGGA CACCCTTCGG TCCCGCGGCG TCCTCCTGCA GGAGATCCAG
CTCGGTCCCC TGCGAGACGA CGAGATCGCC TCCCTTGCCC GGACGGTCGC GGGCCTGCCG
GAGGCTGAGG TGGACAAGGT CGTCGCCGTC TCGGACGGGA ACCCGTTCCT CGCCCTCGAG
TCCGCGCGTG CCCGCGGCCG TGCCGAGACG ACACCGCCCG CGAGCCTGCG GGGCAACGTC
CGCGCCGTCT TCGGCGGCCT CGGTCCCGAC GCCCGGCTCC TGGCCGAGTT CGCGGCCGTG
GCGGGGCGGC CACTGGAGAG GGAGGAGAGG GAGGCACTCC CACTCGACCG CGGCACCGAG
GCTGCCACCG CGGCGGTGGA GAGCGGTCTG CTCGTGGCCG ACGCGAGCCG GGTGGAGTAC
CGGCACGCCC TGCTTCGGGA AGCGGTCTAC GCCGACATGT CGGCGCCGAG GCGCGCCTGG
TTGCACGAGA CCTTCGCTGC CGCACTGCAG TCGTGCGAGG CCAGACGAAC GCGACGCAGG
GCCGCCGAGG TGGCGCGACA CCTGCGACTC GCCGGGAGGG ACGAACTCGC AGTCGGGCAC
CTGGTGCGGG CCGCGGCCGA CGCACGCGCG GTCGCCGCAC TGCCCGAAGC AGTCGAGTTC
CTGATCGAGG CGGCCGGCAT CGCCCCTGAC GACGACCGGC TTTTGCTCGA CCTCTCCGAG
ATCCAGGCAT GGTTGGGACG CCGCGCCGAG GCCGACCAGG CGTTCGACCG CGCCATCGGC
CTGATCCCGA CTGGCGACTC GGACCGGCTG GCGGACGCGT GGCTCAGACG CGGCCGGTGG
CTGCGAGGAG CGCTGTGCGC GCCGCGGGCC GCCCGTGACG CCTACCGCGC GGCAGAAGCG
GCCCTCGGCT CGATCCCTGC CCCGGCACCG GAGGCACGTG CCGAAGCGCT GGCCGGTTTG
GCGTGGGCCG AGGCGGTCGC CGGCGACCTC GACGCCGTCG AGCCGCTGCT CGACCGGCTG
TCCGCACCGG CCCCTTCCAG TGCGGACACC GGTATCCACG CCTACGAGAT CGGCGCCGCC
CGAGCCTTCT GCCTAATCCG CAGGGGGCGC TTCAAAGAGA GCTACGAACC GGCGATCGCC
GCCGGCGAGG CCGCGCAAGC CGCCGGACGA CCGGACATGG CCTACGGCTG TTGGGCGAAC
GCCGCCTCGG CCGCAGCCTG CGCCGGCGAC TTCGAGCGTG CCCTGGAGTT CACCGACCAG
GGGCTCGCGG CTGTCCAACG GGTGCTGCCG ACCGGAGAGC TTCACCTGCT CGCCGCCCGC
GCGCACATCC TGACCCGGCT CGGCCGCTTC GACGAGGCTG CCGCAGCGGC CGACGCTGAG
CGGCAGCTGG CGGACCGACT GGACCGGCCG GAGTTGGTGG CCACAGCACA GCACGACGCC
GGGATGGTCG CGTTCGCCTC CGGCGACCCG GGCCGCGCCG CAGAGCTTCT GGCGGCGGCA
CTCGCGCGGG GTGCGCCGGT GAGCCGTCCC CGGGCGCGCC TGGTCCGCGC CGAGGCGCTG
GTGAGCGTCG GACGTTTGGA GGAGGCCGAG CAGGAACTGC GCGAGACCGT GCTGGAGCCG
GTGACCGAGA GCGACTTCCC CCACACCCTC GTGCCCCGGC TGACGCGCGT TCAGGGTCTT
CTGGCCGCGG CCCGCGGTGA TCGCGTCCTC GCCCGCAAGC GCCTGGGCGA GGCGGCCGAG
TCATGGCGCC GATACTCCTC GGCCGCTCAG GGACACGGTG AGGAGTACGT CGTGAACCTC
GCCGACCTCG GACGCCCCCC GGTGGAGGGG TTGATCGAGC CACTGCGAGA GCTCGATCGG
GTACTCGAAG AGATCAAGTC ACTGGACTCC GAAGTCGAGA CTGCGTGA
 
Protein sequence
MLRVQMLGQL SVELDGVPVS PPESRRAWSL LAWLALNPGP HPRVVLAARF WPDVLDTSAR 
ASLRTAIWSL RQALGPDSGR CLTSSRDHVG LAPGELWTDV AEFGRLLDAG HVESAVELCR
GEVLAGLEDE WAYEAREEHR DRLATALGTL AAEEEKRGDL AAAVAATRRQ LQFAPFTEHV
HADLIRRLAA SGDRAAAMLA YRRLRDRFRD ELGLEPSAGT RRLAASLRAA DPAPGPVKPA
GRPSGEGGSG VRGLPMVGRS ASMALLEEAW RTARTGHGGV VHLSGEAGIG KTRLVEELAA
RARDEGARSA TCAAVDLSGS APFGLWAELL REVYRDLQPP RLEASRATIL ARLLPDLAPR
LGVAAPSLEI ASPDLERTLL FEGIVELVEW ACRDRPLLVV MEDVHLADAP SLQLVGYVAR
RIRTLPLLVA LTRRELPRRT DADALQDTLR SRGVLLQEIQ LGPLRDDEIA SLARTVAGLP
EAEVDKVVAV SDGNPFLALE SARARGRAET TPPASLRGNV RAVFGGLGPD ARLLAEFAAV
AGRPLEREER EALPLDRGTE AATAAVESGL LVADASRVEY RHALLREAVY ADMSAPRRAW
LHETFAAALQ SCEARRTRRR AAEVARHLRL AGRDELAVGH LVRAAADARA VAALPEAVEF
LIEAAGIAPD DDRLLLDLSE IQAWLGRRAE ADQAFDRAIG LIPTGDSDRL ADAWLRRGRW
LRGALCAPRA ARDAYRAAEA ALGSIPAPAP EARAEALAGL AWAEAVAGDL DAVEPLLDRL
SAPAPSSADT GIHAYEIGAA RAFCLIRRGR FKESYEPAIA AGEAAQAAGR PDMAYGCWAN
AASAAACAGD FERALEFTDQ GLAAVQRVLP TGELHLLAAR AHILTRLGRF DEAAAAADAE
RQLADRLDRP ELVATAQHDA GMVAFASGDP GRAAELLAAA LARGAPVSRP RARLVRAEAL
VSVGRLEEAE QELRETVLEP VTESDFPHTL VPRLTRVQGL LAAARGDRVL ARKRLGEAAE
SWRRYSSAAQ GHGEEYVVNL ADLGRPPVEG LIEPLRELDR VLEEIKSLDS EVETA