Gene Noca_4892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4892 
Symbol 
ID4595273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp223083 
End bp224876 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content69% 
IMG OID639772677 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_919337 
Protein GI119714195 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.123982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCGTT CCGCGAGCGT TCCGATCTGT GGACTGGTCG GAGCCGTGGG CGGCTCATAC 
GATTCGGAAG TGGAAAAGGT GGCGATCCAA CTCCTGGGCG GGTTCTCGGT CACGGTCGAC
GGCACCCCCG TTGCCGGGGA CAGCTGGCGG AGCCGGCGCG CGGCCGATGT GCTCAAGTTG
CTCGCGCTCT CGCCTGACCG CCGGCTCCAC CGGTCGCAGG TGATGGAGGC GTTCTGGCCC
GACAGCGATC CGCAGGCGTC CGGCACCAGC CTCCGCAAGG CGCTGCACTT CGCTCGACGC
GCCACGGGAG ACGAACAGGT GATCGTGAGC GAGCAGGGCT TGCTCGTGCT GTGGCCGCAC
GCCGAGGTCG ACATCGACGC CGAGCGCTTC GAAACCGCGG CACGCCGGGC GCTGGCGACA
GATAACTCCG CGGCCTGCCG CGACGTCGTG GACTTGTATG GCGGCGACCT GCTTCCCGAT
GATCGCTACG AGTCCTGGTT GGCCGAGCCA CAACACCGGT TGCGGCAGCG CTACCTGGAT
TTGCTGCGCG TCGGATCTCT GTGGGCGCGG CTGGCCGAAG AAGACCCCAC CGACGAGCAG
GCAGCCCGCT CGCTCATGCG CGCCCACCTC GACGCCGGCG AGCGCCGCGA GGCGATCCGG
CGCTTCGAAA GGCTCCGCGA GGCCCTGCAC GACCAACTCG GCGTCGGGCC GGACCGCGCG
ACAATCGCGC TCTACGAGGA AGTCCTCGCT GTCGAAGGCG CCCCCAAACC CACGGAGGCC
GAACGGGCGC ACGCGCTGCT GGCCTGGGCT CTGGTGCACA TGAACCGAAA CGAGATCGAG
GAAGCCGAGC GCGCCGCAGA GGAAGCGCGC GCCATTGCCC TCGACTCTGG CCTGGGCCGC
GAACTCGGCG AAGCCGCGGT CATCCTGGCC AAAGTCGCCA TGGCTCAAGG CCGATGGCGA
GAACGATTCG CCGAGGAACT CGGCGAATCC ATGCGGCTGC GGGCCAACAT GGAGCCCATC
GTGTACGACG CCCACCTGTG CCTCGCCGAG TACTACCTCG CGGCGCCCGA CGGCTACGAC
CTCGCCGCAG ACTTCGCCCG CCAGATGATG CAGATCGCCG ACGAGGCCGG GTCGGCGACC
GGTGCCGCCC TCGCCACGCT CATGCTCGGC GAAGCCGAAC TACTCGCCGG CCACCTCATC
GAGGCCGAAC AACACCTCAA GGAGGCGGCC GAAGCCAACG ACCACGAAGG CTGCCTCTCC
GGGTCAGCCC TCGCCCGACA ACGGCTCGCC GAAGCCGCTG TGATCAACGG CCGCAAGTTC
GACGCCAACC GACTGCTCAC CCGTGCCCGC TCCATCGCGG TCCGCTCCGA CCTCGCCAGC
CACCTCATGG TTCGCGTCTT CGGCACCATG ATCCAGGCGG CCGACCCAAC GCACACCCTC
ACCGTTCTGC GCACGGCAGA GCGCGAGCTC GCCCAGATGC GATCGTGCGA GCCCTGCTCG
ATGGGCTACC TGACCAGCGC CGCGGCCGCC TGCGCACGAG CCGGCGAACT GGACCGAGCA
CGCGCCTTCA TCACCGAAGC AGAGCGGATC GCCGGCATGT GGCAAGGCGG CCTGTGGAAC
GGCGCCGTCT GGGAAGCTCG CGGCGTACTC CGCCACGCCG AAGGCGCGCC CGAACAGGCC
CGTGCGATGT ACCGAGAAGC CGCACAGGAA TACACGCGCG CTGGAAACCA GTCCGATGCT
GCCCGGTGCG CGGAAGCCGC CAGCCAGCTG CAGAACCACG CCACCGCGGA GTAG
 
Protein sequence
MDRSASVPIC GLVGAVGGSY DSEVEKVAIQ LLGGFSVTVD GTPVAGDSWR SRRAADVLKL 
LALSPDRRLH RSQVMEAFWP DSDPQASGTS LRKALHFARR ATGDEQVIVS EQGLLVLWPH
AEVDIDAERF ETAARRALAT DNSAACRDVV DLYGGDLLPD DRYESWLAEP QHRLRQRYLD
LLRVGSLWAR LAEEDPTDEQ AARSLMRAHL DAGERREAIR RFERLREALH DQLGVGPDRA
TIALYEEVLA VEGAPKPTEA ERAHALLAWA LVHMNRNEIE EAERAAEEAR AIALDSGLGR
ELGEAAVILA KVAMAQGRWR ERFAEELGES MRLRANMEPI VYDAHLCLAE YYLAAPDGYD
LAADFARQMM QIADEAGSAT GAALATLMLG EAELLAGHLI EAEQHLKEAA EANDHEGCLS
GSALARQRLA EAAVINGRKF DANRLLTRAR SIAVRSDLAS HLMVRVFGTM IQAADPTHTL
TVLRTAEREL AQMRSCEPCS MGYLTSAAAA CARAGELDRA RAFITEAERI AGMWQGGLWN
GAVWEARGVL RHAEGAPEQA RAMYREAAQE YTRAGNQSDA ARCAEAASQL QNHATAE