Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4892 |
Symbol | |
ID | 4595273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | + |
Start bp | 223083 |
End bp | 224876 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639772677 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_919337 |
Protein GI | 119714195 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.123982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCGTT CCGCGAGCGT TCCGATCTGT GGACTGGTCG GAGCCGTGGG CGGCTCATAC GATTCGGAAG TGGAAAAGGT GGCGATCCAA CTCCTGGGCG GGTTCTCGGT CACGGTCGAC GGCACCCCCG TTGCCGGGGA CAGCTGGCGG AGCCGGCGCG CGGCCGATGT GCTCAAGTTG CTCGCGCTCT CGCCTGACCG CCGGCTCCAC CGGTCGCAGG TGATGGAGGC GTTCTGGCCC GACAGCGATC CGCAGGCGTC CGGCACCAGC CTCCGCAAGG CGCTGCACTT CGCTCGACGC GCCACGGGAG ACGAACAGGT GATCGTGAGC GAGCAGGGCT TGCTCGTGCT GTGGCCGCAC GCCGAGGTCG ACATCGACGC CGAGCGCTTC GAAACCGCGG CACGCCGGGC GCTGGCGACA GATAACTCCG CGGCCTGCCG CGACGTCGTG GACTTGTATG GCGGCGACCT GCTTCCCGAT GATCGCTACG AGTCCTGGTT GGCCGAGCCA CAACACCGGT TGCGGCAGCG CTACCTGGAT TTGCTGCGCG TCGGATCTCT GTGGGCGCGG CTGGCCGAAG AAGACCCCAC CGACGAGCAG GCAGCCCGCT CGCTCATGCG CGCCCACCTC GACGCCGGCG AGCGCCGCGA GGCGATCCGG CGCTTCGAAA GGCTCCGCGA GGCCCTGCAC GACCAACTCG GCGTCGGGCC GGACCGCGCG ACAATCGCGC TCTACGAGGA AGTCCTCGCT GTCGAAGGCG CCCCCAAACC CACGGAGGCC GAACGGGCGC ACGCGCTGCT GGCCTGGGCT CTGGTGCACA TGAACCGAAA CGAGATCGAG GAAGCCGAGC GCGCCGCAGA GGAAGCGCGC GCCATTGCCC TCGACTCTGG CCTGGGCCGC GAACTCGGCG AAGCCGCGGT CATCCTGGCC AAAGTCGCCA TGGCTCAAGG CCGATGGCGA GAACGATTCG CCGAGGAACT CGGCGAATCC ATGCGGCTGC GGGCCAACAT GGAGCCCATC GTGTACGACG CCCACCTGTG CCTCGCCGAG TACTACCTCG CGGCGCCCGA CGGCTACGAC CTCGCCGCAG ACTTCGCCCG CCAGATGATG CAGATCGCCG ACGAGGCCGG GTCGGCGACC GGTGCCGCCC TCGCCACGCT CATGCTCGGC GAAGCCGAAC TACTCGCCGG CCACCTCATC GAGGCCGAAC AACACCTCAA GGAGGCGGCC GAAGCCAACG ACCACGAAGG CTGCCTCTCC GGGTCAGCCC TCGCCCGACA ACGGCTCGCC GAAGCCGCTG TGATCAACGG CCGCAAGTTC GACGCCAACC GACTGCTCAC CCGTGCCCGC TCCATCGCGG TCCGCTCCGA CCTCGCCAGC CACCTCATGG TTCGCGTCTT CGGCACCATG ATCCAGGCGG CCGACCCAAC GCACACCCTC ACCGTTCTGC GCACGGCAGA GCGCGAGCTC GCCCAGATGC GATCGTGCGA GCCCTGCTCG ATGGGCTACC TGACCAGCGC CGCGGCCGCC TGCGCACGAG CCGGCGAACT GGACCGAGCA CGCGCCTTCA TCACCGAAGC AGAGCGGATC GCCGGCATGT GGCAAGGCGG CCTGTGGAAC GGCGCCGTCT GGGAAGCTCG CGGCGTACTC CGCCACGCCG AAGGCGCGCC CGAACAGGCC CGTGCGATGT ACCGAGAAGC CGCACAGGAA TACACGCGCG CTGGAAACCA GTCCGATGCT GCCCGGTGCG CGGAAGCCGC CAGCCAGCTG CAGAACCACG CCACCGCGGA GTAG
|
Protein sequence | MDRSASVPIC GLVGAVGGSY DSEVEKVAIQ LLGGFSVTVD GTPVAGDSWR SRRAADVLKL LALSPDRRLH RSQVMEAFWP DSDPQASGTS LRKALHFARR ATGDEQVIVS EQGLLVLWPH AEVDIDAERF ETAARRALAT DNSAACRDVV DLYGGDLLPD DRYESWLAEP QHRLRQRYLD LLRVGSLWAR LAEEDPTDEQ AARSLMRAHL DAGERREAIR RFERLREALH DQLGVGPDRA TIALYEEVLA VEGAPKPTEA ERAHALLAWA LVHMNRNEIE EAERAAEEAR AIALDSGLGR ELGEAAVILA KVAMAQGRWR ERFAEELGES MRLRANMEPI VYDAHLCLAE YYLAAPDGYD LAADFARQMM QIADEAGSAT GAALATLMLG EAELLAGHLI EAEQHLKEAA EANDHEGCLS GSALARQRLA EAAVINGRKF DANRLLTRAR SIAVRSDLAS HLMVRVFGTM IQAADPTHTL TVLRTAEREL AQMRSCEPCS MGYLTSAAAA CARAGELDRA RAFITEAERI AGMWQGGLWN GAVWEARGVL RHAEGAPEQA RAMYREAAQE YTRAGNQSDA ARCAEAASQL QNHATAE
|
| |