Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2542 |
Symbol | |
ID | 7970019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 2685075 |
End bp | 2686037 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644793129 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002944434 |
Protein GI | 239815524 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.602522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCGT TGCTCAGCAC CGATGCCGTT CCGCGCGGCC AGCGGCTCGC CTACTGGACC GACATGATCT GCAACGTCTA CGTGCAGCTC GGATGCGATC CGGTGCGCGA AGGCGATAGC GGCAATTTCG AAGGCAGCAT CCGCCAGCAC ACGCTGCCGA GCCTGGACGT GTCGGTGGTC AGGTCAGGAC CGCAGAAGGT GATGCGCACG CCGGGCCACA TCTCGCGCTC GGGCGACGAC TGCTTTCTGG TCAGCATCCA GGCGCGCGGC CAGGGCGTGG TGCGGCAGGA TGGGCGCGAC GCCGTGCTGG CGGCGGGCGA CTTCGCGCTG TACGACAGCA CGCGGCCCTA CCAGCTGCTG TTCGACGACA GCTTCGAGCA GATCGTGCTC AAGCTGCCGG GCGAGCGCCT GCGCAGCGAG CTGCACCACA CCGAAGCGCT GACGGCCACC ACCGTGTCGG GGCGCGAGGG CGCGGGGCAC CTGCTGCTGG GCATGATCCG CACGCTGCGC GAGGACATCG ACACCTTGCA GCCGGCGTCG GCCCTGGCGG TGGCCAACGG CGTGCAGAGC ATTCTCGTGG CCGGGCTGCA GACGCTGCCC GCGGCGCGCT CACCGGGCCT GAGCAGTCTC ACGGCCTACC ACCTGGCCCG CGTGAAGCGG CGCATCGACG AGCAGTTGGC GGATCCTTCG CTGTCGGTGG GAAGCCTCGC GGCGCAGCTG GGCGTCTCGG CGAGCCACAT CCACCGCGTC TTCAAGAGCG AGCCGCTCAC GCCTTCGCAG TACATCTGGG AGCGCCGGCT CGAAGCCTGC AGCCGCGACC TGCTCGAGGC GCGCCTGGCC GGCCGGCCCG TGGCCGAGAT TGCCTACGGC CGCGGCTTCA ACGACGCGGC GCATTTCAGC CGCGCGTTCC GCGAACGCTT CGGCTGCTCG CCGCGCGAAT GGCGGCAGCA GCGCGTGCAA TAA
|
Protein sequence | MTPLLSTDAV PRGQRLAYWT DMICNVYVQL GCDPVREGDS GNFEGSIRQH TLPSLDVSVV RSGPQKVMRT PGHISRSGDD CFLVSIQARG QGVVRQDGRD AVLAAGDFAL YDSTRPYQLL FDDSFEQIVL KLPGERLRSE LHHTEALTAT TVSGREGAGH LLLGMIRTLR EDIDTLQPAS ALAVANGVQS ILVAGLQTLP AARSPGLSSL TAYHLARVKR RIDEQLADPS LSVGSLAAQL GVSASHIHRV FKSEPLTPSQ YIWERRLEAC SRDLLEARLA GRPVAEIAYG RGFNDAAHFS RAFRERFGCS PREWRQQRVQ
|
| |