Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5980 |
Symbol | |
ID | 7975019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | + |
Start bp | 707864 |
End bp | 708937 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644796543 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002947817 |
Protein GI | 239820632 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATCC TCTCGACCTG GCTCGACGCT GTCGTGGATC AGTTGGAAGT GCAGGATCTG AACCCAAACG ATCTCACCGC TGGCCTGTGG GGTCCGGGCC TGGCGCGTCT CGCACCCACA CGCCAGGTCG GCCTGGTACT CGCTCGGCGT TTGTGGCAAC GGGCGGCCAA AGTCTCCACT GACCCGCTTC TTGGCCTGAA GGTCGGCATC GGATTGCCGT TGCAGGCCAT GAACGTATCG GGGCTGCTGA TGATGCACAG TCCAACGTTG CGGGAGGCTT TGACGCACAT GGAACGCTAC CAGCAGCTCG TGAGCAACAG CGGGCGCGTG ACGGCGCACA AGGTGCCGGG CGGCCTGGAG CTGGGCTATA TCGTTACGCC GTGCCATGTC GACATGCACC ACATGCAGAT TGACTCGGTG TTCGGCGGGA TACTGGCTTT CCTCAAGCGC TGTAGCACGC GAAGCATTGC ACCCAGGCAC ATCGCCCTCA CCGCGCCAGA CAAGCTGCTC GGCTCGAGTT ACGCAACGCT ACTTGGCAGT CCGGTCACGC TCGGCGAGCG CAACGTGTGC GTTGCGTATG ACGACCAAAC GCTCGACCAA CCCTTTCAAG GCGCCGACCC CGCTCTGCTG GCATTGCTCC GCGCGCAAGC CGACGGCCTG CTGCGGGCAC AAAACTCATC CGACTCGCTT GAGGCCGCTG TGAGGGCGGC GATTGGTCAA CGTGGTTTCG GCCACGTGTC GTGTGACGAT GTGGCAAATG ACCTCGGCAT CACTTCGCGC ACGCTGCAGC GGCGCCTCAG CCAGAACGGC ATGCCTTTTC GTCGTGTCCT CGAGGCAGCC CGAATGGAAG AAGCCCTGCT TCTTTTGACT CATGGCAGCA TGCCCCTGCC CGACGTGGCT GAACACCTGG GCTACACCGA GCTCAGTTCC TTCTGGCATG CCGTTAAATC GTGCTGGGGC TCGACACCGC GCGAACTGCG CAACAATGCC AACGAGCTCG CAAGCATGGC GTCGATGCAT ATGAATCCAC CGGGGAAATC ACACCCCTCT TCAACGACCC ACGCAATTCG TTGA
|
Protein sequence | MTILSTWLDA VVDQLEVQDL NPNDLTAGLW GPGLARLAPT RQVGLVLARR LWQRAAKVST DPLLGLKVGI GLPLQAMNVS GLLMMHSPTL REALTHMERY QQLVSNSGRV TAHKVPGGLE LGYIVTPCHV DMHHMQIDSV FGGILAFLKR CSTRSIAPRH IALTAPDKLL GSSYATLLGS PVTLGERNVC VAYDDQTLDQ PFQGADPALL ALLRAQADGL LRAQNSSDSL EAAVRAAIGQ RGFGHVSCDD VANDLGITSR TLQRRLSQNG MPFRRVLEAA RMEEALLLLT HGSMPLPDVA EHLGYTELSS FWHAVKSCWG STPRELRNNA NELASMASMH MNPPGKSHPS STTHAIR
|
| |