Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5408 |
Symbol | |
ID | 7975869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | + |
Start bp | 112206 |
End bp | 115067 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644795996 |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_002947270 |
Protein GI | 239820085 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.17847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGGCA TCGCGGCGCG CTCCACGTCA TCGCAGGCGC ATGCCGGCGC GTCGCAGGGG CCGAGCGAGG TCCGCTGGCG CTTTGGCGCG TTCATCGTCT GGGAGGCCCA GCGCCGGCTC GAAAGACTCG GGCAGAGCGT GCGCCTGGGC TCCCGCTCGT TCGAACTGCT GCTGCAACTG CTCAGGCGCG TGGGCGAAGT CGTGGGCAAG GACGAGCTGC TCGCCACCGT CTGGGCCGGG GTGGTGGTCG AGGAGTCCAG TGTGCGGGTC CACATCTCCG CGCTCCGCAA GGCCTTGGGG GAGCCGCAAG ACGGGGACAA CTGCAAGGAA TGGATTTCGA ACATTCCCCT GCGCGGCTAC CGGTTCAATG GAACGGTTTT TCGCGAACAG GTCGGCATGC CGACCGAGGA TCGGCCGGGG GCGGCGGCCC TGGCGTTCGC GAAGCTGCCG GAGCGCCTCA CCCGGCTTGT GGGCCGCGAC GCCGACATGG CGCGCGTGCT GGCGGCGCTC GCCACGCGCC GGCTGGTCAC CATCGTCGGC GCCGGCGGCA TCGGGAAGAC CCGTGTCGCC ATCCATGCGG CCGATTGCCA TGCAGAACGC ACGGCCATGC AATTGGCCTT CATCGATCTC TCGCCGCTGG TCTCGCAGGC CCATCTGGCG AGCACGGTGG CCCGGTCGCT CGGCGCGCCG CCGGACACCC CCGACACGAC GCGGGCCATC CTCCAGAGGC TGGCGGACCG CGATGTGCTC CTGCTGATCG ACAACTGCGA GCACATGCTG GACGCACTGG CGCCGCTCAT CGCCGAACTG CTCGGCGCCC TGCCCGACCT GCGCGTCCTC GCAACCAGCC GGGAGGCCAT TCGCATCGAG GGAGAACACG TCGAGCGGCT GTCGCCGCTC GCGGTGCCCG ACGCGCAGTG TGCCGACCTG GCCGAAGCCA TGGCGTCGCC GGCGGTCGAA CTGCTGGTCG AGCGTGCGAA GGCCGTGGGT GCCCGCGCGT TCGAGGATTC CGATGGCCGG CTGCTCGCCG CAGTCGCCCG GCAGGTGGAC GGCATCCCGC TCGCCATCGA ACTGGTGGCT GCACGCCTGG GCGTGCAGCC GATCGGCGAT CTTGCGTTCC GGCTGAATGA CCACATGCGG CTCTATTCCG CGGGCAGCAG GGCCGTCCTG CCCCGGCACA GCACGCTCGC GGCGGCGCTG GACTGGAGCA TCGCGCTGCT GGACGATGCG GAACTGCGGC TCTTCCGGCG GCTCTCGGTC TTCCGGGGGC GCTTCGATGT CGAGTCGGCG CTGAGCGTCA CCCGGGCGGA CATGGACTCC GAAGTGGCGT TCGATGCATT GATCTCGCTG GCGAACAAGT CGCTGGTGTC GTTCGACAAC AGCGACGCCA TCGCGCCCTA CCGGCTGCTC GACACGACGC GAAGCTACGC CGCCGCGCTT CTCGCGCAAA CCGATGAAGG CCCGGCAATG CGGCGGCGCC ATGCGCTCTT CATGCGCGAG CTGATGGGCG CCGCGACCTC GGATTCGGGC GAACTCACCG TGCAGGCCTG GAACGCGCGC TACGCACACC GGCTGGACGA CGTGCGCAGT GCGTTGGACG ACTGCCTGGC GCAGCAGGAC GATTGGGAGA CCGGCGCCGC GCTCACCATC GCTTCGGCGC CCCTGTGGTT TCACGTCTCG CAGGTCGAGG AGTACCGTGA CCGGGTCATC GCCGCGCTCG CGTGCCTCGC GCAACAGCCG GGCACGGGCA CGGAGACAGA GGCCTGGCTG CAGATCGCAC TGGGCAATGC GCTGTGGCAC ACACGGGGAC CGGTACCGGA AATGGGCGCC GCCTACGATC GGGCGCTTGC TGTCGCCATG TCGGCCAGCT CGGTCGTGCT GGAGCTTCAG GCCCGCTGGG GCATCTGCGT GCTGCATGCG ATCCGCGGCG AGTATGCGGC CGCGTTGCAC CACTCGCAGG TGCTGTTCGA ATTCGCGCAA TCGACGCCCG ACCCCGCCGC GCTCAACCTG GCGCACCGAA TGACCGCGCT GGCGAGTCAC TTCTGCGGGG ATTTTTCTGC GGCGAGTGCC GGCGCCCAGG CGGCGATCCT TGTGGGCAGC ACCGTGCGCC AGACGCCCGT CAACTTGTTC CAGGTCGATG CCGCCGTCGC ATCGAATGCC TTGCTGGCCC GCACGCTCTG GCTCCAGGGC GATGCGGCGA AGGCCATGGC TACCGCCACC CGAGCGGTGG CTCTCGCCGA GGCCGGGGGC AATGCGCTGT CTCTGTGCTT CGCGCTGTTC GGCACATGCC CCGTCGCCCT CTGGTCGGGC GAGCTGGAAC TGGCGCGCAA ATGGGTTCGC ATGCTGCTGG ACGAGGCCCA GCGCAGAGGG CTGGCGTACT GGCACCAGTG GGCACACTGC TATGCGCTGG GCCTGCAGGC GCGCACCGCG GACGACCGGG ACCGCCATGT TCGCCGAGTC GCGCAGCAAC TCGACGATTT CGACGCGCCA CGCAAGGAAA TGCTGGTGAC CTTCTGCGCC GACTGGATCG ACGACGAGAC GATCGCGCGC GCCGGTGCCG GGCACGGCCA ATGGAGCGCG GCCGAGACCT GGCGCGCTGC GGGCCAGCGT TGCGAGCAGC GCGGCCTCGA CGATGAAGCC GAGGCGTTCT ACTTGCGTGC CATCGGCACC GCCAGGCAGC AAGGCGCGCT GGGATGGGAA TTTCGTGCGG CGATCAGCGC CTCCCAGCTG TGGGTTCGCC GGGGCAAGGC GAAGGACGCC CTCGACCTGC TGGACGAGGT CTGCGCGCGC GCGGCACCTC GGGGCGAGCA CCCCGGCCTT GCGCAAGCCC GCGCACTGCG CAGCGCACTC TCCCGGAACT GA
|
Protein sequence | MYGIAARSTS SQAHAGASQG PSEVRWRFGA FIVWEAQRRL ERLGQSVRLG SRSFELLLQL LRRVGEVVGK DELLATVWAG VVVEESSVRV HISALRKALG EPQDGDNCKE WISNIPLRGY RFNGTVFREQ VGMPTEDRPG AAALAFAKLP ERLTRLVGRD ADMARVLAAL ATRRLVTIVG AGGIGKTRVA IHAADCHAER TAMQLAFIDL SPLVSQAHLA STVARSLGAP PDTPDTTRAI LQRLADRDVL LLIDNCEHML DALAPLIAEL LGALPDLRVL ATSREAIRIE GEHVERLSPL AVPDAQCADL AEAMASPAVE LLVERAKAVG ARAFEDSDGR LLAAVARQVD GIPLAIELVA ARLGVQPIGD LAFRLNDHMR LYSAGSRAVL PRHSTLAAAL DWSIALLDDA ELRLFRRLSV FRGRFDVESA LSVTRADMDS EVAFDALISL ANKSLVSFDN SDAIAPYRLL DTTRSYAAAL LAQTDEGPAM RRRHALFMRE LMGAATSDSG ELTVQAWNAR YAHRLDDVRS ALDDCLAQQD DWETGAALTI ASAPLWFHVS QVEEYRDRVI AALACLAQQP GTGTETEAWL QIALGNALWH TRGPVPEMGA AYDRALAVAM SASSVVLELQ ARWGICVLHA IRGEYAAALH HSQVLFEFAQ STPDPAALNL AHRMTALASH FCGDFSAASA GAQAAILVGS TVRQTPVNLF QVDAAVASNA LLARTLWLQG DAAKAMATAT RAVALAEAGG NALSLCFALF GTCPVALWSG ELELARKWVR MLLDEAQRRG LAYWHQWAHC YALGLQARTA DDRDRHVRRV AQQLDDFDAP RKEMLVTFCA DWIDDETIAR AGAGHGQWSA AETWRAAGQR CEQRGLDDEA EAFYLRAIGT ARQQGALGWE FRAAISASQL WVRRGKAKDA LDLLDEVCAR AAPRGEHPGL AQARALRSAL SRN
|
| |