Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_3928 |
Symbol | |
ID | 7970357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 4166799 |
End bp | 4168577 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644794514 |
Product | putative bifunctional OHCU decarboxylase/allantoate amidohydrolase |
Protein accession | YP_002945808 |
Protein GI | 239816898 |
COG category | [S] Function unknown |
COG ID | [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family [TIGR03164] OHCU decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.744958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGA CCCTCGACCA ACTGAATGCC GCCGCGCCGG GCGATGCCGT GGCGCTGCTC GACGGCACCT ACGAACATTC GCCGTGGATC GCCGAGCGCG CGCTCGCGGC GCGCCCGTTC CGCTCGCTCG CGCACCTGAA GCATGCGCTC GTGCAGGCGG TGGCCGCGGC CTCGACCGAC GAAAAGCTGG GCCTGATCCG TGCCCACCCC GAGCTCGCGG GCAAGGCCAT GGTCAGCAAG ACGCTCACCG CCGAGTCGAC CCATGAGCAA GGCAAGGCCG GCCTCACCGA CTGCACGCCC GAAGAGTTCG CGAAGATCCA GCAGCTCAAT GCCGACTACA ACGCGAAGTT CGGCTTCCCG TTCATCCTGG CGGTGCGCGG GCCGCGCGGC ACCGGGCTTT CCAAGCGCGA GATCATCGAC ACCTTCGAGC GCCGGCTGCA CCATCACCCG GACTTCGAGC TGGGCGAGGC GCTGCGCAAC ATCCACCGCA TCGCCGAGAT CCGGCTCGAC GACAAGTTCG GCGCCGACGT CTCGCTCGGC AACGACGTCT GGGATTGGCA CGAAGCGCTC TCGGCGCACA CCGACCCGGG CTACGCCGAG AAAGGCCAGC TCACCGTCAC CTACCTGACC GATGCGCACC GCGCCTGCGC CGCGCAAATT TCCGGCCTGA TGCGCGACTG CGGCTTCGAC TCGGTGCACA TCGATGCGGT CGGCAACGTG GTCGGCCGCT ACGAAGGCAG CACGCCCAAT GCGAAGGCTT TGCTCACCGG CTCGCACTAC GACACCGTGC GCAACGGCGG CAAGTACGAC GGCCGCCTGG GCATCTTCGT GGCCGTAGCC TGCGTGCGCG AACTCAAGCG CCAGGGCCGG CGCCTGCCGT TCGCGTTCGA GGTGGTGGGC TTTGCCGAAG AGGAAGGCCA GCGCTACAAG GCCACCTTCC TGGGATCGGG CGCGCTCATC GGCCACTTCG ACCAGCGCTG GCTCGACCAG AAGGATGCCG ACGGCATCAC GATGCGCGAG GCCATGCGGC ACGCGGGCCT GAAGGAAGAA GACATTCCCA AGATCCAGCG CGACCCGGCG CGCTACCTCG GCTTCGTCGA GGTGCACATC GAGCAGGGGC CGGTGCTCAC CGAGCTCGAC ATTCCCCTGG GCATCGTCAC CTCCATCAAC GGCGGCGTGC GTTACGTGGG CGAGATGATC GGCATGGCCA GCCATGCCGG CACCACGCCG ATGGGCCGGC GCCGCGACGC CGCCGCCGCG GTGGCCGAGC TGATCCTCTT CGCCGAGCAG CGCGCGGCAA AGGACGGCGA CTCCGTCGCG ACCGTGGGCA TGCTCGAAGT GCCGAGCGGC TCGATCAACG TGGTGCCCGG GCGCTGCAAG TTCAGCCTGG ACATCCGCGC GCCCAACGAT CCGCAGCGCG ACGCCGTGGT GCGCGACGTG CTGGCCGCGC TGCAGGAGAT TGCCGACCGC CGCGGCGTGC GCTTCGTCAT TGAGGAGGCC ATGCGCGCGG CCGCCGCGCC CAGCGCGCCC GCATGGCAGC AGCGCTGGGA AAAGGCGGTC GAATCGCTCG GCGTGCCGCT CTTTCGCATG CCCAGTGGCG CCGGCCACGA CGCGATGAAG CTGCACGAGG TGATGCCGCA GGCCATGCTC TTCGTGCGCG GCATCAACTC GGGCATCAGC CACAACCCGC TCGAATCGAG CACCAACGAC GACATTCAAT TGGCTGTGCA GGCCTTCCAG CACCTGCTGG ACAGCCTCGC CGCCGAACAA GCCCACTGA
|
Protein sequence | MSLTLDQLNA AAPGDAVALL DGTYEHSPWI AERALAARPF RSLAHLKHAL VQAVAAASTD EKLGLIRAHP ELAGKAMVSK TLTAESTHEQ GKAGLTDCTP EEFAKIQQLN ADYNAKFGFP FILAVRGPRG TGLSKREIID TFERRLHHHP DFELGEALRN IHRIAEIRLD DKFGADVSLG NDVWDWHEAL SAHTDPGYAE KGQLTVTYLT DAHRACAAQI SGLMRDCGFD SVHIDAVGNV VGRYEGSTPN AKALLTGSHY DTVRNGGKYD GRLGIFVAVA CVRELKRQGR RLPFAFEVVG FAEEEGQRYK ATFLGSGALI GHFDQRWLDQ KDADGITMRE AMRHAGLKEE DIPKIQRDPA RYLGFVEVHI EQGPVLTELD IPLGIVTSIN GGVRYVGEMI GMASHAGTTP MGRRRDAAAA VAELILFAEQ RAAKDGDSVA TVGMLEVPSG SINVVPGRCK FSLDIRAPND PQRDAVVRDV LAALQEIADR RGVRFVIEEA MRAAAAPSAP AWQQRWEKAV ESLGVPLFRM PSGAGHDAMK LHEVMPQAML FVRGINSGIS HNPLESSTND DIQLAVQAFQ HLLDSLAAEQ AH
|
| |